Association Rule Discovery in Data Mining by Implementing Principal Component Analysis

Gerardo, Bobby D.; Lee, Jaewan; Ra, Inho; Byun, Sangyong

doi:10.1007/978-3-540-30583-5_6

Bobby D. Gerardo¹⁹,
Jaewan Lee¹⁹,
Inho Ra¹⁹ &
…
Sangyong Byun²⁰

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3397))

Included in the following conference series:

International Conference on AI, Simulation, and Planning in High Autonomy Systems

1435 Accesses
3 Altmetric

Abstract

This paper presents the Principal Component Analysis (PCA) which is integrated in the proposed architectural model and the utilization of apriori algorithm for association rule discovery. The scope of this study includes techniques such as the use of devised data reduction technique and the deployment of association rule algorithm in data mining to efficiently process and generate association patterns. The evaluation shows that interesting association rules were generated based on the approximated data which was the result of dimensionality reduction, thus, implied rigorous and faster computation than the usual approach. This is attributed to the PCA method which reduces the dimensionality of the original data prior to the processing. Furthermore, the proposed model had verified the premise that it could handle sparse information and suitable for data of high dimensionality as compared to other technique such as the wavelet transform.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Feature Selection by Principle Component Analysis for Mining Frequent Association Rules

A Comparative Analysis of Algorithms for Mining Frequent Itemsets

Enhanced Association Rules and Python

References

Agrawal, R., Srikant, R.: Fast Algorithms for Mining Association Rules. In: Proc. of International Conference on Very Large Databases VLDB, pp. 487–499 (1994)
Google Scholar
Han, J., Kamber, M.: Data mining concepts & techniques. Morgan Kaufmann, USA (2001)
Google Scholar
Hellerstein, J.L., Ma, S., Perng, C.S.: Discovering actionable patterns in event data. IBM Systems Journal 41(3) (2002)
Google Scholar
Multi-Dimensional Constrained Gradient Mining, ftp://fas.sfu.ca/pub/cs/theses/2001/JoyceManWingLamMSc.pdf
Agrawal, R., Imielinski, T., Swami, A.: Mining association rules between sets of items in large databases. In: Proc. of ACM SIGMOD International Conference on Mngt. of Data (1993)
Google Scholar
Chen, B., Haas, P., Scheuermann, P.: A new two-phase sampling based algorithm for discovering association rules. In: Proceedings of ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (2002)
Google Scholar
Bronnimann, H., Chen, B., Dash, M., Hass, P., Qiao, Y., Scheuermann, P.: Efficient Data-Reduction Methods for On-Line Association Rule Discovery. In: Data Mining: Next Generation Challenges & Future Directions (2004) (in press)
Google Scholar
Margaritis, D., Faloutsos, C., Thrun, S.: NetCube: A Scalable Tool for Fast Data Mining and Compression. In: 27th Conference on Very Large Databases (VLDB) Roma, Italy (September 2001)
Google Scholar
Korn, F., Labrinidis, A., Kotidis, Y., Faloutsos, C., Kaplunovich, A., Perkovic, D.: Quantifiable Data Mining Using Principal Component Analysis Technical Report, University of Maryland, College Park, Number CS-TR-3754 (February 1997)
Google Scholar
Han, E.H., Karypis, G., Kumar, V., Mobasher, B.: Clustering in a high-dimensional space using hypergraph models (1998), Available at, http://www.informatik.uni-siegen.de/~galeas/papers/general/Clustering_in_a_High-Dimensional_Space_Using_Hypergraphs_Models_%28Han1997b%29.pdf
Gerardo, B., Lee, J.W., Lee, J.S., Park, M.G., Lee, M.R.: The association rule algorithm with missing data in data mining. In: Laganá, A., Gavrilova, M.L., Kumar, V., Mun, Y., Tan, C.J.K., Gervasi, O. (eds.) ICCSA 2004. LNCS, vol. 3043, pp. 97–105. Springer, Heidelberg (2004)
Chapter Google Scholar
Principal Component Analysis, http://www.unesco.org/webworld/idams/advguide/Chapt6_2.htm

Download references

Author information

Authors and Affiliations

School of Electronic and Information Engineering, Kunsan National University, 68 Miryong-dong, Kunsan, Chonbuk, 573-701, South Korea
Bobby D. Gerardo, Jaewan Lee & Inho Ra
Faculty of Telecommunication & Computer Engineering, Cheju National University, 66 Jejudaehakno, Jeju-si, Jeju-do, 690-756, South Korea
Sangyong Byun

Authors

Bobby D. Gerardo
View author publications
You can also search for this author in PubMed Google Scholar
Jaewan Lee
View author publications
You can also search for this author in PubMed Google Scholar
Inho Ra
View author publications
You can also search for this author in PubMed Google Scholar
Sangyong Byun
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Dept. of EECS, Korea Advanced Institute of Science and Technology (KAIST), 373-1 Guseong-dong, Yuseong-gu, Daejeon, Republic of Korea
Tag Gon Kim

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Gerardo, B.D., Lee, J., Ra, I., Byun, S. (2005). Association Rule Discovery in Data Mining by Implementing Principal Component Analysis. In: Kim, T.G. (eds) Artificial Intelligence and Simulation. AIS 2004. Lecture Notes in Computer Science(), vol 3397. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30583-5_6

Download citation

DOI: https://doi.org/10.1007/978-3-540-30583-5_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-24476-9
Online ISBN: 978-3-540-30583-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics