Abstract
Existing approaches for multi-dimensional frequent patterns mining rely on the construction of data cube. Since the space of a data cube grows explosively as dimensionality or cardinality grows, it is too costly to materialize a full data cube, esp. when dimensionality or cardinality is large. In this paper, an efficient method is proposed to mine multi-dimensional frequent patterns without data cube construction. The main contributions include: (1) formally proposing the concept of multi-dimensional frequent pattern and its pruning strategy based on Extended Apriori Property, (2) proposing a novel structure called Multi-dimensional Index Tree (MDIT) and a MDIT-based multi-dimensional frequent patterns mining method (MDIT-Mining), and (3) conducting extensive experiments which show that the space consuming of MDIT is more than 4 orders of multitudes smaller than that of data cube along with the increasing of dimensionality or cardinality at most cases.
This work was supported by National Science Foundation of China (60473071), Specialized Research Fund for Doctoral Program by the Ministry of Education (SRFDP 20020610007), the grant from the State Administration of Traditional Chinese Medicine (SATCM 2003JP40) and National Science Foundation of China (90409007). Chuan Li, Tianqing Zhang, Yintian Liu, Qihong Liu, and Mingfang Zhu are Ph. D Candidates at DB&KE Lab, Sichuan University. YU Zhonghua is a professor at Sichuan University. JIANG Yongguang is a professor at Chengdu University of TCM. And TANG Changjie is the associate author.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Data Mining: Concepts and Techniques. Jiawei Han, Micheline Chamber, Morgan Kaufmann, Hardcover, ISBN 1558604898
Kamber, J.H., Chiang, J.Y.: Metarule-guided mining of multi-dimensional association rules using data cubes. In: KKD 1997 (1997)
Baeza-Yates, R., et al.: Modern Information Retrieval. Addison-Wesley, Reading (1999)
Piatetski-Shapiro, G.: Discovery, analysis, and presentation of strong rules. In: Knowledge Discovery in Databases, pp. 229–248 (1991)
Agrawal, R., et al.: Mining association rules between sets of items in large databases. In: Proc. of the ACM SIGMOD 1993 (1993)
Han, J., Pei, J., Yin, Y.: Mining Frequent Patterns without Candidate Generation. SIGMOD 1–12 (2000)
Barbara, D., Sullivan, M.: Quasi-cubes: Exploiting approximation in multidimensional databases. SIGMOD Record 26, 12–17 (1997)
Agarwal, S., et al.: On the computation of multidimensional aggregates. In: Proc. 22nd VLDB, Mumbai, pp. 506–521 (September 1996)
Sismanis, Y., Roussopoulos, N.: The dwarf data cube eliminates the high dimensionality curse. TR-CS4552, University of Maryland (2003)
Li, X., Han, J., Gonzalez, H.: High-Dimensional OLAP: A Minimal Cubing Approach. In: VLDB (2004)
Han, J., Cai, Y., Cercone, N.: Knowledge discovery in databases: An attribute-oriented approach. In: VLDB 1995, pp. 547–559 (1995)
Cai, Y.D., Cercone, N., Han, J.: Attribute-oriented induction in relational databases. In: Knowledge Discovery in Databases (1991)
Huairen, P.: The First Volume of Great Formula Dictionary of TCM. People’s Medical Publishing House (December 1993)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Li, C. et al. (2006). Mining Multi-dimensional Frequent Patterns Without Data Cube Construction. In: Yang, Q., Webb, G. (eds) PRICAI 2006: Trends in Artificial Intelligence. PRICAI 2006. Lecture Notes in Computer Science(), vol 4099. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-36668-3_28
Download citation
DOI: https://doi.org/10.1007/978-3-540-36668-3_28
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-36667-6
Online ISBN: 978-3-540-36668-3
eBook Packages: Computer ScienceComputer Science (R0)