Mining Multi-dimensional Frequent Patterns Without Data Cube Construction

Li, Chuan; Tang, Changjie; Yu, Zhonghua; Liu, Yintian; Zhang, Tianqing; Liu, Qihong; Zhu, Mingfang; Jiang, Yongguang

doi:10.1007/978-3-540-36668-3_28


Part of the book series: Lecture Notes in Computer Science (LNAI, volume 4099)

Included in the following conference series:

Pacific Rim International Conference on Artificial Intelligence


Abstract

Existing approaches to multi-dimensional frequent pattern mining rely on constructing a data cube. Since the space occupied by a data cube grows explosively as dimensionality or cardinality grows, materializing a full data cube is too costly, especially when dimensionality or cardinality is large. In this paper, an efficient method is proposed to mine multi-dimensional frequent patterns without data cube construction. The main contributions include: (1) formally proposing the concept of multi-dimensional frequent pattern and its pruning strategy based on the Extended Apriori Property, (2) proposing a novel structure called Multi-dimensional Index Tree (MDIT) and an MDIT-based multi-dimensional frequent pattern mining method (MDIT-Mining), and (3) conducting extensive experiments which show that, in most cases, the space consumption of MDIT is more than 4 orders of magnitude smaller than that of a data cube as dimensionality or cardinality increases.
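The two ideas in the abstract, cube size growth and Apriori-style pruning, can be illustrated with a minimal sketch. It is not the MDIT structure or the MDIT-Mining algorithm, neither of which is specified in this preview. It assumes a pattern is a set of (dimension, value) pairs with at most one value per dimension, and it uses the classic Apriori property (every subset of a frequent pattern is frequent) rather than the paper's Extended Apriori Property. The function names and the row format (one dict per record) are hypothetical.

```python
from itertools import combinations
from math import prod


def full_cube_cells(cardinalities):
    """Cells in a fully materialized cube.

    Each dimension takes one of its c_i values or the aggregate '*',
    so the total is the product of (c_i + 1).
    """
    return prod(c + 1 for c in cardinalities)


def mine_patterns(rows, min_support):
    """Level-wise search over sets of (dimension, value) pairs.

    rows: list of dicts mapping dimension name -> value (assumed format).
    Returns a dict mapping each frequent pattern (a frozenset) to its count.
    """
    # Level 1: count every single (dimension, value) pair.
    counts = {}
    for row in rows:
        for item in row.items():
            key = frozenset([item])
            counts[key] = counts.get(key, 0) + 1
    frequent = {p: n for p, n in counts.items() if n >= min_support}
    result = dict(frequent)

    k = 1
    while frequent:
        k += 1
        candidates = set()
        for a, b in combinations(list(frequent), 2):
            cand = a | b
            if len(cand) != k:
                continue
            # A pattern holds at most one value per dimension.
            if len({dim for dim, _ in cand}) != k:
                continue
            # Apriori pruning: every (k-1)-subset must already be frequent.
            if all(frozenset(s) in frequent
                   for s in combinations(cand, k - 1)):
                candidates.add(cand)

        # Count the surviving candidates with one scan of the rows.
        counts = {c: 0 for c in candidates}
        for row in rows:
            items = set(row.items())
            for c in candidates:
                if c <= items:
                    counts[c] += 1
        frequent = {p: n for p, n in counts.items() if n >= min_support}
        result.update(frequent)
    return result
```

Under these assumptions, `full_cube_cells([10] * 10)` evaluates to 11**10, about 2.6e10 cells for only ten dimensions of cardinality ten, which is the growth the abstract refers to. `mine_patterns` scans the rows once per level and never stores cells for infrequent combinations.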

This work was supported by the National Science Foundation of China (60473071), the Specialized Research Fund for Doctoral Program by the Ministry of Education (SRFDP 20020610007), a grant from the State Administration of Traditional Chinese Medicine (SATCM 2003JP40), and the National Science Foundation of China (90409007). Chuan Li, Tianqing Zhang, Yintian Liu, Qihong Liu, and Mingfang Zhu are Ph.D. candidates at the DB&KE Lab, Sichuan University. Zhonghua Yu is a professor at Sichuan University. Yongguang Jiang is a professor at Chengdu University of TCM. Changjie Tang is the associate author.



Author information

Authors and Affiliations

The Data Base and Knowledge Engineering Lab (DBKE), Computer School of Sichuan University,
Chuan Li, Changjie Tang, Zhonghua Yu, Yintian Liu, Tianqing Zhang, Qihong Liu & Mingfang Zhu
Chengdu University of Traditional Chinese Medicine,
Yongguang Jiang


Editor information

Editors and Affiliations

The Hong Kong University of Science and Technology, Hong Kong
Qiang Yang
Clayton School of Information Technology, Monash University, P.O. Box, Australia
Geoff Webb


Cite this paper

Li, C. et al. (2006). Mining Multi-dimensional Frequent Patterns Without Data Cube Construction. In: Yang, Q., Webb, G. (eds) PRICAI 2006: Trends in Artificial Intelligence. PRICAI 2006. Lecture Notes in Computer Science, vol 4099. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-36668-3_28


DOI: https://doi.org/10.1007/978-3-540-36668-3_28
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-36667-6
Online ISBN: 978-3-540-36668-3
eBook Packages: Computer Science; Computer Science (R0)
