Article

FAT-miner: mining frequent attribute trees

Author:
Jeroen De Knijf

Utrecht University, Utrecht, The Netherlands

Utrecht University, Utrecht, The Netherlands
View Profile

SAC '07: Proceedings of the 2007 ACM symposium on Applied computingMarch 2007Pages 417–422https://doi.org/10.1145/1244002.1244099

Published:11 March 2007Publication History

SAC '07: Proceedings of the 2007 ACM symposium on Applied computing

Pages 417–422

ABSTRACT

Data that can conceptually be viewed as tree structures abounds in domains such as bio-informatics, web logs, XML databases and multi-relational databases. Besides structural information such as nodes and edges, tree structured data also often contains attributes, that represent properties of nodes. Current algorithms for finding frequent patterns in structured data, do not take these attributes into account, and hence potentially useful information is neglected. We present FAT-miner, an algorithm for frequent pattern discovery in tree structured data with attributes. To illustrate the applicability of FAT-miner, we use it to explore the properties of good and bad loans in a well-known multi-relational financial database.

References

R. Agrawal and R. Srikant. Fast algorithms for mining association rules. In Proc. 20th Int. Conf. Very Large Data Bases, VLDB, pages 487--499, 1994. Google ScholarDigital Library
T. Asai, K. Abe, S. Kawasoe, H. Arimura, H. Sakamoto, and S. Arikawa. Efficient substructure discovery from large semi-structured data. In Proceedings of the Second SIAM International Conference on Data Mining, 2002.Google ScholarCross Ref
R. Bayardo. Efficiently mining long patterns from databases. In A. T. Laura and M. Haas, editors, SIGMOD 1998, Proceedings ACM SIGMOD International Conference on Management of Data, pages 85--93, 1998. Google ScholarDigital Library
P. Berka. Guide to the financial data set. http://lisp.vse.cz/challenge/. Workshop notes on Discovery Challenge PKDD2000.Google Scholar
Y. Chi, R. Muntz, S. Nijssen, and J. Kok. Frequent subtree mining - an overview. Fundamenta Informaticae., 66(1--2):161--198, 2005. Google ScholarDigital Library
L. Dehaspe and L. De Raedt. Mining association rules in multiple relations. In Proceedings of the 7th International Workshop on Inductive Logic Programming, volume 1297, pages 125--132. Springer-Verlag, 1997. Google ScholarDigital Library
L. Denoyer and P. Gallinari. The Wikipedia XML Corpus. SIGIR Forum, 2006. Google ScholarDigital Library
A. Inokuchi, T. Washio, and H. Motoda. An apriori-based algorithm for mining frequent substructures from graph data. In D. A. Zighed, H. J. Komorowski, and J. M. Zytkow, editors, PKDD 2000, pages 13--23, 2000. Google ScholarDigital Library
J. De Knijf. FAT-miner: Mining frequent attribute trees. Technical Report UU-CS-2006-053, Institute of Information and Computing Sciences, Utrecht University, 2006.Google Scholar
A. Knobbe. Multi-Relational Data Mining. PhD thesis, Universiteit Utrecht, 2004.Google Scholar
E. Ng, A. Fu, and K. Wang. Mining association rules from stars. In ICDM 2002, pages 322--329, 2002. Google ScholarDigital Library
K. Wang and H. Liu. Discovering structural association of semistructured data. Knowledge and Data Engineering, 12(2):353--371, 2000. Google ScholarDigital Library
X. Yan and J. Han. gspan: Graph-based substructure pattern mining. In ICDM 2002, pages 721--724, 2002. Google ScholarDigital Library
M. J. Zaki. Efficiently mining frequent trees in a forest. In KDD '02, pages 71--80, 2002. Google ScholarDigital Library

Index Terms

FAT-miner: mining frequent attribute trees
1. Information systems
  1. Information systems applications
    1. Data mining

Recommendations

CLS-Miner: efficient and effective closed high-utility itemset mining

High-utility itemset mining (HUIM) is a popular data mining task with applications in numerous domains. However, traditional HUIM algorithms often produce a very large set of high-utility itemsets (HUIs). As a result, analyzing HUIs can be very time ...
Read More
DBV-Miner: A Dynamic Bit-Vector approach for fast mining frequent closed itemsets

Frequent closed itemsets (FCI) play an important role in pruning redundant rules fast. Therefore, a lot of algorithms for mining FCI have been developed. Algorithms based on vertical data formats have some advantages in that they require scan databases ...
Read More
HPFP-Miner: A Novel Parallel Frequent Itemset Mining Algorithm
ICNC '09: Proceedings of the 2009 Fifth International Conference on Natural Computation - Volume 03

Frequent itemset mining is a fundamental and essential issue in data mining field and can be used in many data mining tasks. Most of these mining tasks require multiple passes over the database and if the database size is large, which is usually the ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
SAC '07: Proceedings of the 2007 ACM symposium on Applied computing
March 2007
1688 pages
ISBN:1595934804
DOI:10.1145/1244002
Conference Chairs:
Yookun Cho
Seoul National University, Seoul, Korea
,
Roger L. Wainwright
University of Tulsa, Tulsa, Oklahoma
,
Hisham M. Haddad
Kennesaw State University, Kennesaw, Georgia
,
Sung Y. Shin
South Dakota State University, Brookings, South Dakota
,
Program Chair:
Yong Wan Koo
The University of Suwon, Gyeongggi-do, Korea
Copyright © 2007 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 11 March 2007
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
XML
attributes
frequent tree mining
Qualifiers
- Article
Conference

Acceptance Rates
Overall Acceptance Rate1,650of6,669submissions,25%
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 9
  Total Citations
  View Citations
- 413
  Total Downloads
- Downloads (Last 12 months)0
- Downloads (Last 6 weeks)0
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

FAT-miner: mining frequent attribute trees

SAC '07: Proceedings of the 2007 ACM symposium on Applied computing

ABSTRACT

References

Cited By

Index Terms

Recommendations

CLS-Miner: efficient and effective closed high-utility itemset mining

DBV-Miner: A Dynamic Bit-Vector approach for fast mining frequent closed itemsets

HPFP-Miner: A Novel Parallel Frequent Itemset Mining Algorithm

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

FAT-miner: mining frequent attribute trees

SAC '07: Proceedings of the 2007 ACM symposium on Applied computing

ABSTRACT

References

Cited By

Index Terms

Recommendations

CLS-Miner: efficient and effective closed high-utility itemset mining

DBV-Miner: A Dynamic Bit-Vector approach for fast mining frequent closed itemsets

HPFP-Miner: A Novel Parallel Frequent Itemset Mining Algorithm

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media