skip to main content
10.1145/1244002.1244099acmconferencesArticle/Chapter ViewAbstractPublication PagessacConference Proceedingsconference-collections
Article

FAT-miner: mining frequent attribute trees

Published:11 March 2007Publication History

ABSTRACT

Data that can conceptually be viewed as tree structures abounds in domains such as bio-informatics, web logs, XML databases and multi-relational databases. Besides structural information such as nodes and edges, tree structured data also often contains attributes, that represent properties of nodes. Current algorithms for finding frequent patterns in structured data, do not take these attributes into account, and hence potentially useful information is neglected. We present FAT-miner, an algorithm for frequent pattern discovery in tree structured data with attributes. To illustrate the applicability of FAT-miner, we use it to explore the properties of good and bad loans in a well-known multi-relational financial database.

References

  1. R. Agrawal and R. Srikant. Fast algorithms for mining association rules. In Proc. 20th Int. Conf. Very Large Data Bases, VLDB, pages 487--499, 1994. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. T. Asai, K. Abe, S. Kawasoe, H. Arimura, H. Sakamoto, and S. Arikawa. Efficient substructure discovery from large semi-structured data. In Proceedings of the Second SIAM International Conference on Data Mining, 2002.Google ScholarGoogle ScholarCross RefCross Ref
  3. R. Bayardo. Efficiently mining long patterns from databases. In A. T. Laura and M. Haas, editors, SIGMOD 1998, Proceedings ACM SIGMOD International Conference on Management of Data, pages 85--93, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. P. Berka. Guide to the financial data set. http://lisp.vse.cz/challenge/. Workshop notes on Discovery Challenge PKDD2000.Google ScholarGoogle Scholar
  5. Y. Chi, R. Muntz, S. Nijssen, and J. Kok. Frequent subtree mining - an overview. Fundamenta Informaticae., 66(1--2):161--198, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. L. Dehaspe and L. De Raedt. Mining association rules in multiple relations. In Proceedings of the 7th International Workshop on Inductive Logic Programming, volume 1297, pages 125--132. Springer-Verlag, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. L. Denoyer and P. Gallinari. The Wikipedia XML Corpus. SIGIR Forum, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. A. Inokuchi, T. Washio, and H. Motoda. An apriori-based algorithm for mining frequent substructures from graph data. In D. A. Zighed, H. J. Komorowski, and J. M. Zytkow, editors, PKDD 2000, pages 13--23, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. J. De Knijf. FAT-miner: Mining frequent attribute trees. Technical Report UU-CS-2006-053, Institute of Information and Computing Sciences, Utrecht University, 2006.Google ScholarGoogle Scholar
  10. A. Knobbe. Multi-Relational Data Mining. PhD thesis, Universiteit Utrecht, 2004.Google ScholarGoogle Scholar
  11. E. Ng, A. Fu, and K. Wang. Mining association rules from stars. In ICDM 2002, pages 322--329, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. K. Wang and H. Liu. Discovering structural association of semistructured data. Knowledge and Data Engineering, 12(2):353--371, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. X. Yan and J. Han. gspan: Graph-based substructure pattern mining. In ICDM 2002, pages 721--724, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. M. J. Zaki. Efficiently mining frequent trees in a forest. In KDD '02, pages 71--80, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. FAT-miner: mining frequent attribute trees

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      SAC '07: Proceedings of the 2007 ACM symposium on Applied computing
      March 2007
      1688 pages
      ISBN:1595934804
      DOI:10.1145/1244002

      Copyright © 2007 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 11 March 2007

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • Article

      Acceptance Rates

      Overall Acceptance Rate1,650of6,669submissions,25%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader