Abstract
Mining frequent tree patterns is an important research problems with broad applications in bioinformatics, digital library, e-commerce, and so on. Previous studies highly suggested that pattern-growth methods are efficient in frequent pattern mining. In this paper, we systematically develop the pattern growth methods for mining frequent tree patterns. Two algorithms, Chopper and XSpanner, are devised. An extensive performance study shows that the two newly developed algorithms outperform TreeMinerVÂ [13], one of the fastest methods proposed before, in mining large databases. Furthermore, algorithm XSpanner is substantially faster than Chopper in many cases.
This research is supported in part by the Key Program of National Natural Science Foundation of China (No. 69933010), China National 863 High-Tech Projects (No. 2002AA4Z3430 and 2002AA231041), and US NSF grant IIS-0308001.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Agarwal, R.C., et al.: A tree projection algorithm for generation of frequent item sets. J. of Parallel and Distributed Computing 61(3), 350–371 (2001)
Asai, T., et al.: Efficient substructure discovery from large semi-structured data. In: Proc. 2002 SIAM Int. Conf. Data Mining, Arlington, VA (2002)
Cook, D., Holder, L.: Substructure discovery using minimal description length and background knowledge. J. of Artificial Intelligence Research 1, 231–255 (1994)
Dehaspe, L., et al.: Finding frequent substructures in chemical compounds. In: KDD 1998, New York, NY (1998)
Han, J., et al.: Mining frequent patterns without candidate generation. In: SIGMOD 2000, Dallas, TX (2000)
Kuramochi, M., Karypis, G.: Frequent subgraph discovery. In: ICDM 2001, San Jose, CA (2001)
Miyahara, T., et al.: Discovery of frequent tree structured patterns in semistructured web documents. In: Cheung, D., Williams, G.J., Li, Q. (eds.) PAKDD 2001. LNCS (LNAI), vol. 2035, p. 47. Springer, Heidelberg (2001)
Pei, J., et al.: H-Mine: Hyper-structure mining of frequent patterns in large databases. In: ICDM 2001, San Jose, CA (2001)
Pei, J., et al.: PrefixSpan: Mining sequential patterns efficiently by prefix-projected pattern growth. In: ICDE 2001, Heidelberg, Germany (2001)
Srikant, R., Agrawal, R.: Mining generalized association rules. In: VLDB 1995, Zurich, Switzerland (1995)
Wang, J.T.L., et al.: Automated discovery of active motifs in multiple RNA secondary structures. In: KDD 1996, Portland, Oregon (1996)
Wang, K., Liu, H.: Schema discovery for semistructured data. In: KDD 1997, Newport Beach, CA (1997)
Zaki, M.J.: Efficiently mining frequent trees in a forest. In: KDD 2002, Edmonton, Alberta, Canada (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wang, C., Hong, M., Pei, J., Zhou, H., Wang, W., Shi, B. (2004). Efficient Pattern-Growth Methods for Frequent Tree Pattern Mining. In: Dai, H., Srikant, R., Zhang, C. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2004. Lecture Notes in Computer Science(), vol 3056. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24775-3_54
Download citation
DOI: https://doi.org/10.1007/978-3-540-24775-3_54
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22064-0
Online ISBN: 978-3-540-24775-3
eBook Packages: Springer Book Archive