Abstract
Recently, mining sequential patterns, especially closed sequential patterns and generator patterns, has attracted much attention from both academic and industrial communities. In recent years, incremental mining of all sequential patterns (all closed sequential patterns) has been widely studied. However, to our best knowledge, there has not been any study for incremental mining of sequence generators. In this paper, by carefully examining the existing expansion strategies for mining sequential databases, we design a GenTree structure to keep track of the relevant mining information, and propose an efficient algorithm, IncGen, for incremental generator mining. We have conducted thorough experiment evaluation and the experimental results show that the IncGen algorithm outperforms state-of-the-art generator-mining method FEAT significantly.
This work was supported in part by National Natural Science Foundation of China under grant No. 60833003 and 60873171, and the Program of State Education Ministry of China for New Century Excellent Talents in University under Grant No. NCET-07-0491.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Agrawal, R., Srikant, R.: Mining sequential patterns. In: ICDE, pp. 3–14 (1995)
She, R., Chen, F., Wang, K., Ester, M., Gardy, J.L., Brinkman, F.S.L.: Frequent-subsequence-based prediction of outer membrane proteins. In: KDD, pp. 436–445 (2003)
Sun, G., Liu, X., Cong, G., Zhou, M., Xiong, Z., Lee, J., Lin, C.Y.: Detecting Erroneous Sentences using Automatically Mined Sequential Patterns. ACL, 81–88 (2007)
Jindal, N., Liu, B.: Identifying comparative sentences in text documents. In: SIGIR, pp. 244–251 (2006)
Chen, J., Cook, T.: Mining contiguous sequential patterns from web logs. In: WWW, pp. 177–1178 (2007)
Srikant, R., Agrawal, R.: Mining Sequential Patterns: Generalizations and Performance Improvements. In: EDBT (1996)
Zaki, M.J.: SPADE: An Efficient Algorithm for Mining Frequent Sequences. ML, 31–60 (2001)
Pei, J., Han, J., Mortazavi-Asl, B., Pinto, H., Chen, Q., Dayal, U., Hsu, M.C.: PrefixSpan: Mining Sequential Patterns Efficiently by Prefix-Projected Pattern. In: ICDE (2001)
Ayres, J., Flannick, J., Gehrke, J., Yiu, T.: Sequential PAttern mining using a bitmap representation. In: KDD, pp. 429–435 (2002)
Yan, X., Han, J., Afshar, R.: CloSpan: Mining closed sequential patterns in large datasets. In: SDM, pp. 166–177 (2003)
Wang, J., Han, J.: BIDE: efficient mining of frequent closed sequences. In: ICDE, pp. 79–90 (2004)
Gao, C., Wang, J., He, Y., Zhou, L.: Efficient mining of frequent sequence generators. In: WWW, pp. 1051–1052 (2008)
Lo, D., Khoo, S.C., Li, J.: Mining and ranking generators of sequential patterns. In: SDM, pp. 553–564 (2008)
Cheng, H., Yan, X., Han, J.: IncSpan: incremental mining of sequential patterns in large database. In: KDD, pp. 527–532 (2004)
Nguyen, S.N., Sun, X., Orlowska, M.E.: Improvements of incSpan: Incremental mining of sequential patterns in large database. In: Ho, T.-B., Cheung, D., Liu, H. (eds.) PAKDD 2005. LNCS (LNAI), vol. 3518, pp. 442–451. Springer, Heidelberg (2005)
Kao, B., Zhang, M., Yip, C.L., Cheung, D.W., Fayyad, U.: Efficient algorithms for mining and incremental update of maximal frequent sequences. In: DMKD, pp. 87–116 (2005)
Chang, L., Wang, T., Yang, D., Luan, H., Tang, S.: Efficient algorithms for incremental maintenance of closed sequential patterns in large databases. In: DKE, pp. 68–106 (2009)
Ezeife, C.I., Lu, Y.: Mining web log sequential patterns with position coded pre-order linked wap-tree. In: DMKD, pp. 5–38 (2005)
Masseglia, F., Poncelet, P., Teisseire, M.: Incremental mining of sequential patterns in large databases. In: DKE, pp. 97–121 (2003)
Lin, M.Y., Lee, S.Y.: Incremental update on sequential patterns in large databases by implicit merging and efficient counting. In: IS, pp. 385–404 (2004)
Parthasarathy, S., Zaki, M.J., Ogihara, M., Dwarkadas, S.: Incremental and interactive sequence mining. In: CIKM, pp. 251–258 (1999)
Lin, M.Y., Hsueh, S.C., Chan, C.C.: Incremental Discovery of Sequential Patterns Using a Backward Mining Approach. In: CSE, pp. 64–70 (2009)
Ezeife, C.I., Liu, Y.: Fast incremental mining of web sequential patterns with PLWAP tree. In: DMKD, pp. 376–416 (2009)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
He, Y., Wang, J., Zhou, L. (2011). Efficient Incremental Mining of Frequent Sequence Generators. In: Yu, J.X., Kim, M.H., Unland, R. (eds) Database Systems for Advanced Applications. DASFAA 2011. Lecture Notes in Computer Science, vol 6587. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-20149-3_14
Download citation
DOI: https://doi.org/10.1007/978-3-642-20149-3_14
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-20148-6
Online ISBN: 978-3-642-20149-3
eBook Packages: Computer ScienceComputer Science (R0)