Skip to main content

Efficient Incremental Mining of Frequent Sequence Generators

  • Conference paper
  • 1309 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 6587))

Abstract

Recently, mining sequential patterns, especially closed sequential patterns and generator patterns, has attracted much attention from both academic and industrial communities. In recent years, incremental mining of all sequential patterns (all closed sequential patterns) has been widely studied. However, to our best knowledge, there has not been any study for incremental mining of sequence generators. In this paper, by carefully examining the existing expansion strategies for mining sequential databases, we design a GenTree structure to keep track of the relevant mining information, and propose an efficient algorithm, IncGen, for incremental generator mining. We have conducted thorough experiment evaluation and the experimental results show that the IncGen algorithm outperforms state-of-the-art generator-mining method FEAT significantly.

This work was supported in part by National Natural Science Foundation of China under grant No. 60833003 and 60873171, and the Program of State Education Ministry of China for New Century Excellent Talents in University under Grant No. NCET-07-0491.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Agrawal, R., Srikant, R.: Mining sequential patterns. In: ICDE, pp. 3–14 (1995)

    Google Scholar 

  2. She, R., Chen, F., Wang, K., Ester, M., Gardy, J.L., Brinkman, F.S.L.: Frequent-subsequence-based prediction of outer membrane proteins. In: KDD, pp. 436–445 (2003)

    Google Scholar 

  3. Sun, G., Liu, X., Cong, G., Zhou, M., Xiong, Z., Lee, J., Lin, C.Y.: Detecting Erroneous Sentences using Automatically Mined Sequential Patterns. ACL, 81–88 (2007)

    Google Scholar 

  4. Jindal, N., Liu, B.: Identifying comparative sentences in text documents. In: SIGIR, pp. 244–251 (2006)

    Google Scholar 

  5. Chen, J., Cook, T.: Mining contiguous sequential patterns from web logs. In: WWW, pp. 177–1178 (2007)

    Google Scholar 

  6. Srikant, R., Agrawal, R.: Mining Sequential Patterns: Generalizations and Performance Improvements. In: EDBT (1996)

    Google Scholar 

  7. Zaki, M.J.: SPADE: An Efficient Algorithm for Mining Frequent Sequences. ML, 31–60 (2001)

    Google Scholar 

  8. Pei, J., Han, J., Mortazavi-Asl, B., Pinto, H., Chen, Q., Dayal, U., Hsu, M.C.: PrefixSpan: Mining Sequential Patterns Efficiently by Prefix-Projected Pattern. In: ICDE (2001)

    Google Scholar 

  9. Ayres, J., Flannick, J., Gehrke, J., Yiu, T.: Sequential PAttern mining using a bitmap representation. In: KDD, pp. 429–435 (2002)

    Google Scholar 

  10. Yan, X., Han, J., Afshar, R.: CloSpan: Mining closed sequential patterns in large datasets. In: SDM, pp. 166–177 (2003)

    Google Scholar 

  11. Wang, J., Han, J.: BIDE: efficient mining of frequent closed sequences. In: ICDE, pp. 79–90 (2004)

    Google Scholar 

  12. Gao, C., Wang, J., He, Y., Zhou, L.: Efficient mining of frequent sequence generators. In: WWW, pp. 1051–1052 (2008)

    Google Scholar 

  13. Lo, D., Khoo, S.C., Li, J.: Mining and ranking generators of sequential patterns. In: SDM, pp. 553–564 (2008)

    Google Scholar 

  14. Cheng, H., Yan, X., Han, J.: IncSpan: incremental mining of sequential patterns in large database. In: KDD, pp. 527–532 (2004)

    Google Scholar 

  15. Nguyen, S.N., Sun, X., Orlowska, M.E.: Improvements of incSpan: Incremental mining of sequential patterns in large database. In: Ho, T.-B., Cheung, D., Liu, H. (eds.) PAKDD 2005. LNCS (LNAI), vol. 3518, pp. 442–451. Springer, Heidelberg (2005)

    Chapter  Google Scholar 

  16. Kao, B., Zhang, M., Yip, C.L., Cheung, D.W., Fayyad, U.: Efficient algorithms for mining and incremental update of maximal frequent sequences. In: DMKD, pp. 87–116 (2005)

    Google Scholar 

  17. Chang, L., Wang, T., Yang, D., Luan, H., Tang, S.: Efficient algorithms for incremental maintenance of closed sequential patterns in large databases. In: DKE, pp. 68–106 (2009)

    Google Scholar 

  18. Ezeife, C.I., Lu, Y.: Mining web log sequential patterns with position coded pre-order linked wap-tree. In: DMKD, pp. 5–38 (2005)

    Google Scholar 

  19. Masseglia, F., Poncelet, P., Teisseire, M.: Incremental mining of sequential patterns in large databases. In: DKE, pp. 97–121 (2003)

    Google Scholar 

  20. Lin, M.Y., Lee, S.Y.: Incremental update on sequential patterns in large databases by implicit merging and efficient counting. In: IS, pp. 385–404 (2004)

    Google Scholar 

  21. Parthasarathy, S., Zaki, M.J., Ogihara, M., Dwarkadas, S.: Incremental and interactive sequence mining. In: CIKM, pp. 251–258 (1999)

    Google Scholar 

  22. Lin, M.Y., Hsueh, S.C., Chan, C.C.: Incremental Discovery of Sequential Patterns Using a Backward Mining Approach. In: CSE, pp. 64–70 (2009)

    Google Scholar 

  23. Ezeife, C.I., Liu, Y.: Fast incremental mining of web sequential patterns with PLWAP tree. In: DMKD, pp. 376–416 (2009)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

He, Y., Wang, J., Zhou, L. (2011). Efficient Incremental Mining of Frequent Sequence Generators. In: Yu, J.X., Kim, M.H., Unland, R. (eds) Database Systems for Advanced Applications. DASFAA 2011. Lecture Notes in Computer Science, vol 6587. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-20149-3_14

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-20149-3_14

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-20148-6

  • Online ISBN: 978-3-642-20149-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics