skip to main content
10.1145/1014052.1014114acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
Article

IncSpan: incremental mining of sequential patterns in large database

Published: 22 August 2004 Publication History

Abstract

Many real life sequence databases grow incrementally. It is undesirable to mine sequential patterns from scratch each time when a small set of sequences grow, or when some new sequences are added into the database. Incremental algorithm should be developed for sequential pattern mining so that mining can be adapted to incremental database updates. However, it is nontrivial to mine sequential patterns incrementally, especially when the existing sequences grow incrementally because such growth may lead to the generation of many new patterns due to the interactions of the growing subsequences with the original ones. In this study, we develop an efficient algorithm, IncSpan, for incremental mining of sequential patterns, by exploring some interesting properties. Our performance study shows that IncSpan outperforms some previously proposed incremental algorithms as well as a non-incremental one with a wide margin.

References

[1]
R. Agrawal and R. Srikant. Mining sequential patterns. In Proc. 1995 Int. Conf. Data Engineering (ICDE '95), pages 3--14, March 1995.
[2]
J. Ayres, J.E. Gehrke, T. Yiu, and J. Flannik. Sequential pattern mining using bitmaps. In Proc. 2002 ACM SIGKDD Int. Conf. Knowledge Discovery in Databases (KDD '02), July 2002.
[3]
D. Cheung, J. Han, V. Ng, and C. Wong. Maintenance of discovered association rules in large databases: An incremental update technique. In Proc. of the 12th Int. Conf. on Data Engineering (ICDE '96), March 1996.
[4]
M. Garofalakis, R. Rastogi, and K. Shim. SPIRIT: Sequential pattern mining with regular expression constraints. In Proc. 1999 Int. Conf. Very Large Data Bases (VLDB '99), pages 223--234, Sept 1999.
[5]
H. Mannila, H. Toivonen, and A.I. Verkamo. Discovering frequent episodes in sequences. In Proc. 1995 Int. Conf. Knowledge Discovery and Data Mining (KDD '95), pages 210--215, Aug 1995.
[6]
F. Masseglia, P. Poncelet, and M. Teisseire. Incremental mining of sequential patterns in large databases. Data Knowl. Eng., 46(1): 97--121, 2003.
[7]
S. Parthasarathy, M. Zaki, M. Ogihara, and S. Dwarkadas. Incremental and interactive sequence mining. In Proc. of the 8th Int. Conf. on Information and Knowledge Management (CIKM '99), Nov 1999.
[8]
J. Pei, J. Han, B. Mortazavi-Asl, H. Pinto, Q. Chen, U. Dayal, and M.-C. Hsu. PrefixSpan: Mining sequential patterns efficiently by prefix-projected pattern growth. In Proc. 2001 Int. Conf. Data Engineering (ICDE '01), pages 215--224, April 2001.
[9]
A. Savasere, E. Omiecinski, and S. Navathe. An efficient algorithm for mining association rules in large databases. In Proc. 1995 Int. Conf. Very Large Data Bases (VLDB '95), Sept 1995.
[10]
R. Srikant and R. Agrawal. Mining sequential patterns: Generalizations and performance improvements. In Proc. of the 5th Int. Conf. on Extending Database Technology (EDBT '96), Mar 1996.
[11]
J. Wang and J. Han. Bide: Efficient mining of frequent losed sequences. In Proc. of 2004 Int. Conf. on Data Engineering (ICDE '04), March 2004.
[12]
X. Yan, J. Han,and R. Afshar. CloSpan: Mining closed sequential patterns in large datasets. In Proc. 2003 SIAM Int. Conf. on Data Mining (SDM '03), May 2003.
[13]
M. Zaki. SPADE: An efficient algorithm for mining frequent sequences. Machine Learning, 40:31--60, 2001.
[14]
M. Zhang, B. Kao, D. Cheung, and C. Yip. Efficient algorithms for incremental updates of frequent sequences. In Proc. of Pacific-Asia Conf. on Knowledge Discovery and Data Mining (PAKDD '02), May 2002.

Cited By

View all
  • (2024)A new tree-based approach to mine sequential patternsExpert Systems with Applications10.1016/j.eswa.2023.122754242(122754)Online publication date: May-2024
  • (2023)An Approach for Incremental Mining of Clickstream Patterns as a Service ApplicationIEEE Transactions on Services Computing10.1109/TSC.2023.329494516:6(3892-3905)Online publication date: Nov-2023
  • (2023)Incremental Targeted Mining in Sequences2023 IEEE 10th International Conference on Data Science and Advanced Analytics (DSAA)10.1109/DSAA60987.2023.10302500(1-10)Online publication date: 9-Oct-2023
  • Show More Cited By

Index Terms

  1. IncSpan: incremental mining of sequential patterns in large database

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    KDD '04: Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
    August 2004
    874 pages
    ISBN:1581138881
    DOI:10.1145/1014052
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 22 August 2004

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. buffering pattern
    2. incremental mining
    3. reverse pattern matching
    4. shared projection

    Qualifiers

    • Article

    Conference

    KDD04

    Acceptance Rates

    Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

    Upcoming Conference

    KDD '25

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)14
    • Downloads (Last 6 weeks)3
    Reflects downloads up to 02 Mar 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)A new tree-based approach to mine sequential patternsExpert Systems with Applications10.1016/j.eswa.2023.122754242(122754)Online publication date: May-2024
    • (2023)An Approach for Incremental Mining of Clickstream Patterns as a Service ApplicationIEEE Transactions on Services Computing10.1109/TSC.2023.329494516:6(3892-3905)Online publication date: Nov-2023
    • (2023)Incremental Targeted Mining in Sequences2023 IEEE 10th International Conference on Data Science and Advanced Analytics (DSAA)10.1109/DSAA60987.2023.10302500(1-10)Online publication date: 9-Oct-2023
    • (2022)Pattern on demand in transactional distributed databasesInformation Systems10.1016/j.is.2021.101908104:COnline publication date: 1-Feb-2022
    • (2022)Incremental sequential patterns for multivariate temporal association rules miningExpert Systems with Applications10.1016/j.eswa.2022.118020207(118020)Online publication date: Nov-2022
    • (2022)Forecasting the User Prediction from Weblogs Using Improved IncSpan AlgorithmSustainable Communication Networks and Application10.1007/978-981-16-6605-6_58(767-777)Online publication date: 17-Jan-2022
    • (2021)Efficient Methods for Clickstream Pattern Mining on Incremental DatabasesIEEE Access10.1109/ACCESS.2021.31315779(161305-161317)Online publication date: 2021
    • (2021)An incremental framework to extract coverage patterns for dynamic databasesInternational Journal of Data Science and Analytics10.1007/s41060-021-00262-4Online publication date: 25-May-2021
    • (2021)New approaches for mining regular high utility sequential patternsApplied Intelligence10.1007/s10489-021-02536-7Online publication date: 10-Jul-2021
    • (2021)Mining Sequential Patterns in Uncertain Databases Using Hierarchical Index StructureAdvances in Knowledge Discovery and Data Mining10.1007/978-3-030-75765-6_3(29-41)Online publication date: 8-May-2021
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media