Skip to main content

Mining Frequent Sequential Patterns under a Similarity Constraint

  • Conference paper
  • First Online:
Book cover Intelligent Data Engineering and Automated Learning — IDEAL 2002 (IDEAL 2002)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2412))

Abstract

Many practical applications are related to frequent sequential pattern mining, ranging from Web Usage Mining to Bioinformatics. To ensure an appropriate extraction cost for useful mining tasks, a key issue is to push the user-defined constraints deep inside the mining algorithms. In this paper, we study the search for frequent sequential patterns that are also similar to an user-defined reference pattern. While the effective processing of the frequency constraints is well-understood, our contribution concerns the identification of a relaxation of the similarity constraint into a convertible anti-monotone constraint. Both constraints are then used to prune the search space during a levelwise search. Preliminary experimental validations have confirmed the algorithm efficiency.

Research partially funded by the European contract cInQ IST 2000-26469.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. R. Agrawal and R. Srikant. Mining sequential patterns. In Proc. ICDE’95, pages 3–14. IEEE Press, March 1995.

    Google Scholar 

  2. Matthieu Capelle. Extraction de motifs séquentiels sous contraintes (in french). Master’s thesis, DEA ECD, INSA Lyon, Villeurbanne, France, September 2001.

    Google Scholar 

  3. M. N. Garofalakis, R. Rastogi, and K. Shim. SPIRIT: Sequential Pattern Mining with Regular Expression Constraints. In Proc. VLDB’99, pages 223–234. Morgan Kaufmann, September 1999.

    Google Scholar 

  4. Levenshtein. Binary codes capable of corecting deletions, insertions, and reversals, 1966.

    Google Scholar 

  5. J. Liu, Kelvin Chi Kuen Wong, and Ka Keung Hui. Discovering user behavior patterns in personalized interface agents. In Proc. IDEAL 2000, pages 398–403. Springer Verlag LNCS 1983, December 2000.

    Google Scholar 

  6. H. Mannila, H. Toivonen, and A. I. Verkamo. Discovery of frequent episodes in event sequences. Data Mining and Knowledge Discovery, 1(3):259–289, 1997.

    Article  Google Scholar 

  7. P. Moen. Attribute, Event Sequence, and Event Type Simarity Notions for Data Mining. PhD thesis, Dept. of Computer Science, University of Helsinki, Finland, February 2000.

    Google Scholar 

  8. R. T. Ng, L. V.S. Lakshmanan, J. Han, and A. Pang. Exploratory mining and pruning optimizations of constrained associations rules. In Proc. SIGMOD’98, pages 13–24. ACM Press, June 1998.

    Google Scholar 

  9. J. Pei, J. Han, and L. V.S. Lakshmanan. Mining frequent itemsets with convertible constraints. In Proc. ICDE’01, pages 433–442. IEEE Computer Press, April 2001.

    Google Scholar 

  10. M. J. Zaki. Sequence mining in categorical domains: Incorporating constraints. In Proc. CIKM’00, pages 422–429. ACM Press, November 2000.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2002 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Capelle, M., Masson, C., Boulicaut, JF. (2002). Mining Frequent Sequential Patterns under a Similarity Constraint. In: Yin, H., Allinson, N., Freeman, R., Keane, J., Hubbard, S. (eds) Intelligent Data Engineering and Automated Learning — IDEAL 2002. IDEAL 2002. Lecture Notes in Computer Science, vol 2412. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45675-9_1

Download citation

  • DOI: https://doi.org/10.1007/3-540-45675-9_1

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-44025-3

  • Online ISBN: 978-3-540-45675-9

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics