Abstract
Many practical applications are related to frequent sequential pattern mining, ranging from Web Usage Mining to Bioinformatics. To ensure an appropriate extraction cost for useful mining tasks, a key issue is to push the user-defined constraints deep inside the mining algorithms. In this paper, we study the search for frequent sequential patterns that are also similar to an user-defined reference pattern. While the effective processing of the frequency constraints is well-understood, our contribution concerns the identification of a relaxation of the similarity constraint into a convertible anti-monotone constraint. Both constraints are then used to prune the search space during a levelwise search. Preliminary experimental validations have confirmed the algorithm efficiency.
Research partially funded by the European contract cInQ IST 2000-26469.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
R. Agrawal and R. Srikant. Mining sequential patterns. In Proc. ICDE’95, pages 3–14. IEEE Press, March 1995.
Matthieu Capelle. Extraction de motifs séquentiels sous contraintes (in french). Master’s thesis, DEA ECD, INSA Lyon, Villeurbanne, France, September 2001.
M. N. Garofalakis, R. Rastogi, and K. Shim. SPIRIT: Sequential Pattern Mining with Regular Expression Constraints. In Proc. VLDB’99, pages 223–234. Morgan Kaufmann, September 1999.
Levenshtein. Binary codes capable of corecting deletions, insertions, and reversals, 1966.
J. Liu, Kelvin Chi Kuen Wong, and Ka Keung Hui. Discovering user behavior patterns in personalized interface agents. In Proc. IDEAL 2000, pages 398–403. Springer Verlag LNCS 1983, December 2000.
H. Mannila, H. Toivonen, and A. I. Verkamo. Discovery of frequent episodes in event sequences. Data Mining and Knowledge Discovery, 1(3):259–289, 1997.
P. Moen. Attribute, Event Sequence, and Event Type Simarity Notions for Data Mining. PhD thesis, Dept. of Computer Science, University of Helsinki, Finland, February 2000.
R. T. Ng, L. V.S. Lakshmanan, J. Han, and A. Pang. Exploratory mining and pruning optimizations of constrained associations rules. In Proc. SIGMOD’98, pages 13–24. ACM Press, June 1998.
J. Pei, J. Han, and L. V.S. Lakshmanan. Mining frequent itemsets with convertible constraints. In Proc. ICDE’01, pages 433–442. IEEE Computer Press, April 2001.
M. J. Zaki. Sequence mining in categorical domains: Incorporating constraints. In Proc. CIKM’00, pages 422–429. ACM Press, November 2000.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Capelle, M., Masson, C., Boulicaut, JF. (2002). Mining Frequent Sequential Patterns under a Similarity Constraint. In: Yin, H., Allinson, N., Freeman, R., Keane, J., Hubbard, S. (eds) Intelligent Data Engineering and Automated Learning — IDEAL 2002. IDEAL 2002. Lecture Notes in Computer Science, vol 2412. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45675-9_1
Download citation
DOI: https://doi.org/10.1007/3-540-45675-9_1
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44025-3
Online ISBN: 978-3-540-45675-9
eBook Packages: Springer Book Archive