ABSTRACT
Constraints are essential for many sequential pattern mining applications. However, there is no systematic study on constraint-based sequential pattern mining. In this paper, we investigate this issue and point out that the framework developed for constrained frequent-pattern mining does not fit our missions well. An extended framework is developed based on a sequential pattern growth methodology. Our study shows that constraints can be effectively and efficiently pushed deep into sequential pattern mining under this new framework. Moreover, this framework can be extended to constraint-based structured pattern mining as well.
- R. Agrawal and R. Srikant. Fast algorithms for mining association rules. VLDB'94. Google ScholarDigital Library
- R. Agrawal and R. Srikant. Mining sequential patterns. ICDE'95. Google ScholarDigital Library
- M. Garofalakis, R. Rastogi, and K. Shim. Spirit: Sequential pattern mining with regular expression constraints. VLDB'99. Google ScholarDigital Library
- J. Han, J. Pei, and Y. Yin. Mining frequent patterns without candidate generation. SIGMOD'00. Google ScholarDigital Library
- H. Mannila, H~Toivonen, and A. I. Verkamo. Discovery of frequent episodes in event sequences. Data Mining and Knowledge Discovery, 1:259--289, 1997. Google ScholarDigital Library
- R. Ng, L. V. S. Lakshmanan, J. Han, and A. Pang. Exploratory mining and pruning optimizations of constrained associations rules. SIGMOD'98. Google ScholarDigital Library
- J. Pei and J. Han. Can we push more constraints into frequent pattern mining? KDD'00. Google ScholarDigital Library
- J. Pei et al. Mining frequent itemsets with convertible constraints. ICDE'01. Google ScholarDigital Library
- J. Pei et al. PrefixSpan: Mining sequential patterns efficiently by prefix-projected pattern growth. ICDE'01. Google ScholarDigital Library
- R. Srikant and R. Agrawal. Mining quantitative association rules in large relational tables. SIGMOD'96. Google ScholarDigital Library
- M. Zaki. SPADE: An efficient algorithm for mining frequent sequences. Machine Learning, 40:31--60, 2001. Google ScholarDigital Library
Index Terms
- Mining sequential patterns with constraints in large databases
Recommendations
Post sequential patterns mining: a new method for discovering structural patterns
Intelligent information processing IIIn this paper we present a novel data mining technique, known as Post Sequential Patterns Mining, which can be used to discover Structural Patterns. A Structural Pattern is a new pattern, which is composed of sequential patterns, branch patterns or ...
Mining Sequential Patterns by Pattern-Growth: The PrefixSpan Approach
Sequential pattern mining is an important data mining problem with broad applications. However, it is also a difficult problem since the mining may have to generate or examine a combinatorially explosive number of intermediate subsequences. Most of the ...
Mining Weighted Closed Sequential Patterns in Large Databases
FSKD '08: Proceedings of the 2008 Fifth International Conference on Fuzzy Systems and Knowledge Discovery - Volume 05Previous algorithms mine the complete set of sequential patterns in large database efficiently, but when mining long sequential patterns in dense databases or using low minimum supports, it may produce many redundant patterns and some uninterested ...
Comments