Abstract
This paper presents MOWCATL, an efficient method for mining frequent association rules from multiple sequential data sets. Our goal is to find patterns in one or more sequences that precede the occurrence of patterns in other sequences. Recent work has highlighted the importance of using constraints to focus the mining process on the association rules relevant to the user. To refine the data mining process, this approach introduces the use of separate antecedent and consequent inclusion constraints, in addition to the traditional frequency and support constraints in sequential data mining. Moreover, separate antecedent and consequent maximum window widths are used to specify the antecedent and consequent patterns that are separated by either a maximal width time lag or a fixed width time lag.
Multiple time series drought risk management data are used to show that our approach can be effectively employed in real-life problems. This approach is compared to existing methods to show how they complement each other to discover associations in the drought risk management domain. The experimental results validate the superior performance of our method for efficiently finding relationships between global climatic episodes and local drought conditions. Both the maximal and fixed width time lags are shown to be useful when finding interesting associations.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Bettini, C., Wang, X.S., and Jajodia, S. (1998). Discovering Temporal Relationships with Multiple Granularities in Time Sequences. IEEE Transactions on Knowledge and Data Engineering, 10(2), 222-237.
Cong, G. and Liu, B. (2002). Speed-up Iterative Frequent Itemset Mining with Constraint Changes. In Proceedings of the 2002 IEEE International Conference on Data Mining, Maebashi, Japan.
Feng, L., Lu, H., Yu, J.X., and Han, J. (1999). Mining Inter-Transaction Associations with Templates. In Proceedings of the 1999 International Conference on Information and Knowledge Management [CIKM'99], Kansas City, Missouri, USA.
Goddard, S., Harms, S.K., Reichenbach, S.E., Tadesse, T., and Waltman, W.J. (2003). Geospatial Decision Support for Drought Risk Management. Communications of the ACM, 46(1), 35-37.
Goldin, D.Q. and Kanellakis, P.C. (1995). On Similarity Queries for Time-Series Data: Constraint Specification and Implementation. In Proceedings of the 1995 International Conference on the Principles and Practice of Constraint Programming (pp. 137-153). Marseilles, France.
Harms, S.K., Deogun, J., Saquer, J., and Tadesse, T. (2001). Discovering Representative Episodal Association Rules from Event Sequences Using Frequent Closed Episode Sets and Event Constraints. In Proceedings of the 2001 IEEE International Conference on Data Mining (pp. 603-606). San Jose, California, USA.
Harms, S.K., Deogun, J., and Tadesse, T. (2002). Discovering Sequential Association Rules with Constraints and Time Lags in Multiple Sequences. In Proceedings of the 2002 International Symposium on Methodologies for Intelligent Systems (pp. 432-441). Lyon, France.
Kryszkiewicz, M. (1998). Fast Discovery of Representative Association Rules. In Lecture Notes in Artificial Intelligence, Vol. 1424 (pp. 214-221). Proceedings of RSCTC 98, Springer-Verlag.
Mannila, H., Toivonen, H., and Verkamo, A.I. (1995). Discovering Frequent Episodes in Sequences. In Proceedings of the First International Conference on Knowledge Discovery and Data Mining [KDD 95] (pp. 210-215). Montreal, Canada.
Mannila, H., Toivonen, H., and Verkamo, A.I. (1997). Discovery of Frequent Episodes in Event Sequences. Technical report, Department of Computer Science, University of Helsinki, Finland. Report C-1997-15.
McGee, T.B., Doeskin, N.J., and Kliest, J. (1995). Drought Monitoring with Multiple Time Scales. In Proceedings of the 9th Conference on Applied Climatology (pp. 233-236). Boston, MA.
Ng, R., Lakshmanan, L.S., Han, J., and Pang, A. (1998). Exploratory Mining and Pruning Optimizations of Constrained Associations Rules. In Proceedings of the 1998 ACM SIGMOD International Conference on Management of Data, Seattle, Washington, USA.
Ross, T. and Lott, N. (2000). A Climatology of Recent Extreme Weather and Climate Events. Technical report 2000-02, National Climatic Data Center, US Dept. of Commerce.
Srikant, R. and Agrawal, R. (1996). Mining Sequential Patterns: Generalizations and Performance Improvements. In Proceedings of the Fifth International Conference on Extending Database Technology.
Srikant, R., Vu, Q., and Agrawal, R. (1997). Mining Association Rules with Item Constraints. In Proceedings of the Third International Conference on Knowledge Discovery and Data Mining [KDD 97] (pp. 67-73).
Tan, P., Potter, C., Steinbach, M., Klooster, S., Kumar, V., and Torregrosa, A. (2001). Finding Spatio-Temporal Patterns in Earth Science Data. In KDD-2001 Workshop on Temporal Data Mining, San Francisco, CA.
U.S. Drought Monitor. (2002). Hosted and Maintained by the National Drought Mitigation Center. http://enso.unl.edu/monitor/.
Wilhite, D.A. (2000). Drought as a Natural Hazard: Concepts and Definitions. In D.A. Wilhite (Ed.), Drought Volume II: A Global Assessment, Routledge Hazards and Disaster Series (pp. 3-8). New York: Routledge Publishers.
Zaki, M. (2000). Sequence mining in Categorical Domains: Incorporating Constraints. In Proceedings of the Ninth International Conference on Information and Knowledge Management [CIKM2000] (pp. 422-429). Washington DC, USA.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Harms, S.K., Deogun, J.S. Sequential Association Rule Mining with Time Lags. Journal of Intelligent Information Systems 22, 7–22 (2004). https://doi.org/10.1023/A:1025824629047
Issue Date:
DOI: https://doi.org/10.1023/A:1025824629047