Domain Structures in Filtering Irrelevant Frequent Patterns

Hätönen, Kimmo; Klemettinen, Mika

doi:10.1007/978-3-540-44497-8_15

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 2682)

Abstract

Events are used to monitor many types of processes in several technical domains. Computers and efficient electronic communication networks make it easy to increase the accuracy and amount of logged detail. As logs grow, collecting and analysing them becomes steadily harder. Frequent episodes offer one possible method for structuring logs and finding the information hidden in them. Unfortunately, when events reflecting simultaneous, independent processes are stored at central monitoring points, signs of several unrelated phenomena become mixed with each other. As a result, algorithms that search for frequent episodes produce accidental and irrelevant results. As a solution to this problem, we introduce the notion of domain constraints, which are based on distance measures that can be defined in terms of the domain structure and the taxonomies in use. We also show how these constraints can be used to prune irrelevant event combinations.
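
The abstract gives no formal definitions, so the following is only a minimal sketch of what a distance-based domain constraint could look like, not the authors' method. It assumes a tree-shaped taxonomy of event sources, takes the distance between two sources to be the number of edges on the path between them, and discards any candidate event combination whose sources lie further apart than a chosen threshold. The taxonomy contents and the names `PARENT`, `tree_distance`, `satisfies_constraint` and `prune` are hypothetical and chosen for illustration.

```python
from itertools import combinations

# Hypothetical taxonomy of event sources: each node maps to its parent,
# and None marks the root. The real domain structure is an assumption here.
PARENT = {
    "network": None,
    "region_a": "network",
    "region_b": "network",
    "bsc_1": "region_a",
    "bsc_2": "region_b",
    "bts_11": "bsc_1",
    "bts_12": "bsc_1",
    "bts_21": "bsc_2",
}


def path_to_root(node):
    """Return the list of nodes from `node` up to the root, inclusive."""
    path = []
    while node is not None:
        path.append(node)
        node = PARENT[node]
    return path


def tree_distance(a, b):
    """Number of edges between a and b via their lowest common ancestor."""
    steps_from_a = {n: i for i, n in enumerate(path_to_root(a))}
    for steps_from_b, n in enumerate(path_to_root(b)):
        if n in steps_from_a:
            return steps_from_a[n] + steps_from_b
    raise ValueError("nodes do not share a root")


def satisfies_constraint(sources, max_distance):
    """True if every pair of sources is within max_distance of each other."""
    return all(
        tree_distance(a, b) <= max_distance
        for a, b in combinations(sources, 2)
    )


def prune(candidates, max_distance):
    """Keep only the candidate combinations that satisfy the constraint."""
    return [c for c in candidates if satisfies_constraint(c, max_distance)]


candidates = [("bts_11", "bts_12"), ("bts_11", "bts_21")]
kept = prune(candidates, max_distance=2)
```

With a threshold of 2, the pair under the same controller (`bts_11`, `bts_12`) is kept, while the pair spanning two regions is pruned. Applying such a check to candidates before counting their occurrences is one way a constraint of this kind could reduce accidental combinations of unrelated events.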

Author information

Authors and Affiliations

Nokia Research Center, Nokia Group, P.O.Box 407, FIN-00045, Finland
Kimmo Hätönen & Mika Klemettinen

Editor information

Editors and Affiliations

Dipartimento di Informatica, Università di Torino, Italy
Rosa Meo
Dipartimento di Elettronica e Informazione, Politecnico di Milano, Milano, Italy
Pier Luca Lanzi
Nokia Research Center, Nokia Group, P.O.Box 407, FIN-00045, Finland
Mika Klemettinen

About this chapter

Cite this chapter

Hätönen, K., Klemettinen, M. (2004). Domain Structures in Filtering Irrelevant Frequent Patterns. In: Meo, R., Lanzi, P.L., Klemettinen, M. (eds) Database Support for Data Mining Applications. Lecture Notes in Computer Science, vol 2682. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-44497-8_15

DOI: https://doi.org/10.1007/978-3-540-44497-8_15
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22479-2
Online ISBN: 978-3-540-44497-8
eBook Packages: Springer Book Archive