Skip to main content

Interestingness Measures - On Determining What Is Interesting

  • Chapter
  • First Online:
Data Mining and Knowledge Discovery Handbook

Summary

As the size of databases increases, the sheer number of mined from them can easily overwhelm users of the KDD process. Users run the KDD process because they are overloaded by data. To be successful, the KDD process needs to extract interesting patterns from large masses of data. In this chapter we examine methods of tackling this challenge: how to identify interesting patterns.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 349.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  • Adomavicius, G. and Tuzhilin, A. (1997). Discovery of actionable patterns in databases: The action hierarchy approach. In Proceedings of the Third International Conference on Knowledge Discovery and Data Mining, pages 111–114, Newport Beach, CA, USA. AAAI Press.

    Google Scholar 

  • Adomavicius, G. and Tuzhilin, A. (2001). Expert-driven validation of rule-based user models in personalization applications. Data Mining and Knowledge Discovery, 5(1/2):33–58.

    Article  MATH  Google Scholar 

  • Aggarwal, C. C. and Yu, P. S. (1998). A new approach to online generation of association rules. Technical Report Research Report RC 20899, IBM T J Watson Research Center.

    Google Scholar 

  • Agrawal, R., Heikki, M., Srikant, R., Toivonen, H., and Verkamo, A. I. (1996). Advances in Knowledge Discovery and Data Mining, chapter 12: Fast Discovery of Association Rules, pages 307–328. AAAI Press/The MIT Press, Menlo Park, California.

    Google Scholar 

  • Basu, S., Mooney, R. J., Pasupuleti, K. V., and Ghosh, J. (2001). Evaluating the novelty of text-mined rules using lexical knowledge. In Proceedings of the Seventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 233–238, San Francisco, CA, USA.

    Google Scholar 

  • Bayardo Jr., R. J. and Agrawal, R. (1999). Mining the most interesting rules. In Proceedings of the Fifth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 145–154, San Diego, CA.

    Google Scholar 

  • Bayardo Jr., R. J., Agrawal, R., and Gunopulos, D. (1999). Constraint-based rule mining in large, dense databases. In Proceedings of the Fifteenth IEEE ICDE International Conference on Data Engineering, pages 188–197, Sydney, Australia.

    Google Scholar 

  • Brin, S., Motwani, R., and Silverstein, C. (1997). Beyond market baskets: Generalizing association rules to correlations. In Proceedings of ACM SIGMOD International Conference on Management of Data, pages 265–276, Tucson, AZ, USA.

    Google Scholar 

  • Fayyad, U. M., Piatetsky-Shapiro, G., and Smyth, P. (1996). Advances in Knowledge Discovery and Data Mining, chapter 1: From Data Mining to Knowledge Discovery: An Overview, pages 1–34. AAAI Press.

  • Hilderman, R. J. and Hamilton, H. J. (2000). Principles for mining summaries using objective measures of interestingness. In Proceedings of the Twelfth IEEE International Conference on Tools with Artificial Intelligence (ICTAI), pages 72–81, Vancouver, Canada.

    Google Scholar 

  • Hilderman, R. J. and Hamilton, H. J. (2001). Knowledge Discovery and Measures of Interest. Kluwer Academic Publishers.

    Google Scholar 

  • Hipp, J. and Günter, U. (2002). Is pushing constraints deeply into the mining algorithms really what we want? SIGKDD Explorations, 4(1): 50–55.

    Article  Google Scholar 

  • Kamber, M. and Shinghal, R. (1996). Evaluating the interestingness of characteristic rules. In Proceedings of the Second International Conference on Knowledge Discovery and Data Mining, pages 263–266, Portland, OR, USA.

    Google Scholar 

  • Klemettinen, M., Mannila, H., Ronkainen, P., Toivonen, H., and Verkam, A. I. (1994). Finding interesting rules from large sets of discovered association rules. In Proceedings of the Third ACM CIKM International Conference on Information and Knowledge Management, pages 401–407, Orlando, FL, USA. ACM Press.

    Google Scholar 

  • Klösgen, W. (1996). Advances in Knowledge Discovery and Data Mining, chapter 10: Explora: a Multipattern and Multistrategy Discovery Assistant, pages 249–271. AAAI Press.

  • Liu, B., Hsu, W., and Chen, S. (1997). Using general impressions to analyze discovered classification rules. In Proceedings of the Third International Conference on Knowledge Discovery and Data Mining, pages 31–36, Newport Beach, CA, USA. AAAI Press.

    Google Scholar 

  • Liu, B., Hsu, W., and Ma, Y. (1999). Pruning and summarizing the discovered associations. In Proceedings of the Fifth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 125–134, San Diego, CA, USA.

    Google Scholar 

  • Liu, B., Hsu, W., and Ma, Y. (2001a). Discovery the set of fundamental rule changes. In Proceedings of the Seventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 335–340, San Francisco, CA, USA.

    Google Scholar 

  • Liu, B., Hsu, W., and Ma, Y. (2001b). Identifying non-actionable association rules. In Proceedings of the Seventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 329–334, San Francisco, CA, USA.

    Google Scholar 

  • Liu, B., Hu, M., and Hsu, W. (2000). Multi-level organization and summarization of the discovered rules. In Proceedings of the Sixth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 208–217, Boston, MA, USA.

    Google Scholar 

  • Major, J. A. and Mangano, J. J. (1995). Selecting among rules induced from a hurricane databases. Journal of Intelligent Information Systems, 4:39–52.

    Article  Google Scholar 

  • Ng, R. T., Lakshmanan, L. V. S., Han, J., and Pang, A. (1998). Exploratory mining and pruning optimizations of constrained association rules. In Proceedings of ACM SIGMOD International Conference on Management of Data, pages 13–24.

    Google Scholar 

  • Padmanabhan, B. and Tuzhilin, A. (2000). Small is beautiful: Discovering the minimal set of unexpected patterns. In Proceedings of the Sixth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 54–63, Boston, MA, USA.

    Google Scholar 

  • Piatetsky-Shapiro, G. (1991). Knowledge Discovery in Databases, chapter 13: Discovery, Analysis, and Presentation of Strong Rules, pages 248–292. AAAI/MIT Press.

  • Rokach, L., Averbuch, M., and Maimon, O., Information retrieval system for medical narrative reports. Lecture notes in artificial intelligence, 3055. pp. 217-228, Springer-Verlag (2004).

    Google Scholar 

  • Sahar, S. (1999). Interestingness via what is not interesting. In Proceedings of the Fifth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 332–336, San Diego, CA, USA.

    Google Scholar 

  • Sahar, S. (2001). Interestingness preprocessing. In Proceedings of the IEEE ICDM International Conference on Data Mining, pages 489–496, San Jose, CA, USA.

    Google Scholar 

  • Sahar, S. (2002a). Exploring interestingness through clustering: A framework. In Proceedings of the IEEE ICDM International Conference on Data Mining, pages 677–680, Maebashi City, Japan.

    Google Scholar 

  • Sahar, S. (2002b). On incorporating subjective interestingness into the mining process. In Proceedings of the IEEE ICDM International Conference on Data Mining, pages 681–684, Maebashi City, Japan.

    Google Scholar 

  • Sahar, S. and Mansour, Y. (1999). An empirical evaluation of objective interestingness criteria. In SPIE Conference on Data Mining and Knowledge Discovery, pages 63–74, Orlando, FL, USA.

    Google Scholar 

  • Shah, D., Lakshmanan, L. V. S., Ramamritham, K., and Sudarshan, S. (1999). Interestingness and pruning of mined patterns. In Proceedings of the ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery (DMKD), Philadelphia, PA, USA.

    Google Scholar 

  • Silberschatz, A. and Tuzhilin, A. (1996). What makes patterns interesting in knowledge discovery systems. IEEE Transactions on Knowledge and Data Engineering (TKDE), 8(6):970–974.

    Article  Google Scholar 

  • Spiliopoulou, M. and Roddick, J. F. (2000). Higher order mining: Modeling and mining the results of knowledge discovery. In Proceedings of the Second Conference on Data Mining Methods and Databases, pages 309–320, Cambridge, UK. WIT Press.

    Google Scholar 

  • Srikant, R., Vu, Q., and Agrawal, R. (1997). Mining association rules with item constraints. In Proceedings of the Third International Conference on Knowledge Discovery and Data Mining, pages 67–73, Newport Beach, CA, USA. AAAI Press.

    Google Scholar 

  • Subramonian, R. (1998). Defining diff as a Data Mining primitive. In Proceedings of the Fourth International Conference on Knowledge Discovery and Data Mining, pages 334–338, New York City, NY, USA. AAAI Press.

    Google Scholar 

  • Tan, P.-N., Kumar, V., and Srivastava, J. (2002). Selecting the right interestingness measure for association patterns. In Proceedings of the Eight ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 32–41, Edmonton, Alberta, Canada.

    Google Scholar 

  • Toivonen, H., Klemettinen, M., Ronkainen, P., Hätönen, K., and Mannila, H. (1995). Pruning and grouping discovered association rules. In Proceedings of the MLnet Familiarization Workshop on Statistics, Machine Learning and Knowledge Discovery in Databases, pages 47–52, Heraklion, Crete, Greece.

    Google Scholar 

  • Tuzhilin, A. and Adomavicius, G. (2002). Handling very large numbers of association rules in the analysis of microarray data. In Proceedings of the Eight ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 396–404, Edmonton, Alberta, Canada.

    Google Scholar 

  • Zaki, M. J. (2000). Generating non-redundant association rules. In Proceedings of the Sixth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 34–43, Boston, MA, USA.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sigal Sahar .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2009 Springer Science+Business Media, LLC

About this chapter

Cite this chapter

Sahar, S. (2009). Interestingness Measures - On Determining What Is Interesting. In: Maimon, O., Rokach, L. (eds) Data Mining and Knowledge Discovery Handbook. Springer, Boston, MA. https://doi.org/10.1007/978-0-387-09823-4_30

Download citation

  • DOI: https://doi.org/10.1007/978-0-387-09823-4_30

  • Published:

  • Publisher Name: Springer, Boston, MA

  • Print ISBN: 978-0-387-09822-7

  • Online ISBN: 978-0-387-09823-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics