Skip to main content

Finding Functional Groups of Objective Rule Evaluation Indices Using PCA

  • Conference paper
Practical Aspects of Knowledge Management (PAKM 2008)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 5345))

Included in the following conference series:

Abstract

To support data mining post-processing, which is one of the important procedures in a data mining process, at least 40 indices are proposed to acquire valuable knowledge. However, since their behaviors have never been elucidated, domain experts are required to spend their time to understanding the meanings of each index in a given data mining result. In this paper, we present an analysis of the behavior of objective rule evaluation indices on classification rule sets by principle component analysis (PCA). Therefore, we carried out a PCA to a dataset consisting of the 39 objective rule evaluation indices. In order to obtain the dataset, we calculated the average values of the bootstrap method on 32 classification rule sets learned by information gain ratio. Then, we identified the seven functional groups of the objective indices based on the PCA. Using this result, we discuss a rule evaluation interface for use by human experts.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Hilderman, R.J., Hamilton, H.J.: Knowledge Discovery and Measure of Interest. Kluwer Academic Publishers, Dordrecht (2001)

    Book  MATH  Google Scholar 

  2. Tan, P.N., Kumar, V., Srivastava, J.: Selecting the right interestingness measure for association patterns. In: Proceedings of International Conference on Knowledge Discovery and Data Mining KDD 2002, pp. 32–41 (2002)

    Google Scholar 

  3. Yao, Y.Y., Zhong, N.: An analysis of quantitative measures associated with rules. In: Zhong, N., Zhou, L. (eds.) PAKDD 1999. LNCS (LNAI), vol. 1574, pp. 479–488. Springer, Heidelberg (1999)

    Chapter  Google Scholar 

  4. Abe, H., Tsumoto, S., Ohsaki, M., Yamaguchi, T.: A rule evaluation support method with learning models based on objective rule evaluation indexes. In: Proceeding of the IEEE International Conference on Data Mining ICDM 2005, pp. 549–552 (2005)

    Google Scholar 

  5. Freitas, A.A.: On rule interestingness measures. Knowledge-Based Systems 12(5-6), 309–315 (1999)

    Article  Google Scholar 

  6. Vaillant, B., Lenca, P., Lallich, S.: A clustering of interestingness measures. In: Hoffmann, A., Motoda, H., Scheffer, T. (eds.) DS 2005. LNCS (LNAI), vol. 3735, pp. 290–297. Springer, Heidelberg (2005)

    Google Scholar 

  7. Huynh, X.H., Guillet, F., Briand, H.: A data analysis approach for evaluating the behavior of interestingness measures. In: Hoffmann, A., Motoda, H., Scheffer, T. (eds.) DS 2005. LNCS, vol. 3735, pp. 330–337. Springer, Heidelberg (2005)

    Chapter  Google Scholar 

  8. Blanchard, J., Guillet, F., Gras, R., Briand, H.: Using information-theoretic measures to assess association rule interestingness. In: Proceedings of the fifth IEEE International Conference on Data Mining ICDM 2005, pp. 66–73. IEEE Computer Society Press, Los Alamitos (2005)

    Google Scholar 

  9. Ohsaki, M., Abe, H., Yokoi, H., Tsumoto, S., Yamaguchi, T.: Evaluation of rule interestingness measures in medical knowledge discovery in databases. Artificial Intelligence in Medicine 41(3), 177–196 (2007)

    Article  Google Scholar 

  10. Fayyad, U.M., Piatetsky-Shapiro, G., Smyth, P., Uthurusamy, R. (eds.): Explora: A Multipattern and Multistrategy Discovery Assistant. In: Advances in Knowledge Discovery and Data Mining, pp. 249–271. AAAI/MIT Press, California (1996)

    Google Scholar 

  11. Ali, K., Manganaris, S., Srikant, R.: Partial classification using association rules. In: Proceedings of the International Conference on Knowledge Discovery and Data Mining KDD 1997, pp. 115–118 (1997)

    Google Scholar 

  12. Brin, S., Motwani, R., Ullman, J., Tsur, S.: Dynamic itemset counting and implication rules for market basket data. In: Proceedings of the ACM SIGMOD International Conference on Management of Data, pp. 255–264 (1997)

    Google Scholar 

  13. Rijsbergen, C.: Information retrieval, ch. 7 (1979), http://www.dcs.gla.ac.uk/Keith/Chapter.7/Ch.7.html

  14. Gray, B., Orlowska, M.E.: CCAIIA: Clustering categorical attributes into interesting association rules. In: Wu, X., Kotagiri, R., Korb, K.B. (eds.) PAKDD 1998. LNCS (LNAI), vol. 1394, pp. 132–143. Springer, Heidelberg (1998)

    Google Scholar 

  15. Hamilton, H.J., Shan, N., Ziarko, W.: Machine learning of credible classifications. In: Australian Conference on Artificial Intelligence AI 1997, pp. 330–339 (1997)

    Google Scholar 

  16. Goodman, L.A., Kruskal, W.H.: Measures of association for cross classification. Springer Series in Statistics, vol. 1. Springer, Heidelberg (1979)

    Book  MATH  Google Scholar 

  17. Smyth, P., Goodman, R.M.: Rule induction using information theory. In: Piatetsky-Shapiro, G., Frawley, W.J. (eds.) Knowledge Discovery in Databases, pp. 159–176. AAAI/MIT Press (1991)

    Google Scholar 

  18. Ohsaki, M., Kitaguchi, S., Kume, S., Yokoi, H., Yamaguchi, T.: Evaluation of rule interestingness measures with a clinical dataset on hepatitis. In: Boulicaut, J.-F., Esposito, F., Giannotti, F., Pedreschi, D. (eds.) PKDD 2004. LNCS (LNAI), vol. 3202, pp. 362–373. Springer, Heidelberg (2004)

    Chapter  Google Scholar 

  19. Piatetsky-Shapiro, G.: Discovery, analysis and presentation of strong rules. In: Piatetsky-Shapiro, G., Frawley, W.J. (eds.) Knowledge Discovery in Databases, pp. 229–248. AAAI/MIT Press (1991)

    Google Scholar 

  20. Gago, P., Bento, C.: A metric for selection of the most promising rules. In: Żytkow, J.M. (ed.) PKDD 1998. LNCS (LNAI), vol. 1510, pp. 19–27. Springer, Heidelberg (1998)

    Chapter  Google Scholar 

  21. Zhong, N., Yao, Y.Y., Ohshima, M.: Peculiarity oriented multi-database mining. IEEE Transactions on Knowledge and Data Engineering 15(4), 952–960 (2003)

    Article  Google Scholar 

  22. Hettich, S., Blake, C.L., Merz, C.J.: UCI repository of machine learning databases, University of California, Department of Information and Computer Science, Irvine, CA (1998), http://www.ics.uci.edu/~mlearn/MLRepository.html

  23. Witten, I.H., Frank, E.: Data Mining: Practical Machine Learning Tools and Techniques with Java Implementations. Morgan Kaufmann, San Francisco (2000)

    Google Scholar 

  24. Frank, E., Witten, I.H.: Generating accurate rule sets without global optimization. In: The Fifteenth International Conference on Machine Learning, pp. 144–151 (1998)

    Google Scholar 

  25. Abe, H., Tsumoto, S.: Analyzing behavior of objective rule evaluation indices based on pearson product-moment correlation coefficient. In: An, A., Matwin, S., Raś, Z.W., Ślęzak, D. (eds.) Foundations of Intelligent Systems. LNCS (LNAI), vol. 4994, pp. 84–89. Springer, Heidelberg (2008)

    Chapter  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Abe, H., Tsumoto, S., Ohsaki, M., Yamaguchi, T. (2008). Finding Functional Groups of Objective Rule Evaluation Indices Using PCA. In: Yamaguchi, T. (eds) Practical Aspects of Knowledge Management. PAKM 2008. Lecture Notes in Computer Science(), vol 5345. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-89447-6_19

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-89447-6_19

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-89446-9

  • Online ISBN: 978-3-540-89447-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics