Abstract
To support data mining post-processing, which is one of the important procedures in a data mining process, at least 40 indices are proposed to acquire valuable knowledge. However, since their behaviors have never been elucidated, domain experts are required to spend their time to understanding the meanings of each index in a given data mining result. In this paper, we present an analysis of the behavior of objective rule evaluation indices on classification rule sets by principle component analysis (PCA). Therefore, we carried out a PCA to a dataset consisting of the 39 objective rule evaluation indices. In order to obtain the dataset, we calculated the average values of the bootstrap method on 32 classification rule sets learned by information gain ratio. Then, we identified the seven functional groups of the objective indices based on the PCA. Using this result, we discuss a rule evaluation interface for use by human experts.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Hilderman, R.J., Hamilton, H.J.: Knowledge Discovery and Measure of Interest. Kluwer Academic Publishers, Dordrecht (2001)
Tan, P.N., Kumar, V., Srivastava, J.: Selecting the right interestingness measure for association patterns. In: Proceedings of International Conference on Knowledge Discovery and Data Mining KDD 2002, pp. 32–41 (2002)
Yao, Y.Y., Zhong, N.: An analysis of quantitative measures associated with rules. In: Zhong, N., Zhou, L. (eds.) PAKDD 1999. LNCS (LNAI), vol. 1574, pp. 479–488. Springer, Heidelberg (1999)
Abe, H., Tsumoto, S., Ohsaki, M., Yamaguchi, T.: A rule evaluation support method with learning models based on objective rule evaluation indexes. In: Proceeding of the IEEE International Conference on Data Mining ICDM 2005, pp. 549–552 (2005)
Freitas, A.A.: On rule interestingness measures. Knowledge-Based Systems 12(5-6), 309–315 (1999)
Vaillant, B., Lenca, P., Lallich, S.: A clustering of interestingness measures. In: Hoffmann, A., Motoda, H., Scheffer, T. (eds.) DS 2005. LNCS (LNAI), vol. 3735, pp. 290–297. Springer, Heidelberg (2005)
Huynh, X.H., Guillet, F., Briand, H.: A data analysis approach for evaluating the behavior of interestingness measures. In: Hoffmann, A., Motoda, H., Scheffer, T. (eds.) DS 2005. LNCS, vol. 3735, pp. 330–337. Springer, Heidelberg (2005)
Blanchard, J., Guillet, F., Gras, R., Briand, H.: Using information-theoretic measures to assess association rule interestingness. In: Proceedings of the fifth IEEE International Conference on Data Mining ICDM 2005, pp. 66–73. IEEE Computer Society Press, Los Alamitos (2005)
Ohsaki, M., Abe, H., Yokoi, H., Tsumoto, S., Yamaguchi, T.: Evaluation of rule interestingness measures in medical knowledge discovery in databases. Artificial Intelligence in Medicine 41(3), 177–196 (2007)
Fayyad, U.M., Piatetsky-Shapiro, G., Smyth, P., Uthurusamy, R. (eds.): Explora: A Multipattern and Multistrategy Discovery Assistant. In: Advances in Knowledge Discovery and Data Mining, pp. 249–271. AAAI/MIT Press, California (1996)
Ali, K., Manganaris, S., Srikant, R.: Partial classification using association rules. In: Proceedings of the International Conference on Knowledge Discovery and Data Mining KDD 1997, pp. 115–118 (1997)
Brin, S., Motwani, R., Ullman, J., Tsur, S.: Dynamic itemset counting and implication rules for market basket data. In: Proceedings of the ACM SIGMOD International Conference on Management of Data, pp. 255–264 (1997)
Rijsbergen, C.: Information retrieval, ch. 7 (1979), http://www.dcs.gla.ac.uk/Keith/Chapter.7/Ch.7.html
Gray, B., Orlowska, M.E.: CCAIIA: Clustering categorical attributes into interesting association rules. In: Wu, X., Kotagiri, R., Korb, K.B. (eds.) PAKDD 1998. LNCS (LNAI), vol. 1394, pp. 132–143. Springer, Heidelberg (1998)
Hamilton, H.J., Shan, N., Ziarko, W.: Machine learning of credible classifications. In: Australian Conference on Artificial Intelligence AI 1997, pp. 330–339 (1997)
Goodman, L.A., Kruskal, W.H.: Measures of association for cross classification. Springer Series in Statistics, vol. 1. Springer, Heidelberg (1979)
Smyth, P., Goodman, R.M.: Rule induction using information theory. In: Piatetsky-Shapiro, G., Frawley, W.J. (eds.) Knowledge Discovery in Databases, pp. 159–176. AAAI/MIT Press (1991)
Ohsaki, M., Kitaguchi, S., Kume, S., Yokoi, H., Yamaguchi, T.: Evaluation of rule interestingness measures with a clinical dataset on hepatitis. In: Boulicaut, J.-F., Esposito, F., Giannotti, F., Pedreschi, D. (eds.) PKDD 2004. LNCS (LNAI), vol. 3202, pp. 362–373. Springer, Heidelberg (2004)
Piatetsky-Shapiro, G.: Discovery, analysis and presentation of strong rules. In: Piatetsky-Shapiro, G., Frawley, W.J. (eds.) Knowledge Discovery in Databases, pp. 229–248. AAAI/MIT Press (1991)
Gago, P., Bento, C.: A metric for selection of the most promising rules. In: Żytkow, J.M. (ed.) PKDD 1998. LNCS (LNAI), vol. 1510, pp. 19–27. Springer, Heidelberg (1998)
Zhong, N., Yao, Y.Y., Ohshima, M.: Peculiarity oriented multi-database mining. IEEE Transactions on Knowledge and Data Engineering 15(4), 952–960 (2003)
Hettich, S., Blake, C.L., Merz, C.J.: UCI repository of machine learning databases, University of California, Department of Information and Computer Science, Irvine, CA (1998), http://www.ics.uci.edu/~mlearn/MLRepository.html
Witten, I.H., Frank, E.: Data Mining: Practical Machine Learning Tools and Techniques with Java Implementations. Morgan Kaufmann, San Francisco (2000)
Frank, E., Witten, I.H.: Generating accurate rule sets without global optimization. In: The Fifteenth International Conference on Machine Learning, pp. 144–151 (1998)
Abe, H., Tsumoto, S.: Analyzing behavior of objective rule evaluation indices based on pearson product-moment correlation coefficient. In: An, A., Matwin, S., Raś, Z.W., Ślęzak, D. (eds.) Foundations of Intelligent Systems. LNCS (LNAI), vol. 4994, pp. 84–89. Springer, Heidelberg (2008)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Abe, H., Tsumoto, S., Ohsaki, M., Yamaguchi, T. (2008). Finding Functional Groups of Objective Rule Evaluation Indices Using PCA. In: Yamaguchi, T. (eds) Practical Aspects of Knowledge Management. PAKM 2008. Lecture Notes in Computer Science(), vol 5345. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-89447-6_19
Download citation
DOI: https://doi.org/10.1007/978-3-540-89447-6_19
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-89446-9
Online ISBN: 978-3-540-89447-6
eBook Packages: Computer ScienceComputer Science (R0)