Abstract
The Post-processing of mined patterns such as rules, trees and so forth is one of the key operations in a data mining process. However, it is difficult for human experts to completely evaluate several thousand rules from a large dataset with noise. To reduce the cost of this kind of rule evaluation task, we have developed a rule evaluation support method with rule evaluation models that learn from objective indices for mined classification rules and evaluations by a human expert for each rule.
In this paper, we present an evaluation of the learning algorithms of our rule evaluation support method for the post-processing of mined patterns with rule evaluation models based on objective indices. To enhance the adaptability of these rule evaluation models, we introduced a constructive meta-learning system for the construction of appropriate learning algorithms. We then have performed case studies using the meningitis as an actual problem. Furthermore, we evaluated our method with the eight rule sets obtained from eight UCI datasets. With regard to these results, we show the applicability of the constrictive meta-learning scheme as a learning algorithm selection method for our rule evaluation support method.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Hilderman, R.J., Hamilton, H.J.: Knowledge Discovery and Measure of Interest. Kluwer Academic Publishers, Dordrecht (2001)
Tan, P.N., Kumar, V., Srivastava, J.: Selecting the Right Interestingness Measure for Association Patterns. In: Proceeding of International Conference on Knowledge Discovery and Data Mining KDD 2002, pp. 32–41 (2002)
Yao, Y.Y., Zhong, N.: An Analysis of Quantitative Measures Associated with Rules. In: Proceeding of Pacific-Asia Conference on Knowledge Discovery and Data Mining PAKDD 1999, pp. 479–488 (1999)
Ohsaki, M., Kitaguchi, S., Kume, S., Yokoi, H., Yamaguchi, T.: Evaluation of Rule Interestingness Measures with a Clinical Dataset on Hepatitis. In: Boulicaut, J.-F., Esposito, F., Giannotti, F., Pedreschi, D. (eds.) PKDD 2004. LNCS (LNAI), vol. 3202, pp. 362–373. Springer, Heidelberg (2004)
Breiman, L.: Bagging predictors. Machine Learning 24(2), 123–140 (1996)
Freund, Y., Schapire, R.E.: Experiments with a new boosting algorithm. In: Proc. of Thirteenth International Conference on Machine Learning, pp. 148–156 (1996)
Wolpert, D.: Stacked Generalization. Neural Network 5(2), 241–260 (1992)
Gama, J., Brazdil, P.: Cascade generalization. Machine Learning 41(3), 315–343 (2000)
Metal (2002), http://www.metal-kdd.org/
Bernstein, A., Provost, F.: An intelligent assistant for knowledge discovery process. In: Proceeding IJICAI 2001 Workshop on Wrappers for Performance Enhancement in KDD (2001)
Abe, H., Yamaguchi, T.: Constructive Meta-Learning with Machine Learning Method Repositories. In: Orchard, B., Yang, C., Ali, M. (eds.) IEA/AIE 2004. LNCS (LNAI), vol. 3029, pp. 502–511. Springer, Heidelberg (2004)
Hatazawa, H., Negishi, N., Suyama, A., Tsumoto, S., Yamaguchi, T.: Knowledge Discovery Support from a Meningoencephalitis Database Using an Automatic Composition Tool for Inductive Applications. In: Proc. of KDD Challenge, in conjunction with PAKDD 2000, pp. 28–33 (2000)
Hettich, S., Blake, C.L., Merz, C.J.: UCI Repository of machine learning databases, Department of Information and Computer Science, University of California, Irvine (1998), http://www.ics.uci.edu/mlearn/MLRepository.html
Fayyad, U.M., Piatetsky-Shapiro, G., Smyth, P., Uthurusamy, R. (eds.): Explora: A Multipattern and Multistrategy Discovery Assistant. In: Advances in Knowledge Discovery and Data Mining, pp. 249–271. AAAI/MIT Press (1996)
Ali, K., Manganaris, S., Srikant, R.: Partial classification using association rules. In: Proceedings of the International Conference on Knowledge Discovery and Data Mining KDD 1997, pp. 115–118 (1997)
Brin, S., Motwani, R., Ullman, J., Tsur, S.: Dynamic itemset counting and implication rules for market basket data. In: Proc. of ACM SIGMOD Int. Conf. on Management of Data, pp. 255–264 (1997)
Rijsbergen, C.: Information Retrieval, Ch. 7, Butterworths, London (1979), http://www.dcs.gla.ac.uk/Keith/Chapter.7/Ch.7.html
Gray, B., Orlowska, M.E.: CCAIIA: Clustering Categorical Attributes into Interesting Association Rules. In: Pacific-Asia Conf. on Knowledge Discovery and Data Mining PAKDD 1998, pp. 132–143 (1998)
Hamilton, H.J., Shan, N., Ziarko, W.: Machine Learning of Credible Classifications. In: Proc. of Australian Conf. on Artificial Intelligence AI 1997, pp. 330–339 (1997)
Goodman, L.A., Kruskal, W.H.: Measures of association for cross classifications. Springer Series in Statistics, vol. 1. Springer, Heidelberg (1979)
Smyth, P., Goodman, R.M.: Rule Induction using Information Theory. In: Piatetsky-Shapiro, G., Frawley, W.J. (eds.) Knowledge Discovery in Databases, pp. 159–176. AAAI/MIT Press (1991)
Piatetsky-Shapiro, G.: Discovery, Analysis and Presentation of Strong Rules. In: Piatetsky-Shapiro, G., Frawley, W.J. (eds.) Knowledge Discovery in Databases, pp. 229–248. AAAI/MIT Press (1991)
Gago, P., Bento, C.: A Metric for Selection of the Most Promising Rules. In: European Conference on the Principles of Data Mining and Knowledge Discovery PKDD 1998, pp. 19–27 (1998)
Zhong, N., Yao, Y.Y., Ohshima, M.: Peculiarity Oriented Multi-Database Mining. IEEE Transaction on Knowledge and Data Engineering 15(4), 952–960 (2003)
Witten, I.H., Frank, E.: DataMining: Practical Machine Learning Tools and Techniques with Java Implementations. Morgan Kaufmann, San Francisco (2000)
Quinlan, J.R.: Programs for Machine Learning. Morgan Kaufmann Publishers, San Francisco (1993)
Hinton, G.E.: Learning distributed representations of concepts. In: Proceedings of 8th Annual Conference of the Cognitive Science Society (1986)
Platt, J.: Fast Training of Support Vector Machines using Sequential Minimal Optimization. In: Schölkopf, B., Burges, C., Smola, A. (eds.) Advances in Kernel Methods - Support Vector Learning, pp. 185–208. MIT Press, Cambridge (1999)
Frank, E., Wang, Y., Inglis, S., Holmes, G., Witten, I.H.: Using model trees for classification. Machine Learning 32(1), 63–76 (1998)
R.C.H.: Very simple classification rules perform well on most commonly used datasets. Machine Learning 11, 63–91 (1993)
Mitchell, T.M.: Generalization as Search. Artificial Intelligence 18(2), 203–226 (1982)
Michalski, R., Mozetic, I., Hong, J., Lavrac, N.: The AQ15 Inductive Learning System: An Over View and Experiments, Reports of Machine Learning and Inference Laboratory, No.MLI-86-6, George Mason University (1986)
Booker, L.B., Holland, J.H., Goldberg, D.E.: Classifier Systems and Genetic Algorithms. Artificail Inteligence 40, 235–282 (1989)
Quinlan, J.R.: Induction of Decision Tree. Machine Learning 1, 81–106 (1986)
Frank, E., Witten, I.H.: Generating accurate rule sets without global optimization. In: Proc. of the Fifteenth International Conference on Machine Learning, pp. 144–151 (1998)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Abe, H., Tsumoto, S., Ohsaki, M., Yamaguchi, T. (2009). Evaluating Learning Algorithms Composed by a Constructive Meta-learning Scheme for a Rule Evaluation Support Method. In: Zighed, D.A., Tsumoto, S., Ras, Z.W., Hacid, H. (eds) Mining Complex Data. Studies in Computational Intelligence, vol 165. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-88067-7_6
Download citation
DOI: https://doi.org/10.1007/978-3-540-88067-7_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-88066-0
Online ISBN: 978-3-540-88067-7
eBook Packages: EngineeringEngineering (R0)