Abstract
The covering algorithm is ubiquitous in the induction of classification rules. This approach to machine learning uses heuristic search to find a minimum number of rules that adequately explain the data. However, recent research has provided evidence that learning redundant classifiers can increase predictive accuracy. Learning all possible classifiers is a plausible ultimate form of this notion of redundant classifiers. This paper presents an algorithm that in effect learns all classifiers. A preliminary investigation (Webb, 1996b) suggested that a heuristic covering algorithm in general learns classification rules with higher predictive accuracy than this new approach. In this paper we present an extensive empirical comparison between the learning-all-rules algorithm and three varied, established approaches to inductive learning: a covering algorithm, an instance-based learner, and a decision-tree learner. The empirical evaluation provides strong evidence in support of learning all rules as a plausible approach to inductive learning.
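The covering approach contrasted above can be sketched as follows. This is a minimal illustrative sequential-covering loop: the toy data representation (dictionaries of attribute values) and the deliberately naive single-test rule heuristic are assumptions for exposition only, not the algorithms compared in the paper.

```python
# A minimal sketch of sequential covering for rule induction.
# Illustrative only: the rule-scoring heuristic and the data
# representation are assumptions, not the paper's algorithms.

def covers(rule, example):
    """A rule is a dict mapping attribute -> required value."""
    return all(example.get(attr) == val for attr, val in rule.items())

def best_rule(examples, target_class, attributes):
    """Greedily pick the single attribute-value test that covers the
    most examples of target_class while covering none of the others.
    Returns None if no such pure single-test rule exists."""
    best, best_count = None, 0
    for attr in attributes:
        for val in {e[attr] for e in examples}:
            rule = {attr: val}
            pos = sum(1 for e in examples
                      if covers(rule, e) and e["class"] == target_class)
            neg = sum(1 for e in examples
                      if covers(rule, e) and e["class"] != target_class)
            if neg == 0 and pos > best_count:
                best, best_count = rule, pos
    return best

def sequential_cover(examples, target_class, attributes):
    """Learn rules one at a time, removing covered examples after
    each round, until the target class is fully covered."""
    rules, remaining = [], list(examples)
    while any(e["class"] == target_class for e in remaining):
        rule = best_rule(remaining, target_class, attributes)
        if rule is None:
            break
        rules.append(rule)
        remaining = [e for e in remaining if not covers(rule, e)]
    return rules
```

Because each iteration discards the examples the new rule covers, the loop drives toward a small rule set; a learning-all-rules approach instead retains every acceptable rule rather than stopping at the first covering set.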
References
Aha, D.W. (1990). A Study of Instance-Based Algorithms for Supervised Learning Tasks. PhD Thesis, Department of Information and Computer Science, University of California, Irvine, Technical Report 90-42.
Aha, D. W. (1997). Editorial on Lazy Learning. Artificial Intelligence Review, 11: 7–10.
Aha, D. W., Kibler, D., and Albert, M. (1991). Instance-based learning algorithms. Machine Learning, 6: 37–66.
Ali, K., Brunk, C., and Pazzani, M. (1994). On learning multiple descriptions of a concept. In Proceedings of Tools with Artificial Intelligence. New Orleans, LA.
Breiman, L. (1996). Bagging predictors. Machine Learning, 24: 123–140.
Clark, P. and Niblett, T. (1989). The CN2 induction algorithm. Machine Learning, 3: 261–283.
Clark, P. and Boswell, R. (1991). Rule induction with CN2: Some recent improvements. In Proceedings of the Fifth European Working Session on Learning, pp. 151–163.
Dietterich, T. G. and Bakiri, G. (1994). Solving multiclass learning problems via error-correcting output codes. Journal of Artificial Intelligence Research, 2: 263–286.
Domingos, P. (1995). Rule induction and instance-based learning: A unified approach. In Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence, Montreal, Morgan Kaufmann, pp. 1226–1232.
Fayyad, U. M. and Irani, K. B. (1993). Multi-interval discretization of continuous-valued attributes for classification learning. In Proceedings of the Thirteenth International Joint Conference on Artificial Intelligence, Morgan Kaufmann, pp. 1022–1027.
Fayyad, U. M., Piatetsky-Shapiro, G., Smyth, P., and Uthurusamy, R. (1996). Advances in Knowledge Discovery and Data Mining. MIT Press, Menlo Park, CA.
Fix, E. and Hodges, J. L. (1952). Discriminatory analysis — Nonparametric discrimination: Consistency properties. From Project 21-49-004, Report Number 4, USAF School of Aviation Medicine, Randolph Field, Texas, pp. 261–279.
Friedman, J. H., Kohavi, R., and Yun, Y. (1996). Lazy decision trees. In Proceedings of the Thirteenth National Conference on Artificial Intelligence. AAAI Press, Portland, OR, pp. 717–724.
Kwok, S. W. and Carter, C. (1990). Multiple decision trees. In Shachter, R. D. and Levitt, T. S. and Kanal, L. N. and Lemmer, J. F. (Eds.) Uncertainty in Artificial Intelligence 4. North Holland, Amsterdam, pp. 327–335.
Michalski, R. S. (1984). A theory and methodology of inductive learning. In Michalski, R. S., Carbonell, J. G., and Mitchell, T. M. (Eds.) Machine Learning: An Artificial Intelligence Approach. Springer-Verlag, Berlin, pp. 83–129.
Merz, C. J. and Murphy, P. M. (1997). UCI Repository of machine learning databases [http://www.ics.uci.edu/~mlearn/MLRepository.html]. Irvine, CA: University of California, Department of Information and Computer Science.
Muggleton, S. and Feng, C. (1990). Efficient induction of logic programs. In Proceedings of the First Conference on Algorithmic Learning Theory, Tokyo.
Nock, R. and Gascuel, O. (1995). On learning decision committees. In Proceedings of the Twelfth International Conference on Machine Learning, Morgan Kaufmann, Tahoe City, CA, pp. 413–420.
Oliver, J. J. and Hand, D. J. (1995). On pruning and averaging decision trees. In Proceedings of the Twelfth International Conference on Machine Learning, Morgan Kaufmann, Tahoe City, CA, pp. 430–437.
Quinlan, J. R. (1990). Learning logical definitions from relations. Machine Learning, 5: 239–266.
Quinlan, J.R. (1993). C4.5: Programs for Machine Learning. Morgan Kaufmann, San Mateo, CA.
Rissanen, J. (1989). Stochastic Complexity in Statistical Inquiry. World Scientific, Singapore.
Schapire, R. E. (1990). The strength of weak learnability. Machine Learning, 5: 197–227.
Ting, K. M. (1995). Common Issues in Instance-based and Naive Bayesian Classifiers. PhD thesis, Basser Department of Computer Science, University of Sydney.
Webb, G. I. (1993). Systematic search for categorical attribute-value data-driven machine learning. In AI'93 — Proceedings of the Sixth Australian Joint Conference on Artificial Intelligence, World Scientific, Melbourne, pp. 342–347.
Webb, G.I. (1995). An efficient admissible algorithm for unordered search. Journal of Artificial Intelligence Research, 3: 431–465.
Webb, G. I. (1996a). Further experimental evidence against the utility of Occam's razor. Journal of Artificial Intelligence Research, 4: 397–417.
Webb, G. I. (1996b). A heuristic covering algorithm has higher predictive accuracy than learning all rules. In Proceedings of Information, Statistics and Induction in Science, Melbourne, pp. 20–30.
Wogulis, J. and Langley, P. (1989). Improving efficiency by learning intermediate concepts. In Proceedings of the Eleventh International Joint Conference on Artificial Intelligence, Morgan Kaufmann, San Mateo, CA, pp. 657–662.
© 1998 Springer-Verlag Berlin Heidelberg
Viswanathan, M., Webb, G.I. (1998). Classification learning using all rules. In: Nédellec, C., Rouveirol, C. (eds) Machine Learning: ECML-98. ECML 1998. Lecture Notes in Computer Science, vol 1398. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0026685
Print ISBN: 978-3-540-64417-0
Online ISBN: 978-3-540-69781-7