Abstract
Cohen [3] introduced Grow, a rule set improvement method that is used in classifier learning in much the same way as standard reduced error pruning methods, but is based on “reduced error rule set re-growth”. Here we follow Cohen's suggestion that order-of-magnitude analysis of the time complexity of such reduced error methods on random data provides insight into their behaviour on noisy real data sets. We consider the growth of the rule sets that these methods produce for such data, and suggest that the size of the final rule set is roughly of order n for n training items, whereas Cohen assumed it was roughly constant. This leads to increased estimates of the relevant time complexities. We propose a simple improvement to the implementation that reduces the order of the time complexities by a factor of about n. We give experimental results in support of our rough order-of-magnitude claims.
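A rough order-of-magnitude claim of this kind is commonly checked by fitting a straight line to log(cost) against log(n): the slope approximates the exponent of n. The sketch below illustrates that general technique only; it is not the paper's experimental procedure. The function name `growth_exponent` is hypothetical, and the numbers are synthetic values generated from known formulas, not measurements from the paper.

```python
import math


def growth_exponent(sizes, costs):
    """Return the least-squares slope of log(cost) against log(size).

    For costs that grow roughly like c * n**k, the slope approximates k.
    """
    xs = [math.log(n) for n in sizes]
    ys = [math.log(c) for c in costs]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    return sxy / sxx


# Synthetic illustration only: these values come from formulas, not experiments.
sizes = [100, 200, 400, 800, 1600]
linear_costs = [3.0 * n for n in sizes]          # e.g. a quantity of order n
quadratic_costs = [0.5 * n ** 2 for n in sizes]  # e.g. a quantity of order n^2

linear_slope = growth_exponent(sizes, linear_costs)
quadratic_slope = growth_exponent(sizes, quadratic_costs)
```

With real measurements, the fitted slope is only a rough guide, since constant factors and lower-order terms distort it at small n.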
References
1. Brunk, C.A., Pazzani, M.J.: An investigation of noise-tolerant relational concept learning algorithms. Proceedings of the Eighth International Workshop of Machine Learning (1991) 389–393
2. Cameron-Jones, R.M., Quinlan, J.R.: Efficient top-down induction of logic programs. SIGART 5 (1994) 33–42
3. Cohen, W.W.: Efficient pruning methods for separate-and-conquer rule learning systems. Proceedings of the Thirteenth International Joint Conference on Artificial Intelligence (1993) 988–994
4. Cohen, W.W.: Fast effective rule induction. Proceedings of the Twelfth International Conference on Machine Learning (ML95) (1995) 115–123
5. Cormen, T.H., Leiserson, C.E., Rivest, R.L.: Introduction to Algorithms. MIT Press (1990)
6. Fürnkranz, J., Widmer, G.: Incremental reduced error pruning. Proceedings of the Eleventh International Conference on Machine Learning (ML94) (1994) 70–77
7. Pagallo, G., Haussler, D.: Boolean feature discovery in empirical learning. Machine Learning 5 (1990) 71–99
8. Quinlan, J.R.: Simplifying decision trees. International Journal of Man-Machine Studies 27 (1987) 221–234
Copyright information
© 1996 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Cameron-Jones, M. (1996). The complexity of batch approaches to reduced error rule set induction. In: Foo, N., Goebel, R. (eds) PRICAI'96: Topics in Artificial Intelligence. PRICAI 1996. Lecture Notes in Computer Science, vol 1114. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-61532-6_30
DOI: https://doi.org/10.1007/3-540-61532-6_30
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-61532-3
Online ISBN: 978-3-540-68729-0
eBook Packages: Springer Book Archive