Abstract
We describe a new data-mining platform, CDMS, aimed at the streamlined development, comparison and application of machine learning tools. We discuss its type system, focussing on the treatment of statistical models as first-class values.
This allows rapid construction of composite models – complex models built from simpler ones – such as mixture models, Bayesian networks and decision trees. We illustrate this with a flexible decision tree tool for CDMS which rather than being limited to discrete target attributes, can model any kind of data using arbitrary probability distributions.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Allison, L.: Types and classes of machine learning and data mining. In: Proceedings of the Twenty-Sixth Australasian Computer Science Conference (ACSC 2003), Adelaide, South Australia, pp. 207–215 (2003)
Blake, C., Merz, C.: UCI repository of machine learning databases (1998), http://www.ics.uci.edu/~mlearn/MLRepository.html
Breiman, L., Friedman, J.H., Olshen, R.A., Stone, C.J.: Classification and Regression Trees. Statistics/Probability Series. Wadsworth Publishing Company, Belmont (1984)
Fitzgibbon, L., Allison, L., Comley, J.: Probability model type sufficiency. In: Proc. 4th International Conference on Intelligent Data Engineering and Automated Learning (2003)
Ihaka, R., Gentleman, R.: R: A language for data analysis and graphics. Journal of Computational and Graphical Statistics 5(3), 299–314 (1996)
Quinlan, J.R.: C5.0., http://www.rulequest.com
Venables, W.N., Ripley, B.D.: Modern Applied Statistics with S-PLUS, 3rd edn. Springer, Heidelberg (1999)
Wallace, C.S., Boulton, D.M.: An information measure for classification. Computer Journal 11, 185–194 (1968)
Wallace, C.S., Boulton, D.M.: An invariant Bayes method for point estimation. Classification Society Bulletin 3(3), 11–34 (1975)
Wallace, C.S., Freeman, P.R.: Estimation and inference by compact coding. J. Royal Statistical Society (Series B) 49, 240–252 (1987)
Wallace, C.S., Patrick, J.D.: Coding decision trees. Machine Learning 11, 7–22 (1993)
Witten, I.H., Frank, E.: Nuts and bolts: Machine learning algorithms in Java. Morgan Kaufmann, San Francisco (1999)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Comley, J.W., Allison, L., Fitzgibbon, L.J. (2003). Flexible Decision Trees in a General Data-Mining Environment. In: Liu, J., Cheung, Ym., Yin, H. (eds) Intelligent Data Engineering and Automated Learning. IDEAL 2003. Lecture Notes in Computer Science, vol 2690. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-45080-1_102
Download citation
DOI: https://doi.org/10.1007/978-3-540-45080-1_102
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40550-4
Online ISBN: 978-3-540-45080-1
eBook Packages: Springer Book Archive