Abstract
A new method is proposed for selection of the optimal number of components of a mixture model for pattern classification. We approximate a class-conditional density by a mixture of Gaussian components. We estimate the parameters of the mixture components by the EM (Expectation Maximization) algorithm and select the optimal number of components on the basis of the MDL (Minimum Description Length) principle. We evaluate the goodness of an estimated model in a trade-off between the number of the misclassified training samples and the complexity of the model.
Chapter PDF
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Dempster, A. P., Laird, N. M., Rubin, D. B.: Maximum Likelihood from Incomplete Data via the EM Algorithm. J. Royal Stat. Soc. Series B 39 (1977) 1–38
McLachlan, G. J., Basford, K. E.: Mixture Models. Marcel Dekker, Inc., New York (1988) 21–29
Ichimura, N.: Robust Clustering Based on a Maximum Likelihood Method for Estimation of the Suitable Number of Clusters. Trans. Inst. Electron. Info. Comm. Eng. D-II J78 8 (1995) 1184–1195 (in Japanese)
Rissanen, J.: A Universal Prior for Integers and Estimation by Minimum Description Length. Ann. Stat. 11 (1983) 416–431
Bezdek, J. C.: Cluster Validity with Fuzzy Sets. J. Cybern. 3 3 (1974) 58–73
Kudo, M., Shimbo, M.: Selection of Classifiers Based on the MDL Principle Using the VC Dimension. Proc. ICPR '96 (1996) 886–890
Hayamizu, S. et al.: Generation of VCV/CVC Balanced Word Sets for Speech Database. Bullet. Electro. Lab. 49 10 (1985) 803–834
Park, Y., Sklansky, J.: Automated Design of Multiple-Class Piecewise Linear Classifiers. J. Classification 6 (1989) 195–222
Pudil, P., Novoičová, J., Kittle, J.: Floating Search Methods in Feature Selection. Pattern Recogn. Letters 15 (1994) 1119–1125
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 1998 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Tenmoto, H., Kudo, M., Shimbo, M. (1998). MDL-based selection of the number of components in mixture models for pattern classification. In: Amin, A., Dori, D., Pudil, P., Freeman, H. (eds) Advances in Pattern Recognition. SSPR /SPR 1998. Lecture Notes in Computer Science, vol 1451. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0033308
Download citation
DOI: https://doi.org/10.1007/BFb0033308
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-64858-1
Online ISBN: 978-3-540-68526-5
eBook Packages: Springer Book Archive