Abstract
Identifiability problems can be encountered when fitting finite mixture models and their presence should be investigated by model diagnostics. In this paper we propose diagnostic tools to check for identifiability problems based on the fact that they induce multiple (global) modes in the distribution of the parameterizations of the maximum likelihood models depending on the data generating process. The parametric bootstrap is used to approximate this distribution. In order to investigate the presence of multiple (global) modes the congruence between the results of information-based methods based on asymptotic theory and those derived using the models fitted to the bootstrap samples with initalization in the solution as well as random initialization is assessed. The methods are illustrated using a finite mixture of Gaussian regression models on data from a study on spread of viral infection.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
BARINGHAUS, L. and FRANZ, C. (2004): On a New Multivariate Two-sample Test. Journal of Multivariate Analysis, 88, 190–206.
BASFORD, K.E., GREENWAY, D.R., MCLACHLAN, G.J. and PEEL, D. (1997): Standard Errors of Fitted Means Under Normal Mixture Model. Computational Statistics, 12, 1–17.
DEMPSTER, A.P., LAIRD, N.M. and RUBIN, D.B. (1977): Maximum Likelihood from Incomplete Data Via the EM Algorithm. Journal of the Royal Statistical Society B, 39, 1–38.
GRÜN, B. and LEISCH, F. (2004): Bootstrapping Finite Mixture Models. In: J. Antoch (Ed.): Compstat 2004 — Proceedings in Computational Statistics. Physica, Heidelberg, 1115–1122.
HENNIG, C. (2000): Identifiability of Models for Clusterwise Linear Regression. Journal of Classification, 17,2, 273–296.
HOTHORN, T., LEISCH, F., ZEILEIS, A. and HORNIK, K. (2005): The Design and Analysis of Benchmark Experiments. Journal of Computational and Graphical Statistics, 14,3, 1–25.
LOUIS, T.A. (1982): Finding the Observed Information Matrix When Using the EM Algorithm. Journal of the Royal Statistical Society. Series B (Methodological), 44,2, 226–233.
MCLACHLAN, G.J. and PEEL, D. (2000): Finite Mixture Models. Wiley.
MINNOTTE, M. C. (1997): Nonparameteric Testing of the Existence of Modes. The Annals of Statistics, 25,4, 1646–1660.
STEPHENS, M. (2000): Dealing with Label Switching in Mixture Models. Journal of the Royal Statistical Society B, 62,4, 795–809.
TEICHER, H. (1963): Identifiability of Finite Mixtures. The Annals of Mathematical Statistics, 34, 1265–1269.
TURNER, T.R. (2000): Estimating the Propagation Rate of a Viral Infection of Potato Plants Via Mixtures of Regressions. Journal of the Royal Statistical Society C, 49,3, 371–384.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Grün, B., Leisch, F. (2007). Testing for Genuine Multimodality in Finite Mixture Models: Application to Linear Regression Models. In: Decker, R., Lenz, H.J. (eds) Advances in Data Analysis. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-70981-7_24
Download citation
DOI: https://doi.org/10.1007/978-3-540-70981-7_24
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-70980-0
Online ISBN: 978-3-540-70981-7
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)