Abstract
This paper deals with the unsupervised classification of univariate observations. Given a set of observations drawn from a K-component mixture, we focus on estimating the component expectations. We propose an algorithm based on the minimization of the “K-product” (KP) criterion introduced in a previous work. We show that the global minimum of this criterion can be reached by first solving a linear system and then computing the roots of a polynomial of order K. The KP global minimum provides a first raw estimate of the component expectations, and a nearest-neighbour classification then refines this estimate. The relevance of our method is finally illustrated through simulations of various mixtures. When the mixture components do not strongly overlap, the KP algorithm provides better estimates than the Expectation-Maximization (EM) algorithm.
References
Berkhin P (2006) A survey of clustering data mining techniques. In: Kogan J, Nicholas C, Teboulle M (eds) Grouping multidimensional data: recent advances in clustering. Springer, Berlin, pp 25–71
Bradley PS, Fayyad UM (1998) Refining initial points for K-means clustering. In: Proceedings of the 15th international conference on machine learning. Morgan Kaufmann, San Francisco, pp 91–99
Bojanczyk AW, Brent RP, de Hoog FR (1995) Stability analysis of a general Toeplitz system solver. Numer Algorithms 10:225–244
Celeux G, Chauveau D, Diebolt J (1995) On stochastic versions of the EM algorithm. INRIA research report no 2514, available at http://www.inria.fr/rrrt/rr-2514.html
Dempster A, Laird N, Rubin D (1977) Maximum likelihood from incomplete data via the EM algorithm. J R Stat Soc B 39:1–38
Fisher WD (1958) On grouping for maximum homogeneity. J Am Stat Assoc 53(284):789–798
Fitzgibbon LJ, Allison L, Dowe DL (2000) Minimum message length grouping of ordered data. In: Arimura H, Jain S (eds) Proceedings of the 11th international conference on algorithmic learning theory, Sydney, Australia. LNAI, Springer, Berlin, pp 56–70
Hartigan J, Wong M (1979) A K-means clustering algorithm. Appl Stat 28:100–108
Krishna K, Narasimha Murty M (1999) Genetic K-means algorithm. IEEE Trans Syst Man Cybern B Cybern 29(3):433–439
Lindsay B, Furman D (1994) Measuring the relative effectiveness of moment estimators as starting values in maximizing likelihoods. Comput Stat Data Anal 17(5):493–507
McLachlan G, Peel D (2000) Finite mixture models. Wiley, New York
Parzen E (1962) On estimation of a probability density function and mode. Ann Math Stat 33:1065–1076
Paul N, Terre M, Fety L (2006) The K-product criterion for Gaussian mixture estimation. In: Proceedings of the 7th Nordic signal processing symposium, Reykjavik, Iceland, pp 334–337. doi:10.1109/NORSIG.2006.275248
Pernkopf F, Bouchaffra D (2005) Genetic-based EM algorithm for learning Gaussian mixture models. IEEE Trans Pattern Anal Mach Intell 27(8):1344–1348
Uhlig F (1999) General polynomial roots and their multiplicities in O(n) memory and O(n²) time. Linear Multilinear Algebra 46(4):327–359
Xu R, Wunsch D II (2005) Survey of clustering algorithms. IEEE Trans Neural Netw 16(3):645–678
Paul, N., Terre, M. & Fety, L. A global algorithm to estimate the expectations of the components of an observed univariate mixture. ADAC 1, 201–219 (2007). https://doi.org/10.1007/s11634-007-0014-z