Abstract
This paper deals with a machine learning task, namely probability density estimation, in the case data is composed of subsets hosted on nodes of a distributed system. Focusing on mixture models and assuming a set of local probability distribution estimates, we demonstrate how it is possible to combining local estimates in a dynamic, robust and decentralized fashion, through gossiping a global probabilistic model over the data set. Experiments are reported to illustrate the proposal.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Bishop, C.M.: Pattern Recognition and Machine Learning. Springer, New York (2006)
Kermarrec, A.-M., van Steen, M.: Gossiping in distributed systems. Operating Systems Review 41(5), 2–7 (2007)
Gu, D.: Distributed EM algorithm for Gaussian Mixtures in Sensor Networks. IEEE Transactions on Neural Networks 19(7), 1154–1166 (2008)
Kowalczyk, W., Vlassis, N.A.: Newscast EM. In: NIPS, pp. 713–720. MIT Press, Cambridge (2004)
Chandola, V., Banerjee, A., Kumar, V.: Anomaly detection: A survey. ACM Comput. Surv. 41(3), 1–58 (2009)
Rousseeuw, P.J., Driessen, K.V.: A fast algorithm for the minimum covariance determinant estimator. Technometrics 41(3), 212–223 (1999)
Jelasity, M., Voulgaris, S., Guerraoui, R., Kermarrec, A.-M., van Steen, M.: Gossip-based peer sampling. ACM Trans. Comput. Syst. 25(3), 8 (2007)
Goldberger, J., Roweis, S.T.: Hierarchical clustering of a mixture model. In: NIPS, pp. 505–512. MIT Press, Cambridge (2004)
Hershey, J.R., Olsen, P.A.: Approximating the Kullback Leibler divergence between Gaussian mixture models. In: IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2007, vol. 4, pp. IV-317–IV-320 (2007)
Arthur, D., Vassilvitskii, S.: k-means++: the advantages of careful seeding. In: SODA 2007: Proceedings of the Eighteenth Annual ACM-SIAM Symposium on Discrete Algorithms, pp. 1027–1035. Society for Industrial and Applied Mathematics, PA (2007)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
El Attar, A., Pigeau, A., Gelgon, M. (2011). A Decentralized Technique for Robust Probabilistic Mixture Modelling of a Distributed Data Set. In: Brazier, F.M.T., Nieuwenhuis, K., Pavlin, G., Warnier, M., Badica, C. (eds) Intelligent Distributed Computing V. Studies in Computational Intelligence, vol 382. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-24013-3_29
Download citation
DOI: https://doi.org/10.1007/978-3-642-24013-3_29
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-24012-6
Online ISBN: 978-3-642-24013-3
eBook Packages: EngineeringEngineering (R0)