Universal Clustering with Family of Power Loss Functions in Probabilistic Space

Nikulin, Vladimir

doi:10.1007/11508069_41

Universal Clustering with Family of Power Loss Functions in Probabilistic Space

Vladimir Nikulin¹⁹

Conference paper

1307 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 3578))

Abstract

We propose universal clustering in line with the concepts of universal estimation. In order to illustrate the model of universal clustering we consider family of power loss functions in probabilistic space which is marginally linked to the Kullback-Leibler divergence. The model proved to be effective in application to the synthetic data. Also, we consider large web-traffic dataset. The aim of the experiment is to explain and understand the way people interact with web sites.

This work was supported by the grants of the Australian Research Council

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Nikulin, V., Smola, A.: Parametric model-based clustering. In: Dasarathy, B. (ed.) Data Mining, Intrusion Detection, Information Assurance, and Data Network Security, Orlando, Florida, USA, March 28-29. SPIE, vol. 5812, pp. 190–201 (2005)
Google Scholar
Dhillon, I., Mallela, S., Kumar, R.: Divisive information-theoretic feature clustering algorithm for text classification. Journal of Machine Learning Research 3, 1265–1287 (2003)
Article MATH Google Scholar
Cohn, D., Hofmann, T.: The missing link - a probabilistic model of document content and hypertext connectivity. In: 13th Conference on Neural Information Processing Systems (2001)
Google Scholar
Hwang, J.T.: Universal domination and stochastic domination: Estimation simultaneously under a broad class of loss functions. The Annals of Statistics 13, 295–314 (1985)
Article MATH MathSciNet Google Scholar
Rukhin, A.: Universal Bayes estimators. The Annals of Statistics 6, 1345–1351 (1978)
Article MATH MathSciNet Google Scholar
Hamerly, G., Elkan, C.: Learning the k in k-means. In: 16th Conference on Neural Information Processing Systems (2003)
Google Scholar
Zhong, S., Ghosh, J.: A unified framework for model-based clustering. Journal of Machine Learning Research 4, 1001–1037 (2003)
Article MathSciNet Google Scholar
Akaike, H.: On the likelihood of a time series model. The Statistician 27, 217–235 (1978)
Article MathSciNet Google Scholar
Schwarz, G.: Estimating the dimension of a model. The Annals of Statistics 6, 461–464 (1978)
Article MATH MathSciNet Google Scholar
Fraley, C., Raftery, A.: How Many Clusters? Which Clustering Method? Answers via Model-based Cluster Analysis. The Computer Journal 41, 578–588 (1998)
Article MATH Google Scholar
Pollard, D.: Strong consistency of k-means clustering. The Annals of Statistics 10, 135–140 (1981)
Article MathSciNet Google Scholar
Bhatia, S.: Adaptive k-means clustering. FLAIRS, 695–699 (2004)
Google Scholar
Amari, S., Nagaoka, H.: Methods of Information Geometry. Oxford University Press, Oxford (1993)
Google Scholar
Cadez, I., Heckerman, D., Meek, C., Smyth, P., White, S.: Model-based clustering and visualization of navigation patterns on a web site. Data Mining and Knowledge Discovery 7, 399–424 (2003)
Article MathSciNet Google Scholar
Msnbc: msnbc.com anonymous web data. In: UCI Knowledge Discovery in Databases Archive (1999), http://kdd.ics.uci.edu/summary.data.type.html

Download references

Author information

Authors and Affiliations

Computer Science Laboratory, Australian National University, Canberra, ACT 0200, Australia
Vladimir Nikulin

Authors

Vladimir Nikulin
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

School of Information Technology and Electrical Engineering, University of Queensland, 4072, Australia
Marcus Gallagher
, POB 30031, FL 32503-1031, Pensacola
James P. Hogan
Faculty of Information Technology, Queensland University of Technology, Box 2434, Q 4001, Brisbane, Australia
Frederic Maire

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Nikulin, V. (2005). Universal Clustering with Family of Power Loss Functions in Probabilistic Space. In: Gallagher, M., Hogan, J.P., Maire, F. (eds) Intelligent Data Engineering and Automated Learning - IDEAL 2005. IDEAL 2005. Lecture Notes in Computer Science, vol 3578. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11508069_41

Download citation

DOI: https://doi.org/10.1007/11508069_41
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-26972-4
Online ISBN: 978-3-540-31693-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics