A Novel Clustering Algorithm Based on a Non-parametric “Anti-Bayesian” Paradigm

Hammer, Hugo Lewi; Yazidi, Anis; Oommen, B. John

doi:10.1007/978-3-319-19066-2_52

Hugo Lewi Hammer⁹,
Anis Yazidi⁹ &
B. John Oommen¹⁰

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9101))

Included in the following conference series:

International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems

2768 Accesses

Abstract

The problem of clustering, or unsupervised classification, has been solved by a myriad of techniques, all of which depend, either directly or implicitly, on the Bayesian principle of optimal classification. To be more specific, within a Bayesian paradigm, if one is to compare the testing sample with only a single point in the feature space from each class, the optimal Bayesian strategy would be to achieve this based on the distance from the corresponding means or central points in the respective distributions. When this principle is applied in clustering, one would assign an unassigned sample into the cluster whose mean is the closest, and this can be done in either a bottom-up or a top-down manner. This paper pioneers a clustering achieved in an “Anti-Bayesian” manner, and is based on the breakthrough classification paradigm pioneered by Oommen et al. The latter relies on a radically different approach for classifying data points based on the non-central quantiles of the distributions. Surprisingly and counter-intuitively, this turns out to work equally or close-to-equally well to an optimal supervised Bayesian scheme, which thus begs the natural extension to the unexplored arena of clustering. Our algorithm can be seen as the Anti-Bayesian counter-part of the well-known $k$-means algorithm (The fundamental Anti-Bayesian paradigm need not just be used to the $k$-means principle. Rather, we hypothesize that it can be adapted to any of the scores of techniques that is indirectly based on the Bayesian paradigm.), where we assign points to clusters using quantiles rather than the clusters’ centroids. Extensive experimentation (This paper contains the prima facie results of experiments done on one and two-dimensional data. The extensions to multi-dimensional data are not included in the interest of space, and would use the corresponding multi-dimensional Anti-Naïve-Bayes classification rules given in [1].) demonstrates that our Anti-Bayesian clustering converges fast and with precision results competitive to a $k$-means clustering.

John Oommen—Chancellor’s Professor; Fellow: IEEE and Fellow: IAPR. This author is also an Adjunct Professor with the University of Agder in Grimstad, Norway.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Machine Learning-Based Clustering Analysis: Foundational Concepts, Methods, and Applications

Predictive K-means with Local Models

Model-Based Clustering

Article Open access 01 October 2016

References

Thomas, A., Oommen, B.J.: Order statistics-based parametric classification for multi-dimensional distributions. Pattern Recognition 46(12), 3472–3482 (2013)
Article Google Scholar
Jain, A.K., Dubes, R.C.: Algorithms for Clustering Dats. Prentice Hall, Englewood Cliffs (1988)
Google Scholar
Xu, R., Wunsch II, D.: Survey of clustering algorithms. Trans. Neur. Netw. 16(3), 645–678 (2005)
Article Google Scholar
Ankerst, M., Breunig, M.M., Peter Kriegel, H., Sander, J.: Optics: ordering points to identify the clustering structure, pp. 49–60. ACM Press (1999)
Google Scholar
Ester, M., Peter Kriegel, H., Sander, J., Xu, X.: A density-based algorithm for discovering clusters in large spatial databases with noise, pp. 226–231. AAAI Press (1996)
Google Scholar
Murtagh, F., Contreras, P.: Methods of hierarchical clustering. CoRR abs/1105.0121 (2011)
Google Scholar
Sibson, R.: SLINK: An optimally efficient algorithm for the single-link cluster method. The Computer Journal 16(1), 30–34 (1973)
Article MathSciNet Google Scholar
Thomas, A., Oommen, B.J.: The fundamental theory of optimal “anti-bayesian” parametric pattern classification using order statistics criteria. Pattern Recognition 46(1), 376–388 (2013)
Article MATH Google Scholar
Oommen, B.J., Thomas, A.: Anti-Bayesian parametric pattern classification using order statistics criteria for some members of the exponential family. Pattern Recognition 47(1), 40–55 (2014)
Article Google Scholar
Hyndman, R.J., Fan, Y.: Sample quantiles in statistical packages. American Statistician 50, 361–365 (1996)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, Oslo and Akershus University College of Applied Sciences, Oslo, Norway
Hugo Lewi Hammer & Anis Yazidi
School of Computer Science, Carleton University, Ottawa, Canada
B. John Oommen

Authors

Hugo Lewi Hammer
View author publications
You can also search for this author in PubMed Google Scholar
Anis Yazidi
View author publications
You can also search for this author in PubMed Google Scholar
B. John Oommen
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Hugo Lewi Hammer .

Editor information

Editors and Affiliations

Texas State University, San Marcos, Texas, USA
Moonis Ali
Dongguk University, Seoul, Korea, Republic of (South Korea)
Young Sig Kwon
Dongguk University, Seoul, Korea, Republic of (South Korea)
Chang-Hwan Lee
Dongguk University, Seoul, Korea, Republic of (South Korea)
Juntae Kim
Seoul National University, Seoul, Korea, Republic of (South Korea)
Yongdai Kim

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Hammer, H.L., Yazidi, A., Oommen, B.J. (2015). A Novel Clustering Algorithm Based on a Non-parametric “Anti-Bayesian” Paradigm. In: Ali, M., Kwon, Y., Lee, CH., Kim, J., Kim, Y. (eds) Current Approaches in Applied Artificial Intelligence. IEA/AIE 2015. Lecture Notes in Computer Science(), vol 9101. Springer, Cham. https://doi.org/10.1007/978-3-319-19066-2_52

Download citation

DOI: https://doi.org/10.1007/978-3-319-19066-2_52
Published: 01 May 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-19065-5
Online ISBN: 978-3-319-19066-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics