Abstract
A myriad of works has been published for achieving data clustering based on the Bayesian paradigm, where the clustering sometimes resorts to Naïve-Bayes decisions. Within the domain of clustering, the Bayesian principle corresponds to assigning the unlabelled samples to the cluster whose mean (or centroid) is the closest. Recently, Oommen and his co-authors have proposed a novel, counter-intuitive and pioneering PR scheme that is radically opposed to the Bayesian principle. The rational for this paradigm, referred to as the “Anti-Bayesian” (AB) paradigm, involves classification based on the non-central quantiles of the distributions. The first-reported work to achieve clustering using the AB paradigm was in [1], where we proposed a flat clustering method which assigned unlabelled points to clusters based on the AB paradigm, and where the distances to the respective learned clusters was based on their quantiles rather than the clusters’ centroids for uni-dimensional and two-dimensional data. This paper, extends the results of [1] in many directions. Firstly, we generalize our previous AB clustering [1], initially proposed for handling uni-dimensional and two-dimensional spaces, to arbitrary d-dimensional spaces using their so-called “quantiloids”. Secondly, we extend the AB paradigm to consider how the clustering can be achieved in hierarchical ways, where we analyze both the Top-Down and the Bottom-Up clustering options. Extensive experimentation demonstrates that our clustering achieves results competitive to the state-of-the-art flat, Top-Down and Bottom-Up clustering approaches, demonstrating the power of the AB paradigm.
B.J. Oommen—Chancellor’s Professor; Fellow: IEEE and Fellow: IAPR. This author is also an Adjunct Professor with the University of Agder in Grimstad, Norway.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
- 1.
With \(\rho _{\mathrm {max}} = 0.8\) and for \(d=3\) the matrix were almost always positive definite on the very first attempt. For \(d=5\), on the average, about every third matrix that was generated was positive definite.
References
Hammer, H.L., Yazidi, A., Oommen, B.J.: A novel clustering algorithm based on a non-parametric “Anti-Bayesian” paradigm. In: Ali, M., Kwon, Y.S., Lee, C.-H., Kim, J., Kim, Y. (eds.) IEA/AIE 2015. LNCS, vol. 9101, pp. 536–545. Springer, Heidelberg (2015)
Jain, A.K., Dubes, R.C.: Algorithms for Clustering Dats. Prentice Hall, Englewood Cliffs (1988)
Xu, R., Wunsch, D.: Survey of clustering algorithms. Trans. Neur. Netw. 16(3), 645–678 (2005)
Ankerst, M., Breunig, M.M., Kriegel, H.P., Sander, J.: Optics: ordering points to identify the clustering structure, pp. 49–60. ACM Press (1999)
Ester, M., Kriegel, H.P., S, J., Xu, X.: A density-based algorithm for discovering clusters in large spatialdatabases with noise, pp. 226–231. AAAI Press (1996)
Murtagh, F., Contreras, P.: Methods of hierarchical clustering. CoRR abs/1105.0121 (2011)
Sibson, R.: SLINK: an optimally efficient algorithm for the single-link cluster method. Comput. J. 16(1), 30–34 (1973)
Thomas, A., Oommen, B.J.: The fundamental theory of optimal “Anti-Bayesian” parametric pattern classification using order statistics criteria. Pattern Recogn. 46(1), 376–388 (2013)
Thomas, A., Oommen, B.J.: Order statistics-based parametric classification for multi-dimensional distributions. Pattern Recogn. 46(12), 3472–3482 (2013)
Oommen, B.J., Thomas, A.: Anti-Bayesian parametric pattern classification using order statistics criteria for some members of the exponential family. Pattern Recogn. 47(1), 40–55 (2014)
Jain, A.K., Murty, M.N., Flynn, P.J.: Data clustering: a review. ACM Comput. Surv. (CSUR) 31(3), 264–323 (1999)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Yazidi, A., Hammer, H.L., Oommen, B.J. (2016). “Anti-Bayesian” Flat and Hierarchical Clustering Using Symmetric Quantiloids. In: Fujita, H., Ali, M., Selamat, A., Sasaki, J., Kurematsu, M. (eds) Trends in Applied Knowledge-Based Systems and Data Science. IEA/AIE 2016. Lecture Notes in Computer Science(), vol 9799. Springer, Cham. https://doi.org/10.1007/978-3-319-42007-3_6
Download citation
DOI: https://doi.org/10.1007/978-3-319-42007-3_6
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-42006-6
Online ISBN: 978-3-319-42007-3
eBook Packages: Computer ScienceComputer Science (R0)