Abstract
Spectral clustering has become very popular in recent years, due to the simplicity of its implementation and good performance in clustering non-convex data. Constructing a similarity graph based on an appropriate distance measure for modeling the local neighborhood relations among data samples is crucial for achieving an acceptable performance in spectral clustering. In this paper, we propose a fuzzy spectral clustering algorithm for poorly separated data with arbitrary shapes. Distinguishing poorly separated clusters is a challenging issue since a border point of a cluster may be more similar to the border points of the adjacent cluster than to the points in its own cluster. We propose a local mean-based distance measure which helps in separating points in cluster borders. The distance between a pair of points, in the proposed distance measure, is defined as the distance between the mean of their k nearest neighbors. We also propose a new transitive-based method for computing the membership degrees of points to clusters. Our evaluation results on both artificial and real data show that both the proposed local mean-based distance measure and the proposed membership computation method have significant impacts in obtaining performance improvement over the existing methods.







Similar content being viewed by others
References
Ertoz L, Steinbach MS, Kumar V (2003) Finding clusters of different sizes, shapes, and densities in noisy, high dimensional data, Proceedings of the Third SIAM International Conference on Data Mining. San Francisco, CA, USA, pp. 47–58
MacQueen James B (1965) Some methods for classification and analysis of multivariate observations. In Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, Berkeley, CA, USA, pp. 281–297
Von Luxburg U (2007) A tutorial on spectral clustering. Stat Comput 17(4):395–416
Wang Bangjun, Zhang Li, Caili Wu, Li Fanzhang, Zhang Zhao (2017) Spectral clustering based on similarity and dissimilarity criterion. Pattern Anal Appl 20(2):495–506
Li Xiang, Wang Zhijian, Ronglin Hu, Zhu Quanyin, Wang Liuyang (2019) Recommendation algorithm based on improved spectral clustering and transfer learning. Pattern Anal Appl 22(2):633–647
Kong Wanzeng, Sanqing Hu, Zhang Jianhai, Dai Guojun (2013) Robust and smart spectral clustering from normalized cut. Neural Comput. Appl. 23(5):1503–1512
Newman M (2006) Finding community structure in networks using the eigenvectors of matrices. Phys Rev E 74:36–104
Ding Shifei, Jia Hongjie, Zhang Liwen, Jin Fengxiang (2014) Research of semi-supervised spectral clustering algorithm based on pairwise constraints. Neural Comput. Appl. 24(1):211–219
Shi J, Malik J (2000) Normalized cuts and image segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 22(8):888–905. https://doi.org/10.1109/34.868688
Tan M, Zhang S, Wu L (2020) Mutual Knn based spectral clustering. Neural Comput Appl 32:6435–6442
Cominetti O, Matzavinos A, Samarasinghe S, Kulasiri D, Liu S, Maini PK, Erban R (2010) DifFUZZY: A fuzzy spectral clustering algorithm for complex data sets. Int J Comput Intell Bioinf Syst Biol 1(4):402–417
Shi T, Belkin M, Yu B (2009) Data spectroscopy: Eigenspaces of convolution operators and clustering. Annals Stat 37(6B):3960–3984. https://doi.org/10.1214/09-AOS700
Jia H, Ding S, Xu X, Nie R (2014) The latest research progress on spectral clustering. Neural Comput Appl 24:1477–1486
Liu H, Zhang Q, Zhao F (2018) Interval fuzzy spectral clustering ensemble algorithm for color image segmentation. J Intell Fuzzy Syst 35(5):5467–5476
Zeng S, Tong X, Sang N (2014) Study on multi-center fuzzy C-means algorithm based on transitive closure and spectral clustering. Appl Soft Comput 16:89–101. https://doi.org/10.1016/j.asoc.2013.11.020
Huang T, Wang S, Zhu W (2020) An adaptive kernelized rank-order distance for clustering non-spherical data with high noise. Int J Mach Learn Cyber. https://doi.org/10.1007/s13042-020-01068-9
Tasdemir K, Yalcin B, Yildirim I (2015) Approximate spectral clustering with utilized similarity information using geodesic based hybrid distance measures. Pattern Recogn 48(4):1465–1477
Su MC, Chou CH (2001) A modified version of the K-means algorithm with a distance based on cluster symmetry. IEEE Trans Pattern Anal Mach Intell 23(6):674–680
Ng AY, Jordan MI, Weiss Y (2001) On spectral clustering: Analysis and an algorithm, NIPS’01: Proceedings of the 14th International Conference on Neural Information Processing Systems: Natural and Synthetic, 849–856
Verma D, Meila M (2003) A comparison of spectral methods, Technical Report UWCSE-03-05-01. University of Washington, Department of Computer Science and Engineering
Meila M, Shi J (2001) A Random Walks View of Spectral Segmentation, AISTATS (Proceedings of the Eighth International Workshop on Artificial Intelligence and Statistics, AISTATS 2001. Key West, Florida, USA
Kannan R, Vempala S, Vetta A (2004) On clusterings: Good, bad and spectral. J ACM 51(3):497–515. https://doi.org/10.1145/990308.990313
Wang Xiaoyu, Ding Shifei, Jia Weikuan (2020) Active constraint spectral clustering based on Hessian matrix. Soft Comput 24(3):2381–2390
Saade A, Krzakala F (2014) Spectral clustering of graphs with the Bethe Hessian. Int Conf Neural Inf Process Syst, 406–414
Nataliani Y, Yang MS (2019) Powered Gaussian kernel spectral clustering. Neural Comput Appl 31:557–572
Bezdek JC (1981) Pattern Recognition with Fuzzy Objective Function Algorithms. Plenum, New York
Krishnapuram R, Kim J (1999) A note on the Gustafson-Kessel and adaptive fuzzy clustering algorithms. IEEE Trans Fuzzy Syst 7(4):453–461
Zhao F, Liu H, Jiao L (2011) Spectral clustering with fuzzy similarity measure. Digital Signal Process 21:701–709
Zeyu L, Shiwei T, Jing X, Jun J (2001) Modified FCM clustering based on kernel mapping. Proc Internat Soc Optical Eng 4554:241–245
Memon KH, Lee DH (2018) Generalised kernel weighted fuzzy C-means clustering algorithm with local information. Fuzzy Sets Syst 340:91–108
Zhang DQ, Chen SC (2003) Clustering incomplete data using kernel-based fuzzy c-means algorithm. Neural Process Lett 18:155–162
Zhang DQ, Chen SC (2004) A novel kernelized fuzzy C-means algorithm with application in medical image segmentation. Artif Intell Med 32:37–50
Graves D, Pedrycz W (2007) Fuzzy C-Means, Gustafson-Kessel FCM, and Kernel-Based FCM: A Comparative Study. Anal Design Intell Syst Using Soft Comput Tech, 140–149
Jimenez R (2008) Fuzzy spectral clustering for identification of rock discontinuity sets. Rock Mech Rock Eng 41:929–939
Liu H, Zhao F, Jiao L (2012) Fuzzy spectral clustering with robust spatial information for image segmentation. Appl Soft Comput 12(11):3636–3647
Wang Y, Duan X, Liu X, Wang C (2018) A spectral clustering method with semantic interpretation based on axiomatic fuzzy set theory. Appl Soft Comput 64:59–74
Celikyilmaz A (2009) Soft-link spectral clustering for information extraction. Proc IEEE Int Conf Commun, 434–441
Stewart GW, Sun J (1990) Matrix Perturbation Theory, Computer Science and Scientific Computing. Academic Press, Cambridge
Zelnik-Manor L, Perona P (2004) Self-tuning spectral clustering, Neural Information Processing Systems (NIPS 2004) Vancouver, British Columbia. Canada 1601–1608
Zhang X, Li J, Yu H (2011) Local density adaptive similarity measurement for spectral clustering. Pattern Recogn Letts 32:352–358
Halkidi M, Batistakis Y, Vazirgiannis M (2002) Cluster Validity Methods: Part I. SIGMOD Rec 31(2):40–45
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Motallebi, H., Nasihatkon, R. & Jamshidi, M. A local mean-based distance measure for spectral clustering. Pattern Anal Applic 25, 351–359 (2022). https://doi.org/10.1007/s10044-021-01040-5
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10044-021-01040-5