Abstract
Conventional unsupervised Fuzzy K-means methods (FKM) usually analyze the structure of data solely without considering the influence of label information carried by data, which limits the performance and stability of clustering. How to leverage annotated label information to improve the performance of unsupervised FKM methods is still a challenging research problem. To that end, this paper proposes a new Semi-Supervised Fuzzy K-means method (SSFKM) consisting of dynamic adjustment and label discrimination. Specifically, dynamic adjustment aligns label information and clustering results to distinguish the learning difficulties of labeled data and enable the method to focus on simple but reliable label information. Moreover, a new distance measure is designed to re-evaluate the membership of labeled data with cluster centers, forcing labeled data to be classified into correct cluster for enhancing label discrimination. Comprehensive experiments demonstrate that the SSFKM method achieves the best performance compared with existing state-of-the-art semi-supervised clustering methods. In addition, the results demonstrate that the SSFKM method reduces the impact of data noise effectively during clustering.











Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Data availability
The datasets used in this paper are publicly available.
References
Sulaiman SN, Isa NAM (2011) Adaptive fuzzy-K-means clustering algorithm for image segmentation. IEEE Trans Consum Electron 56(4):2661–2668
Thaipanich T, Oh BT, Wu PH, Xu DR (2011) Improved image denoising with adaptive nonlocal means (ANL-Means) algorithm. IEEE Trans Consum Electron 56(4):2623–2630
Shanthi I, Valarmathi ML (2013) SAR image despeckling using possibilistic fuzzy C-means clustering and edge detection in bandelet domain. Neural Comput Appl 23(1):279-S291
Hao T, Rusanov A, Boland M et al (2014) Clustering clinical trials with similar eligibility criteria features. J Biomed Inform 52(c):112–120
Xu C, Tao D, Xu C (2015) Multi-view intact space learning. IEEE Trans Pattern Anal Mach Intell 37(12):2531–2544
Zhou B, Liu W, Zhang W et al (2022) Multi-kernel graph fusion for spectral clustering. Inf Process Manage 59(5):103003
Choi M, Chang IJ, Kim J (2016) Optimal reference view selection algorithm for low complexity disparity estimation. IEEE Trans Consum Electron 62(1):45–52
McQueen J (1967) Some methods for classification and analysis of multivariate observations. In: Proceedings of the 5th Berkeley Symposium on Mathematics Statistics and Probability pp 281–297
Ruspini E (1969) A new approach to clustering. Inf Control 15(1):22–32
Bezdek J (1981) Pattern recognition with fuzzy objective function algorithms. Plenum Press, London
Xu J, Han J, Kai X, et al (2016) Robust and sparse fuzzy k-means clustering. In: Proceedings of the 25th International Joint Conference on Artificial Intelligence, pp 2224–2230
Bui Q, Vo B, Snasel V et al (2020) SFCM: a fuzzy clustering algorithm of extracting the shape information of data. IEEE Trans Fuzzy Syst 29(1):75–89
Nie F, Wang C, Li X (2019) K-Multiple-Means: a multiple-means clustering method with specified k clusters. In: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining pp 959–967
Zhou J, Pedrycz W, Yue X et al (2022) Projected fuzzy C-means clustering with locality preservation. Pattern Recogn 113(6):107748
Guo Y, Sengur A (2015) NCM: neutrosophic c-means clustering algorithm. Pattern Recogn 48(8):2710–2714
Guo Y, Sengur A (2015) NECM: neutrosophic evidential c-means clustering algorithm. Neural Comput Appl 26(3):561–571
Akbulut Y, Abdulkadir S, Guo Y et al (2017) KNCM: kernel neutrosophic c-means clustering. Appl Soft Comput 52:714–724
Xi L, Zhang FB (2020) An adaptive artificial-fish-swarm-inspired fuzzy C-means algorithm. Neural Comput Appl 32(22):16891–16899
Nguyen TPQ, Kuo RJ, Le MD et al (2022) Local search genetic algorithm-based possibilistic weighted fuzzy c-means for clustering mixed numerical and categorical data. Neural Comput Appl 34(20):18059–18074
Grira N, Crucianu M, Boujemaa N (2008) Active semi-supervised fuzzy clustering. Pattern Recogn 41(5):1834–1844
Zhang H, Jing L (2009) Semi-supervised fuzzy clustering: a kernel-based approach. Knowl-Based Syst 22(6):477–481
Zhang R, Nie F, Guo M et al (2018) Joint learning of fuzzy k-means and nonnegative spectral clustering with side information. IEEE Trans Image Process 28(5):2152–2162
Li L, Garibaldi JM, He D et al (2015) Semi-supervised fuzzy clustering with feature discrimination. PLoS ONE 10(9):131–160
Zhang D, Ma Y, Zhu H et al (2022) A label-guided weighted semi-supervised neutrosophic clustering algorithm. J Intell Fuzzy Syst 43(5):5661–5672
Shi C, Gu Z, Duan C et al (2020) Multi-view adaptive semi-supervised feature selection with the self-paced learning. Signal Process 168:107332
Chen L, Lu J (2021) Adaptive graph learning for semi-supervised self-paced classification. Neural Process Lett 54(4):2695–2716
Basu S, Banerjee A, Mooney R (2002) Semi-supervised clustering by seeding. In: Proceedings of the 19th International Conference on Machine Learning, pp 19–26
Wagstaff K, Cardie C (2000) Clustering with instance level constraints. In: Proceedings of the 17th International Conference on Machine Learning, pp 1097–1103
Wagstaff K, Cardie C, Rogerss S, et al (2001) Constrained K-means clustering with background knowledge. In Proceedings of the 18th International Conference on Machine Learning, pp 577–584
Nie F, Cai G, Li X (2017) Multi-view clustering and semi-supervised classification with adaptive neighbours. In: Proceedings of the 31st AAAI Conference on Artificial Intelligence, pp 2415–2421
Zhuang L, Zhou Z, Gao S et al (2017) Label information guided graph construction for semi-supervised learning. IEEE Trans Image Process 26(9):4182–4192
Chen L, Zhong Z (2022) Adaptive and structured graph learning for semi-supervised clustering. Inf Process Manage 59(4):102949
Wang D, Nie F, Huang H (2014) Large-scale adaptive semi-supervised learning via unified inductive and transductive model. In: Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp 482–491
Nie F, Hua W, Huang H, et al (2013) Adaptive loss minimization for semi-supervised elastic embedding. In: Proceedings of the 23th International Joint Conference on Artificial Intelligence, pp 1565–1571
Qiu S, Nie F, Xu X et al (2018) Accelerating flexible manifold embedding for scalable semi-supervised learning. IEEE Trans Circuits Syst Video Technol 29(9):2286–2295
Chang X, Nie F, Yang Y, et a1 (2014) A convex formulation for semi-supervised multi-label feature selection. In: Proceedings of the 28th AAAI Conference on Artificial Intelligence, pp 1171–1177
Ding S, Jia H, Du M et al (2018) A semi-supervised approximate spectral clustering algorithm based on HMRF model. Inf Sci 429:215–228
Nie F, Wang X, Jordan M I, et al (2016) The constrained laplacian rank algorithm for graph-based clustering. In: Proceedings of the 30th AAAI Conference on Artificial Intelligence, pp 1969–1976
Strehl A, Ghosh J (2002) Cluster ensembles-a knowledge reuse framework for combining multiple partitions. J Mach Learn Res 3(3):583–617
William M (1971) Objective criteria for the evaluation of clustering methods. J Am Stat Assoc 66(336):846–850
Demiar J (2006) Statistical comparisons of classifiers over multiple data sets. J Mach Learn Res 7:1–30
Acknowledgements
This work was supported in part by the National Social Science Fund of China (18ZDA153, 19BFX127, 19BYY125), in part by the Research Grants Council of the Hong Kong Special Administrative Region, China (UGC/FDS16/E09/22), and in part by the Natural Science Foundation of Guangdong Province (2021A1515011339).
Author information
Authors and Affiliations
Corresponding authors
Ethics declarations
Conflict of interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this article.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Zhu, H., Xie, W., Mu, Y. et al. A new semi-supervised fuzzy K-means clustering method with dynamic adjustment and label discrimination. Neural Comput & Applic 36, 4709–4725 (2024). https://doi.org/10.1007/s00521-023-09115-6
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00521-023-09115-6