Abstract
For most FCM-based fuzzy clustering algorithms, several problems, such as noise, non-spherical clusters, and size-imbalanced clusters, are difficult to solve. Different fuzzy clustering algorithms are developed to deal with these problems from different perspectives. However, no comprehensive viewpoint to generalize these problems has been put forward. In this paper, we reveal the inherent deficiency of FCM and propose a new fuzzy clustering method called Gaussian Collaborative Fuzzy C-means (GCFCM) to solve these problems. In GCFCM, Gaussian mixture model (GMM) and collaborative technology are adopted to enhance the ability of recognizing the intrinsic structure of clusters. Experimental results confirm that GCFCM is effective in dealing with noise, non-spherical clusters, size-imbalanced clusters, and those also show excellent performance in dealing with real-world data sets.
Similar content being viewed by others
References
Jain, A.K.: Data Clustering: 50 Years Beyond K-Means. Springer, Berlin (2008)
Deng, Z., Jiang, Y., Chung, F.L., Ishibuchi, H., Choi, K.S., Wang, S.: Transfer prototype-based fuzzy clustering. IEEE Trans. Fuzzy Syst. 24(5), 1210–1232 (2014)
Macqueen, J.: Some methods for classification and analysis of multivariate observations. In: Proceedings of Berkeley Symposium on Mathematical Statistics and Probability, pp. 281–297 (1967)
Reynolds, D.: Gaussian Mixture Models. Springer, New York (2009)
Huang, J.Z., Ng, M.K., Rong, H., Li, Z.: Automated variable weighting in k-means type clustering. IEEE Trans. Pattern Anal. Mach. Intell. 27(5), 657–668 (2005)
Li, R.P., Mukaidono, M.: Maximum-entropy approach to fuzzy clustering. In: Proceedings of the 4th IEEE International Conference on Fuzzy Systems (FUZZ-IEEE/IFES’95), vol. 4, pp. 2227–2232 (1995)
Bradley, P.S., Mangasarian, O.L.: k-plane clustering. J. Glob. Optim. 16(1), 23–32 (2000)
Dunn, J.C.: A fuzzy relative of the isodata process and its use in detecting compact well-separated clusters. J. Cybern 3(3), 32–57 (1974)
Liu, J., Pham, T.D.: A spatially constrained fuzzy hyper-prototype clustering algorithm. Pattern Recogn. 45(4), 1759–1771 (2012)
Ester, M., Kriegel, H., Sander, J., Xiaowei, X.: A density-based algorithm for discovering clusters in large spatial databases with noise. In: Proceedings of 2nd International Conference Knowledge in Discovery Data Mining, pp. 226–231 (1996)
Ester, M., Kriegel, H.P., Xu, X.: Density-based clustering in spatial databases: the algorithm gdbscan and its applications. Data Mining Knowl. Discov. 2(2), 169–194 (1998)
Hinneburg, A., Keim, D.A.: An efficient approach to clustering in large multimedia databases with noise. In: International Conference on Knowledge Discovery and Data Mining, pp. 58–65 (1998)
Ankerst, M., Breunig, M.M., Kriegel, H., Sander, J.: Optics: ordering points to identify the clustering structure. Int. Conf. Manag. Data 28(2), 49–60 (1999)
Pei, T., Jasra, A., Hand, D.J., Zhu, A.X., Zhou, C.: Decode: a new method for discovering clusters of different densities in spatial data. Data Mining Knowl Discov 18(3), 337–369 (2009)
Duan, L., Xu, L., Guo, F., Lee, J., Yan, B.: A local-density based spatial clustering algorithm with noise. Inf. Syst. 32(7), 978–986 (2007)
Shi, J., Malik, J.: Normalized cuts and image segmentation. IEEE Trans. Pattern Anal. Mach. Intell 22(8), 888–905 (2000)
Hartuv, E., Shamir, R.: A clustering algorithm based on graph connectivity. Inf. Process. Lett. 76(4), 175–181 (2000)
Ng, A.Y., Jordan, M.I., Weiss, Y.: On spectral clustering: analysis and an algorithm. Proc. Nips 14, 849–856 (2002)
Qian, P., Chung, F.L., Wang, S., Deng, Z.: Fast graph-based relaxed clustering for large data sets using minimal enclosing ball. IEEE Trans. Syst. Man Cybern. Part B 42(3), 672–687 (2012)
Dhillon, I.S., Guan, Y., Kulis, B.: Kernel k-means: spectral clustering and normalized cuts. In: Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 551–556 (2004)
Benhur, A.: Support vector clustering. J. Mach. Learn. Res. 2(2), 125–137 (2002)
Lin, Y.T., Yang, S.B.: A genetic approach to the automatic clustering problem. Pattern Recogn. 34(2), 415–424 (2001)
Kohonen, T.: The self-organizing map. Proc. IEEE 78(9), 1464–1480 (1990). https://doi.org/10.1109/5.58325
Zadeh, L.A.: Is there a need for fuzzy logic? In: Fuzzy Information Processing Society, 2008. Nafips 2008 Meeting of the North American, pp. 1–3 (2008)
Bezdek, J.C., Ehrlich, R., Full, W.: Fcm: the fuzzy c-means clustering algorithm. Comput. Geosci. 10(2), 191–203 (1984)
Krishnapuram, R., Keller, J.M.: A possibilistic approach to clustering. IEEE Trans. Fuzzy Syst. 1(2), 98–110 (2002)
Zarinbal, M., Fazel Zarandi, M.H., Turksen, I.B.: Relative entropy fuzzy c-means clustering. Inf. Sci. 260(1), 74–97 (2014)
Dave, R.N.: Characterization and detection of noise in clustering. Pattern Recogn. 12(11), 657–664 (1992)
Miyamoto, S., Umayahara, K.: Fuzzy clustering by quadratic regularization. In: IEEE International Conference on Fuzzy Systems Proceedings, 1998. IEEE World Congress on Computational Intelligence, vol. 2, pp. 1394–1399 (2002)
Pham, D.L.: Spatial models for fuzzy clustering. Comput. Vis. Image Understand. 84(2), 285–297 (2001)
Gao, Y., Wang, D., Pan, J., Wang, Z., Chen, B.: A novel fuzzy c-means clustering algorithm using adaptive norm. Int. J. Fuzzy Syst. 21(8), 2632–2649 (2019)
Gan, G., Wu, J., Yang, Z.: A fuzzy subspace algorithm for clustering high dimensional data. In: International Conference on Advanced Data Mining and Applications, pp. 271–278 (2006)
Agrawal, R., Gehrke, J., Gunopulos, D., Raghavan, P.: Automatic subspace clustering of high dimensional data for data mining applications. ACM (1998)
Jing, L., Ng, M.K., Huang, J.Z.: An entropy weighting k-means algorithm for subspace clustering of high-dimensional sparse data. IEEE Trans. Knowl. Data Eng. 19(8), 1026–1041 (2007)
Deng, Z., Choi, K.S., Chung, F.L., Wang, S.: Enhanced soft subspace clustering integrating within-cluster and between-cluster information. Pattern Recogn. 43(3), 767–781 (2010)
Zhou, J., Chen, L., Chen, C.L.P., Zhang, Y., Li, H.X.: Fuzzy clustering with the entropy of attribute weights. Neurocomputing 198(C), 125–134 (2016)
Hwang, C., Rhee, C.H.: Uncertain fuzzy clustering: interval type-2 fuzzy approach to c-means. IEEE Trans. Fuzzy Syst. 15(1), 107–120 (2007)
Min, J.H., Shim, E.A., Rhee, C.H.: An interval type-2 fuzzy pcm algorithm for pattern recognition. In: IEEE International Conference on Fuzzy Systems, 2009. Fuzz-Ieee, pp. 480–483 (2009)
Ji, Z., Xia, Y., Sun, Q., Cao, G.: Interval-valued possibilistic fuzzy c-means clustering algorithm. Fuzzy Sets Syst. 253(3), 138–156 (2014)
Huang, Y.P., Singh, P., Kuo, W.L., Chu, H.C.: A type-2 fuzzy clustering and quantum optimization approach for crops image segmentation. Int. J. Fuzzy Syst. 2, 1 (2021)
Pedrycz, W.: Collaborative fuzzy clustering. Pattern Recogn. Lett. 23(14), 1675–1686 (2002)
Zarinbal, M., Zarandi, M.H.F., Turksen, I.B.: Relative entropy collaborative fuzzy clustering method. Pattern Recogn. 48(3), 933–940 (2015)
Pedrycz, W., Rai, P.: Collaborative clustering with the use of fuzzy c-means and its quantification. Fuzzy Sets Syst. 159(18), 2399–2427 (2008)
Mitra, S., Banka, H., Pedrycz, W.: Rough-Fuzzy Collaborative Clustering. IEEE Press, New York (2006)
Oliveira, J.V.D., Pedrycz, W.: Advances in Fuzzy Clustering and Its Applications. Wiley, New York (2007)
Cao, Y., Wu, J.: Projective art for clustering data sets in high dimensional spaces. Neural Netw. 15(1), 105–120 (2002)
Zhao, Y.P., Chen, L., Chen, C.L.P.: Fuzzy clustering in cascaded feature space. Int. J. Fuzzy Syst. 21(7), 2155–2167 (2019)
Mendel, J.M.: Uncertain rule-based fuzzy logic systems: introduction and new directions. Ptr Upper Saddle River Nj 133, 02 (2001)
Mendel, J.M., John, R.I.B.: Type-2 fuzzy sets made simple. IEEE Trans. Fuzzy Syst. 10(2), 117–127 (2002)
Mendel, J.M.: Computing derivatives in interval type-2 fuzzy logic systems. IEEE Trans. Fuzzy Syst. 12(1), 84–98 (2004)
Mendel, J.M., John, R.I.: A fundamental decomposition of type-2 fuzzy sets. In: Ifsa World Congress and Nafips International Conference, vol. 4, 2001. Joint, pp. 1896–1901 (2001)
Corless, R.M., Gonnet, G.H., Hare, D.E.G., Jeffrey, D.J., Knuth, D.E.: On the Lambertw function. Adv. Comput. Math. 5(1), 329–359 (1996)
Pal, N.R., Pal, K., Keller, J.M., Bezdek, J.C.: A possibilistic fuzzy c-means clustering algorithm. IEEE Trans. Fuzzy Syst. 13(4), 517–530 (2005)
Kwon, S.H.: Cluster validity index for fuzzy clustering. Electron. Lett. 34(22), 2176–2177 (1998)
Liu, J., Mohammed, J., Carter, J., Ranka, S., Kahveci, T., Baudis, M.: Distance-based clustering of cgh data. Bioinformatics 22(16), 1971–1978 (2006)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Gao, Y., Wang, Z., Li, H. et al. Gaussian Collaborative Fuzzy C-Means Clustering. Int. J. Fuzzy Syst. 23, 2218–2234 (2021). https://doi.org/10.1007/s40815-021-01090-1
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s40815-021-01090-1