A new validity index adapted to fuzzy clustering algorithm

Li, Wei; Li, Kangshun; Guo, Luyan; Huang, Ying; Xue, Yu

doi:10.1007/s11042-017-5550-8

A new validity index adapted to fuzzy clustering algorithm

Published: 27 February 2018

Volume 77, pages 11339–11361, (2018)
Cite this article

Multimedia Tools and Applications Aims and scope Submit manuscript

Wei Li^1,2,
Kangshun Li¹,
Luyan Guo¹,
Ying Huang³ &
…
Yu Xue⁴

388 Accesses
4 Citations
Explore all metrics

Abstract

The fuzzy c-means clustering algorithm is the most common clustering algorithm. It solves the unrealistic nature of data by defining the membership matrix. As the fuzzy c-means clustering algorithm needs to set the number of classifications in advance, which is almost impossible in cases with no prior knowledge of the data set, some scholars put forward the concept of the validity index. Because the validity index is related to the distance relation between the membership matrix, the data point in the data set and the center of clustering, it is hoped that the feature weighting method can be used to evaluate all the characteristics of data in a data set to obtain the optimal classification number. Therefore, this paper presents an improved validity index for the comprehensive weight index, compactness index and separability index. This validity index first determines the relationship between the features of the data points and the data point itself. By defining the new compactness function and the separability function, the weight of each feature in the data set is obtained, and then the validity index is combined with the fuzzy c-means clustering algorithm to effectively determine the number of classes to be processed. The proposed algorithm is tested on two artificial data sets and real data sets; the experimental results demonstrated the advantages of this work in image processing and showed that it can effectively obtain reliable data classification results.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fuzzy C-Means Clustering Validity Function Based on Multiple Clustering Performance Evaluation Components

Article 21 February 2022

An Improved Fuzzy C-Means Clustering Algorithm Based on Intuitionistic Fuzzy Sets

A novel cluster validity index for fuzzy C-means algorithm

Article 08 December 2016

References

Al-Ayyoub M, AlZu’bi S, Jararweh Y, Shehab MA, Gupta BB (2016) Accelerating 3D medical volume segmentation using GPUs. Multimed Tools Appl. https://doi.org/10.1007/s11042-016-4218-0
Google Scholar
Almomani A, Gupta BB, Wan TC, Altaher A, Manickam S (2013) Phishing dynamic evolving neural fuzzy framework for online detection zero-day phishing email. arXiv preprint arXiv:1302.0629
An W, Liang M (2013) Fuzzy support vector machine based on within-class scatter for classification problems with outliers or noises. Neurocomputing 110:101–110
Article Google Scholar
Anderson E (1935) The irises of the Gaspe peninsula. Bull Am Ir Soc 59:2–5
Google Scholar
Ayed AB, Halima MB, Alimi AM (2014) Survey on clustering methods: Towards fuzzy clustering for big data. In Soft Computing and Pattern Recognition (SoCPaR), 2014 6th International Conference of (pp. 331–336). IEEE
Bezdek JC (1973) Cluster validity with fuzzy sets[J]. J Cybern 3(3):58–73
Article MathSciNet MATH Google Scholar
Bezdek JC (1974) Numerical taxonomy with fuzzy sets[J]. J Math Biol 1(1):57–71
Article MathSciNet MATH Google Scholar
Bezdek JC (1981) Pattern recognition with fuzzy objective function algorithms. Plenum, New York
Carazo JM, Rivera FF, Zapata EL, Radermacher M, Frank J (1990) Fuzzy sets-based classification of electron microscopy images of biological macromolecules with an application to ribosomal particles. J Microsc 157(2):187–203
Article Google Scholar
Bezdek JC, Ehrlich R, Full W (1984) FCM: The fuzzy c-means clustering algorithm. Computers & Geosciences 10(2–3):191–203
Fu Z, Wu X, Guan C, Sun X, Ren K (2016) Toward efficient multi-keyword fuzzy search over encrypted outsourced data with accuracy improvement. IEEE Trans Inf Forensics Secur 11(12):2706–2716
Article Google Scholar
Fukuyama Y (1989) A new method of choosing the number of clusters for the fuzzy c-mean method. In Proc. 5th Fuzzy Syst. Symp., 1989 (pp. 247–250)
Gu B, Sheng VS (2017) A robust regularization path algorithm for ν-support vector classification. IEEE Trans Neural Netw Learn Syst 28(5):1241–1248
Article Google Scholar
Gu B, Sheng VS, Tay KY, Romano W, Li S (2015) Incremental support vector learning for ordinal regression. IEEE Trans Neural Netw Learn Syst 26(7):1403–1416
Article MathSciNet Google Scholar
Gupta B, Agrawal DP, Yamaguchi S (eds) (2016) Handbook of research on modern cryptographic solutions for computer and cyber security. IGI Global
Kira K, Rendell LA (1992) A practical approach to feature selection. In Proceedings of the ninth international workshop on Machine learning (pp. 249–256)
Kononenko I (1994) Estimating attributes: analysis and extensions of RELIEF. In European conference on machine learning (pp. 171–182). Springer, Berlin
Krishnapuram R, Keller JM (1993) A possibilistic approach to clustering. IEEE Trans Fuzzy Syst 1(2):98–110
Article Google Scholar
Kwon SH (1998) Cluster validity index for fuzzy clustering. Electron Lett 34(22):2176–2177
Article Google Scholar
Li Z, Yuan J, Zhang W (2009) Fuzzy C-mean algorithm with morphology similarity distance. In Fuzzy Systems and Knowledge Discovery, 2009. FSKD'09. Sixth International Conference on (Vol. 3, pp. 90–94). IEEE
Li P, Li J, Huang Z, Li T, Gao CZ, Yiu SM, Chen K (2017) Multi-key privacy-preserving deep learning in cloud computing. Futur Gener Comput Syst 74:76–85
Article Google Scholar
Li P, Li J, Huang Z, Gao CZ, Chen WB, Chen K (2017) Privacy-preserving outsourced classification in cloud computing. Clust Comput. https://doi.org/10.1007/s10586-017-0849-9
Google Scholar
Liu C, Zhou A, Zhang G (2013) Automatic clustering method based on evolutionary optimisation. IET Comput Vis 7(4):258–271
Article Google Scholar
MacQueen J (1967) Some methods for classification and analysis of multivariate observations. In Proceedings of the fifth Berkeley symposium on mathematical statistics and probability 1(14):281–297
MathSciNet MATH Google Scholar
Pedrycz W (1985) Algorithms of fuzzy clustering with partial supervision. Pattern Recogn Lett 3(1):13–20
Article Google Scholar
Rhazali Y, Hadi Y, Mouloudi A (2016) A based-rule method to transform CIM to PIM into MDA. Int J Cloud Appl Comput (IJCAC) 6(2):11–24
Google Scholar
Shieh SL, Liao IE (2012) A new approach for data clustering and visualization using self-organizing maps. Expert Syst Appl 39(15):11924–11933
Article Google Scholar
Skabar A, Abdalgader K (2013) Clustering sentence-level text using a novel fuzzy relational clustering algorithm. IEEE Trans Knowl Data Eng 25(1):62–75
Article Google Scholar
Vaidya J, Shafiq B, Basu A, Hong Y (2013) Differentially private naive bayes classification. In Proceedings of the 2013 IEEE/WIC/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (IAT)-Volume 01 (pp. 571–576). IEEE Computer Society
Wang, W., Yang, J., & Muntz, R. (1997). STING: A statistical information grid approach to spatial data mining. In VLDB (Vol. 97, pp. 186–195)
Wang F, Zhang Y, Rao Q, Li K, Zhang H (2017) Exploring mutual information-based sentimental analysis with kernel-based extreme learning machine for stock prediction. Soft Comput 21(12):3193–3205
Article Google Scholar
Wu J, Guo S, Li J, Zeng D (2016) Big data meet green challenges: Big data toward green applications. IEEE Syst J 10(3):888–900
Article Google Scholar
Wu J, Guo S, Li J, Zeng D (2016) Big data meet green challenges: Greening big data. IEEE Syst J 10(3):873–887
Article Google Scholar
Wu H, Kuang L, Wang F, Rao Q, Gong M, Li Y (2017) A multiobjective box-covering algorithm for fractal modularity on complex networks. Appl Soft Comput 61:294–313
Article Google Scholar
Xie XL, Beni G (1991) A validity measure for fuzzy clustering. IEEE Trans Pattern Anal Mach Intell 13(8):841–847
Article Google Scholar
Xue Y, Jiang J, Zhao B, Ma T (2017) A self-adaptive artificial bee colony algorithm based on global best for global optimization. Soft Computing:1–18
Yu Z, Chen H, You J, Wong HS, Liu J, Li L, Han G (2014) Double selection based semi-supervised clustering ensemble for tumor clustering from gene expression profiles. IEEE/ACM Trans Comput Biol Bioinformatics (TCBB) 11(4):727–740
Article Google Scholar
Zadeh LA (1965) Fuzzy sets. Inf Control 8(3):338–353
Article MATH Google Scholar

Download references

Acknowledgments

This work was supported by the National Natural Science Foundation of China under Grant Nos. 61573157 and 61561024, the Science and Technology Planning Project of Guangdong Province with the Grant No. 2017A010101037, the Science and Technology Research Project of Jiangxi Province under Grant Nos. GJJ160631 and GJJ160930, the Science Foundation of Jiangxi University of Science and Technology under the grant Nos. NSFJ2015-K13 and NSFJ2014-K11.

Author information

Authors and Affiliations

College of Mathematics and Informatics, South China Agricultural University, Guangzhou, Guangdong, China
Wei Li, Kangshun Li & Luyan Guo
School of Information Engineering, Jiangxi University of Science and Technology, Ganzhou, Jiangxi, China
Wei Li
Key Laboratory of Jiangxi Province for Numerical Simulation and Emulation Techniques, Institute of Mathematical and Computer Sciences, Gannan Normal University, Ganzhou, China
Ying Huang
School of Computer and Software, Nanjing University of Information Science and Technology, Nanjing, China
Yu Xue

Authors

Wei Li
View author publications
You can also search for this author in PubMed Google Scholar
Kangshun Li
View author publications
You can also search for this author in PubMed Google Scholar
Luyan Guo
View author publications
You can also search for this author in PubMed Google Scholar
Ying Huang
View author publications
You can also search for this author in PubMed Google Scholar
Yu Xue
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Kangshun Li.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Li, W., Li, K., Guo, L. et al. A new validity index adapted to fuzzy clustering algorithm. Multimed Tools Appl 77, 11339–11361 (2018). https://doi.org/10.1007/s11042-017-5550-8

Download citation

Received: 28 April 2017
Revised: 13 December 2017
Accepted: 15 December 2017
Published: 27 February 2018
Issue Date: May 2018
DOI: https://doi.org/10.1007/s11042-017-5550-8

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A new validity index adapted to fuzzy clustering algorithm

Abstract

Access this article

Similar content being viewed by others

Fuzzy C-Means Clustering Validity Function Based on Multiple Clustering Performance Evaluation Components

An Improved Fuzzy C-Means Clustering Algorithm Based on Intuitionistic Fuzzy Sets

A novel cluster validity index for fuzzy C-means algorithm

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

A new validity index adapted to fuzzy clustering algorithm

Abstract

Access this article

Similar content being viewed by others

Fuzzy C-Means Clustering Validity Function Based on Multiple Clustering Performance Evaluation Components

An Improved Fuzzy C-Means Clustering Algorithm Based on Intuitionistic Fuzzy Sets

A novel cluster validity index for fuzzy C-means algorithm

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation