Abstract
Clustering for symbolic data type is a necessary process in many scientific disciplines, and the fuzzy c-means clustering for interval data type (IFCM) is one of the most popular algorithms. This paper presents an adaptive fuzzy c-means clustering algorithm for interval-valued data based on interval-dividing technique. This method gives a fuzzy partition and a prototype for each fuzzy cluster by optimizing an objective function. And the adaptive distance between the pattern and its cluster center varies with each algorithm iteration and may be either different from one cluster to another or the same for all clusters. The novel part of this approach is that it takes into account every point in both intervals when computing the distance between the cluster and its representative. Experiments are conducted on synthetic data sets and a real data set. To compare the comprehensive performance of the proposed method with other four existing methods, the corrected rand index, the value of objective function and iterations are introduced as the evaluation criterion. Clustering results demonstrate that the algorithm proposed in this paper has remarkable advantages.
References
de Almeida CWD, de Souza RMCR, Candeias ALB (2013) Fuzzy kohonen clustering networks for interval data. Neurocomputing 99:65–75
Graña M, Corchado E (2014) A survey of multiple classifier systems as hybrid systems. Inf Fus 16:3–17. doi:10.1016/j.inffus.2013.04.006
Guo J, Chen Y, Li W (2013) K-means clustering method for symbolic interval data. J Manag Sci China 3:21–28
Gong M, Liang Y, Shi J, Ma W, Ma J (2013) Fuzzy C-means clustering with local Information and Kernel metric for image segmentation. IEEE Trans Image Process 22:573–584
Jiang Z, Li T, Min W, Qi Z, Rao Y (2017) Fuzzy c-means clustering based on weights and gene expression programming. Pattern Recognit Lett 90:1–7. doi:10.1016/j.patrec.2017.02.015
Diday E (2017) From the statistics of data to the statistics of knowledge: symbolic data analysis. J Am Stat Assoc. doi:10.2307/30045255
Bodendorf F, Bryant PG, Critchley F, Diday E, Ihm P, Meulmann J. (n.d.). Studies in classification, data analysis, and knowledge organization. Springer, Berlin Heidelberg
Diday E (1989) Introduction to data analysis by symbolic approach. Inf Comput Sci 89:67–72
Dunn JC (1974) A Fuzzy relative of the ISODATA process and Its use in detecting compact well-separated clusters. J Cybern 3:32–57
Bezdek JC (1981) Pattern recognition with fuzzy objective function algorithms
Gao X, Fan J, Xie W (1999) Fuzzy c-means clustering method for interval-valued data. J XiDian 90:1–7
de Carvalho FAT (2007) Fuzzy c-means clustering methods for symbolic interval data. Pattern Recognit Lett 28:423–437
de Carvalho FAT, Tenório CP, Junior NLC (2006) Partitional fuzzy clustering methods based on adaptive quadratic distances. Fuzzy Sets Syst 157:2833–2857
de Carvalho FAT, Barbosa GBN, Pimentel JT (2013) Partitioning fuzzy C-means clustering algorithms FOR interval-valued data based on city-block distances. In: 2013 Brazilian conference intelligent system, pp 113–118
de Carvalho FAT, Lechevallier Y (2009) Partitional clustering algorithms for symbolic interval data based on single adaptive distances. Pattern Recognit 42:1223–1236
De Souza RMCR, De Carvalho LV (2012) A novel adaptive fuzzy C-Means algorithm for interval data type. In: WCCI 2012 IEEE world congress on computational intelligence, Brisbane
Pimentel BA, De Souza RMCR (2014) A weighted multivariate Fuzzy C-Means method in interval-valued scientific production data. Expert Syst Appl 41:3223–3236
Alvim AFM (2012) A fuzzy weighted clustering method for symbolic interval data, pp 0–5
Hosni H, Montagna F (2016) Information processing and management of uncertainty in knowledge-based systems. Commun Comput Inf Sci 444:436–445
Tran L, Duckstein L (2002) Comparison of fuzzy numbers using a fuzzy distance measure. Fuzzy Sets Syst 130:331–341
Yue M (2011) A novel Fuzzy c-means clustering method for interval-valued data. Comput Eng Appl China 47:157–160
Acknowledgements
The authors would like to thank Prof. Bo Li for his proof reading and the reviewers for their valuable time. This research is supported by the National Science Foundation (91338115), National S/T Major Project (2015ZX03002006) and the 111 Project (B08038).
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Bao, C., Peng, H., He, D. et al. Adaptive fuzzy c-means clustering algorithm for interval data type based on interval-dividing technique. Pattern Anal Applic 21, 803–812 (2018). https://doi.org/10.1007/s10044-017-0663-2
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10044-017-0663-2