Abstract
In this paper, a novel interval possibilistic fuzzy c-means (IPFCM) clustering method is proposed for clustering symbolic interval data. Clustering algorithms are important methods that required in pattern recognition, data mining and text mining, etc. Most of cluster algorithms focus on single-valued data. The interval fuzzy c-means (IFCM) clustering method is first proposed for symbolic interval data. However, the IFCM may not have good performance when facing noisy date or data with outliers. Hence, the advantages of the proposed IPFCM clustering algorithm in this study mainly overcome the noisy date or data with outliers in symbolic interval data. In the proposed approach, two different distances; namely, Euclidean and Hausdorff distance measures are all individually considered. That is, based on Lagrange multipliers nonlinear programming method, the proposed IPFCM clustering algorithm under Euclidean and Hausdorff distance measure was individually derived for symbolic interval data. From our experimental results, the proposed IPFCM clustering algorithm indeed has better performance than the IFCM and the interval fuzzy possibilistic c-means (IFPCM) clustering algorithm under different distance measures for the noisy date or data with outlier problem. Besides, the proposed IPFCM clustering algorithm is also implemented on windows smart phone platform and shown better performance as expected. Finally, a real data set is added for testing.
Similar content being viewed by others
References
Billard, L., Diday, E.: Symbolic Data Analysis: Conceptual statistics and Data Mining. Wiley, Hoboken (2006)
Mizuta, M.: Meta-analysis and SDA. In: 6th International Workshop Symbolic Data Analysis, Ljubljana, Slovenia (2017)
Diday, E., Noirhomme-Fraiture, M.: Symbolic Data Analysis and the SODAS Software. Wiley, Hoboken (2008)
Domingues, M.A.O., de Souza, R.M.C.R., Cysneiros, F.J.A.: A robust method for linear regression of symbolic interval data. Pattern Recogn. Lett. 31, 1991–1996 (2010)
Oliveira, M.R., Vilela, M., Pacheco, A., Valadas, R., Salvador, P.: Extracting information from interval data using symbolic principal component analysis. Austrian J. Stat. 46, 79–87 (2017)
From wikipedia for cluster analysis: http://en.wikipedia.org/wiki/Cluster_analysis
Cebeci, Z., Yildiz, F.: Comparison of k-means and fuzzy c-means algorithms on different cluster structures. J. Agric. Inform. 6, 13–23 (2015)
Bezdek, J.C.: A convergence theorem for the fuzzy ISODATA clustering algorithms. IEEE Trans. Pattern Anal. Mach. Intell. 2, 1–8 (1980)
Pal, N.R., Pal, K., Keller, J.M., Bezdek, J.C.: A possibilistic fuzzy C-means clustering algorithm. IEEE Trans. Fuzzy Syst. 13, 517–530 (2005)
Maji, P., Pal, S.K.: Rough set based generalized fuzzy c-means algorithm and quantitative indices. IEEE Trans. Syst. Man Cybern. B Cybern. 37, 1529–1540 (2007)
Gowda, K.C., Diday, E.: Symbolic clustering using a new dissimilarity measure. Pattern Recogn. 24(6), 567–578 (1991)
Gowda, K.C., Ravi, T.R.: Clustering of symbolic objects using gravitational approach. IEEE Trans. Syst. Man Cybern. 29(6), 888–894 (1999)
El-Sonbaty, Y., Ismail, M.A.: Fuzzy clustering for symbolic data. IEEE Trans Fuzzy Syst. 6(2), 195–204 (1998)
Ralambondrainy, H.: A conceptual version of the k-means algorithm. Pattern Recogn. Lett. 16, 1147–1157 (1995)
Beg, I., Rashid, T.: Fuzzy distance measure and fuzzy clustering algorithm. J. Interdiscip. Math. 18, 471–492 (2015)
de Carvalho, F.D.A.T.: Fuzzy C-means clustering methods for symbolic interval data. Pattern Recogn. Lett. 28, 423–437 (2007)
Evsukoff, A.G., Branco, A.C.S., Galichet, S.: Intelligent data analysis and model interpretation with spectral analysis fuzzy symbolic modeling. Int. J. Approx. Reason. 52, 728–750 (2011)
Chuang, C.-C., Jeng, J.-T., Chang, S.-C.: Hausdorff distance measure based interval fuzzy possibilistic c-means clustering algorithm. Int. J. Fuzzy Syst. 15(4), 471–479 (2013)
Jeng, J.-T., Chuang, C.-C., Chang, S.-C., Shih, N.-C.: Intelligent computation on symbolic data analysis: the proposed clustering technologies for the real symbolic data. In: Proceedings of 2015 international automatic control conference (2015)
Jeng, J.-T., Chang, S.-C., Chuang, C. C.: Interval fuzzy possibilistic c-means clustering algorithm on smart phone implement. In: 2014 Proceedings of the SICE Annual Conference (SICE), pp. 78–82 (2014)
Jeng, J.-T., Chuang, C.-C., Tao, C.W.: Interval competitive agglomeration clustering algorithm. Expert Syst. Appl. 37, 6567–6578 (2010)
Jeng, J.-T., Chuang, C.-C., Tseng, C.-C., Juan, C.-J.: Robust interval competitive agglomeration clustering algorithm with outliers. Int. J. Fuzzy Syst. 12(3), 227–236 (2010)
Su, S.-F., Chuang, C.-C., Tao, C.W., Jeng, J.-T., Hsiao, C.-C.: Radial basis function networks with linear interval regression weights for symbolic interval data. IEEE Trans. Syst. Man Cybern. B Cybern. 42(1), 69–80 (2012)
Bhattacharyya, S., De, S., Pan, I., Dutta, P.: Intelligent Multidimensional Data Clustering and Analysis. IGI Global, Hershey (2017)
Velmurugan, T., Santhanam, T.: Computational complexity between k-means and k-medoids clustering algorithms for normal and uniform distributions of data points. J. Comput. Sci. 6(3), 363–368 (2010)
De Carvalho, F.A.T., Souza, R.M.C.R., Chavent, M., Lechevallier, Y.: Adaptive Hausdorff distances and dynamic clustering of symbolic data. Pattern Recogn. Lett. 27, 167–179 (2006)
Gowda, K.C., Ravi, T.R.: Divisive clustering of symbolic objects using the concepts of both similarity and dissimilarity. Pattern Recogn. 28(8), 1277–1282 (1995)
Jain, A.K., Murty, M.N., Flynn, P.J.: Data clustering: a review. ACM Comput. Surv. 31(3), 264–323 (1999)
Chuang, C.-C., Li, C.-W., Zhang, C.-X., Lin, C.-C., Jeng, J.-T., Hsiao, C.-C.: A novel approach for linear interval regression models with fuzzy weights. In: 2016 Joint 8th International Conference on Soft Computing and Intelligent Systems and 17th International Symposium on Advanced Intelligent Systems, pp. 806–810 (2016)
Acknowledgements
This work was supported by National Science Council under Grant NSC 101-2221-E-150-061-MY3.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Jeng, JT., Chen, CM., Chang, SC. et al. IPFCM Clustering Algorithm Under Euclidean and Hausdorff Distance Measure for Symbolic Interval Data. Int. J. Fuzzy Syst. 21, 2102–2119 (2019). https://doi.org/10.1007/s40815-019-00707-w
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s40815-019-00707-w