Skip to main content
Log in

IPFCM Clustering Algorithm Under Euclidean and Hausdorff Distance Measure for Symbolic Interval Data

  • Published:
International Journal of Fuzzy Systems Aims and scope Submit manuscript

Abstract

In this paper, a novel interval possibilistic fuzzy c-means (IPFCM) clustering method is proposed for clustering symbolic interval data. Clustering algorithms are important methods that required in pattern recognition, data mining and text mining, etc. Most of cluster algorithms focus on single-valued data. The interval fuzzy c-means (IFCM) clustering method is first proposed for symbolic interval data. However, the IFCM may not have good performance when facing noisy date or data with outliers. Hence, the advantages of the proposed IPFCM clustering algorithm in this study mainly overcome the noisy date or data with outliers in symbolic interval data. In the proposed approach, two different distances; namely, Euclidean and Hausdorff distance measures are all individually considered. That is, based on Lagrange multipliers nonlinear programming method, the proposed IPFCM clustering algorithm under Euclidean and Hausdorff distance measure was individually derived for symbolic interval data. From our experimental results, the proposed IPFCM clustering algorithm indeed has better performance than the IFCM and the interval fuzzy possibilistic c-means (IFPCM) clustering algorithm under different distance measures for the noisy date or data with outlier problem. Besides, the proposed IPFCM clustering algorithm is also implemented on windows smart phone platform and shown better performance as expected. Finally, a real data set is added for testing.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14

Similar content being viewed by others

References

  1. Billard, L., Diday, E.: Symbolic Data Analysis: Conceptual statistics and Data Mining. Wiley, Hoboken (2006)

    Book  MATH  Google Scholar 

  2. Mizuta, M.: Meta-analysis and SDA. In: 6th International Workshop Symbolic Data Analysis, Ljubljana, Slovenia (2017)

  3. Diday, E., Noirhomme-Fraiture, M.: Symbolic Data Analysis and the SODAS Software. Wiley, Hoboken (2008)

    MATH  Google Scholar 

  4. Domingues, M.A.O., de Souza, R.M.C.R., Cysneiros, F.J.A.: A robust method for linear regression of symbolic interval data. Pattern Recogn. Lett. 31, 1991–1996 (2010)

    Article  Google Scholar 

  5. Oliveira, M.R., Vilela, M., Pacheco, A., Valadas, R., Salvador, P.: Extracting information from interval data using symbolic principal component analysis. Austrian J. Stat. 46, 79–87 (2017)

    Article  Google Scholar 

  6. From wikipedia for cluster analysis: http://en.wikipedia.org/wiki/Cluster_analysis

  7. Cebeci, Z., Yildiz, F.: Comparison of k-means and fuzzy c-means algorithms on different cluster structures. J. Agric. Inform. 6, 13–23 (2015)

    Google Scholar 

  8. Bezdek, J.C.: A convergence theorem for the fuzzy ISODATA clustering algorithms. IEEE Trans. Pattern Anal. Mach. Intell. 2, 1–8 (1980)

    Article  MathSciNet  MATH  Google Scholar 

  9. Pal, N.R., Pal, K., Keller, J.M., Bezdek, J.C.: A possibilistic fuzzy C-means clustering algorithm. IEEE Trans. Fuzzy Syst. 13, 517–530 (2005)

    Article  Google Scholar 

  10. Maji, P., Pal, S.K.: Rough set based generalized fuzzy c-means algorithm and quantitative indices. IEEE Trans. Syst. Man Cybern. B Cybern. 37, 1529–1540 (2007)

    Article  Google Scholar 

  11. Gowda, K.C., Diday, E.: Symbolic clustering using a new dissimilarity measure. Pattern Recogn. 24(6), 567–578 (1991)

    Article  Google Scholar 

  12. Gowda, K.C., Ravi, T.R.: Clustering of symbolic objects using gravitational approach. IEEE Trans. Syst. Man Cybern. 29(6), 888–894 (1999)

    Article  Google Scholar 

  13. El-Sonbaty, Y., Ismail, M.A.: Fuzzy clustering for symbolic data. IEEE Trans Fuzzy Syst. 6(2), 195–204 (1998)

    Article  Google Scholar 

  14. Ralambondrainy, H.: A conceptual version of the k-means algorithm. Pattern Recogn. Lett. 16, 1147–1157 (1995)

    Article  Google Scholar 

  15. Beg, I., Rashid, T.: Fuzzy distance measure and fuzzy clustering algorithm. J. Interdiscip. Math. 18, 471–492 (2015)

    Article  Google Scholar 

  16. de Carvalho, F.D.A.T.: Fuzzy C-means clustering methods for symbolic interval data. Pattern Recogn. Lett. 28, 423–437 (2007)

    Article  Google Scholar 

  17. Evsukoff, A.G., Branco, A.C.S., Galichet, S.: Intelligent data analysis and model interpretation with spectral analysis fuzzy symbolic modeling. Int. J. Approx. Reason. 52, 728–750 (2011)

    Article  Google Scholar 

  18. Chuang, C.-C., Jeng, J.-T., Chang, S.-C.: Hausdorff distance measure based interval fuzzy possibilistic c-means clustering algorithm. Int. J. Fuzzy Syst. 15(4), 471–479 (2013)

    MathSciNet  Google Scholar 

  19. Jeng, J.-T., Chuang, C.-C., Chang, S.-C., Shih, N.-C.: Intelligent computation on symbolic data analysis: the proposed clustering technologies for the real symbolic data. In: Proceedings of 2015 international automatic control conference (2015)

  20. Jeng, J.-T., Chang, S.-C., Chuang, C. C.: Interval fuzzy possibilistic c-means clustering algorithm on smart phone implement. In: 2014 Proceedings of the SICE Annual Conference (SICE), pp. 78–82 (2014)

  21. Jeng, J.-T., Chuang, C.-C., Tao, C.W.: Interval competitive agglomeration clustering algorithm. Expert Syst. Appl. 37, 6567–6578 (2010)

    Article  Google Scholar 

  22. Jeng, J.-T., Chuang, C.-C., Tseng, C.-C., Juan, C.-J.: Robust interval competitive agglomeration clustering algorithm with outliers. Int. J. Fuzzy Syst. 12(3), 227–236 (2010)

    Google Scholar 

  23. Su, S.-F., Chuang, C.-C., Tao, C.W., Jeng, J.-T., Hsiao, C.-C.: Radial basis function networks with linear interval regression weights for symbolic interval data. IEEE Trans. Syst. Man Cybern. B Cybern. 42(1), 69–80 (2012)

    Article  Google Scholar 

  24. Bhattacharyya, S., De, S., Pan, I., Dutta, P.: Intelligent Multidimensional Data Clustering and Analysis. IGI Global, Hershey (2017)

    Book  Google Scholar 

  25. Velmurugan, T., Santhanam, T.: Computational complexity between k-means and k-medoids clustering algorithms for normal and uniform distributions of data points. J. Comput. Sci. 6(3), 363–368 (2010)

    Article  Google Scholar 

  26. De Carvalho, F.A.T., Souza, R.M.C.R., Chavent, M., Lechevallier, Y.: Adaptive Hausdorff distances and dynamic clustering of symbolic data. Pattern Recogn. Lett. 27, 167–179 (2006)

    Article  Google Scholar 

  27. Gowda, K.C., Ravi, T.R.: Divisive clustering of symbolic objects using the concepts of both similarity and dissimilarity. Pattern Recogn. 28(8), 1277–1282 (1995)

    Article  Google Scholar 

  28. Jain, A.K., Murty, M.N., Flynn, P.J.: Data clustering: a review. ACM Comput. Surv. 31(3), 264–323 (1999)

    Article  Google Scholar 

  29. Chuang, C.-C., Li, C.-W., Zhang, C.-X., Lin, C.-C., Jeng, J.-T., Hsiao, C.-C.: A novel approach for linear interval regression models with fuzzy weights. In: 2016 Joint 8th International Conference on Soft Computing and Intelligent Systems and 17th International Symposium on Advanced Intelligent Systems, pp. 806–810 (2016)

Download references

Acknowledgements

This work was supported by National Science Council under Grant NSC 101-2221-E-150-061-MY3.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Chen-Chia Chuang.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Jeng, JT., Chen, CM., Chang, SC. et al. IPFCM Clustering Algorithm Under Euclidean and Hausdorff Distance Measure for Symbolic Interval Data. Int. J. Fuzzy Syst. 21, 2102–2119 (2019). https://doi.org/10.1007/s40815-019-00707-w

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s40815-019-00707-w

Keywords

Navigation