Skip to main content

Clustering of Categorical Data Using Intuitionistic Fuzzy k-modes

  • Conference paper
  • First Online:
Proceedings of Sixth International Conference on Soft Computing for Problem Solving

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 546))

Abstract

Clustering is an important unsupervised learning algorithm that groups records according to their similarity. However, since uncertainty has become an inherent of real world datasets, crisp clustering leads to inefficiency. Hence, introduction of uncertainty based models like the fuzzy set and the intuitionistic fuzzy set is necessary to compensate for the ambiguity in data. Huang, in his fuzzy k-modes algorithm, introduced the fuzzy component in clustering categorical data by modifying the existing k-means algorithm. This correspondence describes an intuitionistic fuzzy k-modes algorithm for clustering categorical data and establishes it to be more efficient than the fuzzy k-modes algorithm. Metrics like accuracy, DB index and Dunn index are used to compare the efficiency of the two algorithms. The experimental analysis section shows that the proposed algorithm is more efficient than the existing one. Several graphical and tabular representations have been provided for easy comparison of the results.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Michalski, R.S.: Soybean (Small) Data Set. UCI Machine Learning Repository, Irvine. https://archive.ics.uci.edu/ml/datasets/Soybean+(Small)

  2. Forsyth, R.: Zoo Data Set. UCI Machine Learning Repository, Irvine. https://archive.ics.uci.edu/ml/datasets/Zoo

  3. Huang, Z., Ng, M.K.: A fuzzy k-modes algorithm for clustering categorical data. IEEE Trans. Fuzzy Syst. 7(4), 446–452 (1999)

    Article  Google Scholar 

  4. Pawlak, Z.: Rough sets. Int. J. Comput. Inform. Sci. 11(5), 341–356 (1982)

    Article  MATH  Google Scholar 

  5. Atanassov, K.T.: Intuitionistic fuzzy sets. Fuzzy Sets Syst. 20(1), 87–96 (1986)

    Article  MathSciNet  MATH  Google Scholar 

  6. Senthamilarasu, S., Hemalatha, M.: A genetic algorithm based intuitionistic fuzzification technique for attribute selection. Indian J. Sci. Technol. 6(4), 4336–4346 (2013)

    Google Scholar 

  7. Yager, R.R.: Some aspects of intuitionistic fuzzy sets. Fuzzy Optim. Decis. Mak. 8(1), 67–90 (2009)

    Article  MathSciNet  MATH  Google Scholar 

  8. Sugeno, M.: Fuzzy measures and fuzzy integrals: a survey. In: Gupta, M.M., Saridis, G.N., Gaines, B.R. (eds.) Fuzzy Automata and Decision Processes, pp. 89–102 (1977)

    Google Scholar 

  9. Bock, H.H.: Clustering methods: a history of k-means algorithms. In: Selected Contributions in Data Analysis and Classification, pp. 161–172. Springer, Berlin (2007)

    Google Scholar 

  10. Chaira, T., Anand, S.: A novel intuitionistic fuzzy approach for tumor/hemorrhage detection in medical images. J. Sci. Ind. Res. 70(6), 427–434 (2011)

    Google Scholar 

  11. Zadeh, L.A.: Fuzzy sets. Inf. Control 8(3), 338–353 (1965)

    Google Scholar 

  12. Davies, D.L., Bouldin, D.W.: A cluster separation measure. IEEE Trans. Pattern Anal. Mach. Intell. 2, 224–227 (1979)

    Article  Google Scholar 

  13. Dunn, J.C.: A fuzzy relative of the ISODATA process and its use in detecting compact well-separated clusters. J. Cybern. 3, 32–57 (1973)

    Article  MathSciNet  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Darshan Mehta .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer Nature Singapore Pte Ltd.

About this paper

Cite this paper

Mehta, D., Tripathy, B.K. (2017). Clustering of Categorical Data Using Intuitionistic Fuzzy k-modes. In: Deep, K., et al. Proceedings of Sixth International Conference on Soft Computing for Problem Solving. Advances in Intelligent Systems and Computing, vol 546. Springer, Singapore. https://doi.org/10.1007/978-981-10-3322-3_24

Download citation

  • DOI: https://doi.org/10.1007/978-981-10-3322-3_24

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-10-3321-6

  • Online ISBN: 978-981-10-3322-3

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics