Skip to main content

Advertisement

Log in

3WC-D: A feature distribution-based adaptive three-way clustering method

  • Published:
Applied Intelligence Aims and scope Submit manuscript

Abstract

Clustering is a significant unsupervised learning method in the machine learning field, which can mine the distribution pattern and attribute of data. However, traditional clustering methods can not fully represent the attribution relationship between objects and classes. Therefore, a three-way clustering (3WC), which combines three-way decision (3WD) with clustering, has gradually received widespread attention from researchers in recent years. However, existing 3WC methods mostly use traditional clustering results or randomly assigned results as initial division results, which largely ignore the distribution relation of each object. Moreover, most of 3WC methods are soft clustering, i.e., there are some objects that will belong to more than one class, which makes clustering results more ambiguous. In light of this situation, we establish a feature distribution-based adaptive three-way clustering (3WC-D) method to address the above challenge. First, 3WC-D utilizes 3WD to characterize the distribution relation of objects for obtaining initial clustering results. Then, several representative classes are selected for further processing based on the interrelationship among classes in initial clustering results. Finally, the remaining objects are divided according to the relative relation between objects and classes, so as final clustering results can be obtained, and the effectiveness of the method is illustrated by comparing with several clustering methods on diverse datasets.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Algorithm 1
Algorithm 2
Algorithm 3
Fig. 1
Fig. 2
Fig. 3

Similar content being viewed by others

Explore related subjects

Discover the latest articles and news from researchers in related subjects, suggested using machine learning.

References

  1. Yao YY, decision Three-way (2009) An interpretation of rules in rough set theory, RSKT, LNCS 5589, pp 642–649

  2. Pawlak Z (1982) Rough sets. Int J Com Inform Sci 11(5):341–356

    MATH  Google Scholar 

  3. Wang MW, Liang DC, Xu ZS, Cao W (2022) Consensus reaching with the externality effect of social network for three-way group decisions. Ann Oper Res 315:707–745

    MathSciNet  MATH  Google Scholar 

  4. Wang JJ, Ma XL, Xu ZS, Zhan JM (2022) Regret theory-based three-way decision model in hesitant fuzzy environments and its application to medical decision. IEEE Tran Fuzzy Syst. https://doi.org/10.1109/TFUZZ.2022.3176686

  5. Li JH, Huang CC, Qi JJ, Qian YH, Liu WQ (2017) Three-way cognitive concept learning via multi-granularity. Inform Sci 378:244–263

    MATH  Google Scholar 

  6. Savchenko AV (2019) Sequential three-way decisions in multi-category image recognition with deep features based on distance factor. Inform Sc 489:18–36

    MathSciNet  MATH  Google Scholar 

  7. Yao YY, Zhou B (2010) Naive bayesian rough sets, RSKT, LNCS 6401, pp 719–726

  8. Gao M, Zhang QH, Zhao F, Wang GY (2020) Mean-entropy-based shadowed sets: A novel three-way approximation of fuzzy sets. Int J Approx Reason 120:102–124

    MathSciNet  MATH  Google Scholar 

  9. Wang WJ, Zhan JM, Zhang C (2021) Three-way decisions based multi-attribute decision making with probabilistic dominance relations. Inform Sci 559:75–96

    MathSciNet  MATH  Google Scholar 

  10. Zhang Y, Yao JT (2020) Game theoretic approach to shadowed sets: A three-way tradeoff perspective. Inform Sci 507:540–552

    MATH  Google Scholar 

  11. Wang JJ, Ma XL, Dai JH, Zhan JM (2021) A novel three-way decision approach under hesitant fuzzy information. Inform Sci 578:482–506

    MathSciNet  Google Scholar 

  12. Huang XF, Zhan JM, Sun BZ (2022) A three-way decision method with pre-order relations. Inform Sci 595:231–256

    Google Scholar 

  13. Zhu JX, Ma XL, Zhan JM (2022) A regret theory-based three-way decision approach with three strategies. Inform Sci 595:89–118

    Google Scholar 

  14. Zhan JM, Jiang HB, Yao YY (2021) Three-way multiattribute decision-making based on outranking relations. IEEE Tran Fuzzy Syst 29(10):2844–2858

    Google Scholar 

  15. Campagner A, Cabitza F, Ciucci D (2020) The three-way-in and three-way-out framework to treat and exploit ambiguity in data. Int J Approx Reason 119:292–312

    MathSciNet  MATH  Google Scholar 

  16. Zhao XR, Miao DQ, Fujita H (2021) Variable-precision three-way concepts in L-contexts. Int J Approx Reason 130:107–125

    MathSciNet  MATH  Google Scholar 

  17. Zhan JM, Ye J, Ding WP, Liu PD (2022) A novel three-way decision model based on utility theory in incomplete fuzzy decision systems. IEEE Tran Fuzzy Syst 30(7):2210–2226

    Google Scholar 

  18. Zhao XR, Hu BQ (2020) Three-way decisions with decision-theoretic rough sets in multiset-valued information tables. Inform Sci 507:684–699

    MathSciNet  MATH  Google Scholar 

  19. Ye J, Zhan JM, Ding WP, Fujita H (2022) A novel three-way decision approach in decision information systems. Inform Sci 584:1–30

    Google Scholar 

  20. Yao YY (2021) The geometry of three-way decision. Appl Intell 51(9):6298–6325

    Google Scholar 

  21. Yang X, Zhang YY, Fujita H, Liu D, Li TR (2020) Local temporal-spatial multi-granularity learning for sequential three-way granular computing. Inform Sci 541:75–97

    MathSciNet  MATH  Google Scholar 

  22. Qiao JS, Hu BQ (2020) On decision evaluation functions in generalized three-way decision spaces. Inform Sci 507:733– 754

    MathSciNet  MATH  Google Scholar 

  23. Yang DD, Deng TQ, Fujita H (2020) Partial-overall dominance three-way decision models in interval-valued decision systems. Int J Approx Reason 126:308–325

    MathSciNet  MATH  Google Scholar 

  24. Mandal P, Ranadive AS (2018) Multi-granulation bipolar-valued fuzzy probabilistic rough sets and their corresponding three-way decisions over two universes. Soft Comput 22(24):8207– 8226

    MATH  Google Scholar 

  25. Deng J, Zhan JM, Wu WZ (2022) A ranking method with a preference relation based on the PROMETHEE method in incomplete multi-scale information systems. Inform Sci 608:1261–1282

    Google Scholar 

  26. Luo JF, Fujita H, Yao YY, Qin KY (2020) On modeling similarity and three-way decision under incomplete information in rough set theory. Knowl-Based Syst 191:105251

    Google Scholar 

  27. Xu Y, Tang JX, Wang XS (2020) Three sequential multi-class three-way decision models. Inform Sci 537:62–90

    MathSciNet  MATH  Google Scholar 

  28. Yang X, Chen Y, Fujita H, Liu D, Li TR (2022) Mixed data-driven sequential three-way decision via subjective-objective dynamic fusion. Knowl-Based Syst 237:107728

    Google Scholar 

  29. Chen J, Chen Y, He YC, Xu Y, Zhao S, Zhang YP (2022) A classified feature representation three-way decision model for sentiment analysis. Appl Intell 52:7995–8007

    Google Scholar 

  30. Ester M, Kriegel H–P, Sander J, Xu XW (1996) A density-based algorithm for discovering clusters in large spatial databases with noise. Proceedings of the 2nd International Conference on Knowledge Discovery and Data Mining 96(34):226–231

    Google Scholar 

  31. Kaufman L, Rousseeuw PJ (1990) Finding groups in data: An introduction to cluster analysis. Wiley

  32. Yu H, Su T, Zeng XH (2014) A three-way decisions clustering algorithm for incomplete data, RSKT, LNCS 8818, pp 765–776

  33. Yu H, Wang XC, Wang GY, Zeng XH (2020) An active three-way clustering method via low-rank matrices for multi-view data. Inform Sci 507:823–839

    Google Scholar 

  34. Afridi MK, Azam N, Yao JT, Alanazi E (2018) A three-way clustering approach for handling missing data using GTRS. Int J Approx Reason 98:11–24

    MathSciNet  MATH  Google Scholar 

  35. Wang XL, Gong J, Song Y, Hu JH (2022) Adaptively weighted three-way decision oversampling: A cluster imbalanced-ratio based approach. Appl Intell, https://doi.org/10.1007/s10489-022-03394-7https://doi.org/10.1007/s10489-022-03394-7

  36. Jiang CM, Li ZC, Yao JT (2022) A shadowed set-based three-way clustering ensemble approach. Int J Mach Learn Cybern 13(9):2545–2558

    Google Scholar 

  37. Zhang K (2019) A three-way c-means algorithm. Appl Soft Comput 82:105536

    Google Scholar 

  38. Shen QP, Zhang QH, Zhao F, Wang GY (2022) Adaptive three-way c-means clustering based on the cognition of distance stability. Cogn Comput 14(2):563–580

    Google Scholar 

  39. Yao YY (2010) Three-way decisions with probabilistic rough sets. Inform Sci 180(3):341–353

    MathSciNet  Google Scholar 

  40. Jia F, Liu PD (2019) A novel three-way decision model under multiple-criteria environment. Inform Sci 471:29–51

    MathSciNet  MATH  Google Scholar 

  41. Wang PX, Yao YY (2018) CE3: A three-way clustering method based on mathematical morphology. Knowl-Based Syst 155:54–65

    Google Scholar 

  42. Yu H, Chang ZH, Wang GY, Chen XF (2020) An efficient three-way clustering algorithm based on gravitational search. Int J Mach Learn Cyb 11(5):1003–1016

    Google Scholar 

  43. Yu H, Chen Y, Lingras P, Wang GY (2019) A three-way cluster ensemble approach for large-scale data. Int J Approx Reason 115:32–49

    MathSciNet  MATH  Google Scholar 

  44. Chen LF, Jiang QS, Wang SR (2008) A hierarchical method for determining the number of clusters. J Software 19(1):62–72. (in Chinese)

    MATH  Google Scholar 

  45. Zhou ZH (2021) Machine Learning Tsinghua University Press

  46. Rand WM (1971) Objective criteria for the evaluation of clustering methods. J Amer Stat Asso 66(336):846–850

    Google Scholar 

  47. Wang YT, Chen LH, Mei JP (2014) Incremental fuzzy clustering with multiple medoids for large data. IEEE Tran Fuzzy Syst 22(6):1557–1568

    Google Scholar 

  48. Xu W, Liu X, Gong YH (2003) Document clustering based on non-negative matrix factorization. In: Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval, pp 267–273

  49. Rodriguez A, Laio A (2014) Clustering by fast search and find of density peaks. Science 344:1492–1496

    Google Scholar 

  50. Du MJ, Ding SF, Jia HJ (2016) Study on density peaks clustering based on k-nearest neighbors and principal component analysis. Knowl-Based Syst 99:135–145

    Google Scholar 

  51. Zhong JJ, Tse PW, Wei YH (2017) An intelligent and improved density and distance-based clustering approach for industrial survey data classification. Expert Syst Appl 21–28:68

    Google Scholar 

  52. Sun BZ, Bai JC, Chu XL, Sun SL, Li YW, Li HT (2022) Interval prediction approach to crude oil price based on three-way clustering and decomposition ensemble learning. Appl Soft Comput 123:108933

    Google Scholar 

  53. Xin XW, Shi CL, Sun JB, Xue ZA, Song JH, Peng WM (2022) A novel attribute reduction method based on intuitionistic fuzzy three-way cognitive clustering. Appl Intell https://doi.org/10.1007/s10489-022-03496-2

  54. Ma WC, Tu XM, Luo B, Wang GH (2022) Semantic clustering based deduction learning for image recognition and classification. Pattern Recogn 124:108440

    Google Scholar 

Download references

Acknowledgements

The authors are extremely grateful to the editors and five anonymous referees for their valuable comments which helped us improve the presentation of this article.

The research was partially supported by grants from NNSFC (12161036; 61866011; 12271146) and a Discovery Grant from NSERC, Canada.

Author information

Authors and Affiliations

Authors

Contributions

Rongtao Zhang: Conceptualization, Methodology, Investigation, Writing-original draft. Xueling Ma: Methodology, Investigation, Writing-original draft. Jianming Zhan: Methodology, Writing-Reviewing and Editing. Yiyu Yao: Methodology, Writing-Reviewing and Editing.

Corresponding authors

Correspondence to Xueling Ma or Jianming Zhan.

Ethics declarations

Conflict of Interests

The authors declared that they have no conflicts of interest to this work.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zhang, R., Ma, X., Zhan, J. et al. 3WC-D: A feature distribution-based adaptive three-way clustering method. Appl Intell 53, 15561–15579 (2023). https://doi.org/10.1007/s10489-022-04332-3

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10489-022-04332-3

Keywords