Abstract
Clustering is a significant unsupervised learning method in the machine learning field, which can mine the distribution pattern and attribute of data. However, traditional clustering methods can not fully represent the attribution relationship between objects and classes. Therefore, a three-way clustering (3WC), which combines three-way decision (3WD) with clustering, has gradually received widespread attention from researchers in recent years. However, existing 3WC methods mostly use traditional clustering results or randomly assigned results as initial division results, which largely ignore the distribution relation of each object. Moreover, most of 3WC methods are soft clustering, i.e., there are some objects that will belong to more than one class, which makes clustering results more ambiguous. In light of this situation, we establish a feature distribution-based adaptive three-way clustering (3WC-D) method to address the above challenge. First, 3WC-D utilizes 3WD to characterize the distribution relation of objects for obtaining initial clustering results. Then, several representative classes are selected for further processing based on the interrelationship among classes in initial clustering results. Finally, the remaining objects are divided according to the relative relation between objects and classes, so as final clustering results can be obtained, and the effectiveness of the method is illustrated by comparing with several clustering methods on diverse datasets.






Similar content being viewed by others
Explore related subjects
Discover the latest articles and news from researchers in related subjects, suggested using machine learning.References
Yao YY, decision Three-way (2009) An interpretation of rules in rough set theory, RSKT, LNCS 5589, pp 642–649
Pawlak Z (1982) Rough sets. Int J Com Inform Sci 11(5):341–356
Wang MW, Liang DC, Xu ZS, Cao W (2022) Consensus reaching with the externality effect of social network for three-way group decisions. Ann Oper Res 315:707–745
Wang JJ, Ma XL, Xu ZS, Zhan JM (2022) Regret theory-based three-way decision model in hesitant fuzzy environments and its application to medical decision. IEEE Tran Fuzzy Syst. https://doi.org/10.1109/TFUZZ.2022.3176686
Li JH, Huang CC, Qi JJ, Qian YH, Liu WQ (2017) Three-way cognitive concept learning via multi-granularity. Inform Sci 378:244–263
Savchenko AV (2019) Sequential three-way decisions in multi-category image recognition with deep features based on distance factor. Inform Sc 489:18–36
Yao YY, Zhou B (2010) Naive bayesian rough sets, RSKT, LNCS 6401, pp 719–726
Gao M, Zhang QH, Zhao F, Wang GY (2020) Mean-entropy-based shadowed sets: A novel three-way approximation of fuzzy sets. Int J Approx Reason 120:102–124
Wang WJ, Zhan JM, Zhang C (2021) Three-way decisions based multi-attribute decision making with probabilistic dominance relations. Inform Sci 559:75–96
Zhang Y, Yao JT (2020) Game theoretic approach to shadowed sets: A three-way tradeoff perspective. Inform Sci 507:540–552
Wang JJ, Ma XL, Dai JH, Zhan JM (2021) A novel three-way decision approach under hesitant fuzzy information. Inform Sci 578:482–506
Huang XF, Zhan JM, Sun BZ (2022) A three-way decision method with pre-order relations. Inform Sci 595:231–256
Zhu JX, Ma XL, Zhan JM (2022) A regret theory-based three-way decision approach with three strategies. Inform Sci 595:89–118
Zhan JM, Jiang HB, Yao YY (2021) Three-way multiattribute decision-making based on outranking relations. IEEE Tran Fuzzy Syst 29(10):2844–2858
Campagner A, Cabitza F, Ciucci D (2020) The three-way-in and three-way-out framework to treat and exploit ambiguity in data. Int J Approx Reason 119:292–312
Zhao XR, Miao DQ, Fujita H (2021) Variable-precision three-way concepts in L-contexts. Int J Approx Reason 130:107–125
Zhan JM, Ye J, Ding WP, Liu PD (2022) A novel three-way decision model based on utility theory in incomplete fuzzy decision systems. IEEE Tran Fuzzy Syst 30(7):2210–2226
Zhao XR, Hu BQ (2020) Three-way decisions with decision-theoretic rough sets in multiset-valued information tables. Inform Sci 507:684–699
Ye J, Zhan JM, Ding WP, Fujita H (2022) A novel three-way decision approach in decision information systems. Inform Sci 584:1–30
Yao YY (2021) The geometry of three-way decision. Appl Intell 51(9):6298–6325
Yang X, Zhang YY, Fujita H, Liu D, Li TR (2020) Local temporal-spatial multi-granularity learning for sequential three-way granular computing. Inform Sci 541:75–97
Qiao JS, Hu BQ (2020) On decision evaluation functions in generalized three-way decision spaces. Inform Sci 507:733– 754
Yang DD, Deng TQ, Fujita H (2020) Partial-overall dominance three-way decision models in interval-valued decision systems. Int J Approx Reason 126:308–325
Mandal P, Ranadive AS (2018) Multi-granulation bipolar-valued fuzzy probabilistic rough sets and their corresponding three-way decisions over two universes. Soft Comput 22(24):8207– 8226
Deng J, Zhan JM, Wu WZ (2022) A ranking method with a preference relation based on the PROMETHEE method in incomplete multi-scale information systems. Inform Sci 608:1261–1282
Luo JF, Fujita H, Yao YY, Qin KY (2020) On modeling similarity and three-way decision under incomplete information in rough set theory. Knowl-Based Syst 191:105251
Xu Y, Tang JX, Wang XS (2020) Three sequential multi-class three-way decision models. Inform Sci 537:62–90
Yang X, Chen Y, Fujita H, Liu D, Li TR (2022) Mixed data-driven sequential three-way decision via subjective-objective dynamic fusion. Knowl-Based Syst 237:107728
Chen J, Chen Y, He YC, Xu Y, Zhao S, Zhang YP (2022) A classified feature representation three-way decision model for sentiment analysis. Appl Intell 52:7995–8007
Ester M, Kriegel H–P, Sander J, Xu XW (1996) A density-based algorithm for discovering clusters in large spatial databases with noise. Proceedings of the 2nd International Conference on Knowledge Discovery and Data Mining 96(34):226–231
Kaufman L, Rousseeuw PJ (1990) Finding groups in data: An introduction to cluster analysis. Wiley
Yu H, Su T, Zeng XH (2014) A three-way decisions clustering algorithm for incomplete data, RSKT, LNCS 8818, pp 765–776
Yu H, Wang XC, Wang GY, Zeng XH (2020) An active three-way clustering method via low-rank matrices for multi-view data. Inform Sci 507:823–839
Afridi MK, Azam N, Yao JT, Alanazi E (2018) A three-way clustering approach for handling missing data using GTRS. Int J Approx Reason 98:11–24
Wang XL, Gong J, Song Y, Hu JH (2022) Adaptively weighted three-way decision oversampling: A cluster imbalanced-ratio based approach. Appl Intell, https://doi.org/10.1007/s10489-022-03394-7https://doi.org/10.1007/s10489-022-03394-7
Jiang CM, Li ZC, Yao JT (2022) A shadowed set-based three-way clustering ensemble approach. Int J Mach Learn Cybern 13(9):2545–2558
Zhang K (2019) A three-way c-means algorithm. Appl Soft Comput 82:105536
Shen QP, Zhang QH, Zhao F, Wang GY (2022) Adaptive three-way c-means clustering based on the cognition of distance stability. Cogn Comput 14(2):563–580
Yao YY (2010) Three-way decisions with probabilistic rough sets. Inform Sci 180(3):341–353
Jia F, Liu PD (2019) A novel three-way decision model under multiple-criteria environment. Inform Sci 471:29–51
Wang PX, Yao YY (2018) CE3: A three-way clustering method based on mathematical morphology. Knowl-Based Syst 155:54–65
Yu H, Chang ZH, Wang GY, Chen XF (2020) An efficient three-way clustering algorithm based on gravitational search. Int J Mach Learn Cyb 11(5):1003–1016
Yu H, Chen Y, Lingras P, Wang GY (2019) A three-way cluster ensemble approach for large-scale data. Int J Approx Reason 115:32–49
Chen LF, Jiang QS, Wang SR (2008) A hierarchical method for determining the number of clusters. J Software 19(1):62–72. (in Chinese)
Zhou ZH (2021) Machine Learning Tsinghua University Press
Rand WM (1971) Objective criteria for the evaluation of clustering methods. J Amer Stat Asso 66(336):846–850
Wang YT, Chen LH, Mei JP (2014) Incremental fuzzy clustering with multiple medoids for large data. IEEE Tran Fuzzy Syst 22(6):1557–1568
Xu W, Liu X, Gong YH (2003) Document clustering based on non-negative matrix factorization. In: Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval, pp 267–273
Rodriguez A, Laio A (2014) Clustering by fast search and find of density peaks. Science 344:1492–1496
Du MJ, Ding SF, Jia HJ (2016) Study on density peaks clustering based on k-nearest neighbors and principal component analysis. Knowl-Based Syst 99:135–145
Zhong JJ, Tse PW, Wei YH (2017) An intelligent and improved density and distance-based clustering approach for industrial survey data classification. Expert Syst Appl 21–28:68
Sun BZ, Bai JC, Chu XL, Sun SL, Li YW, Li HT (2022) Interval prediction approach to crude oil price based on three-way clustering and decomposition ensemble learning. Appl Soft Comput 123:108933
Xin XW, Shi CL, Sun JB, Xue ZA, Song JH, Peng WM (2022) A novel attribute reduction method based on intuitionistic fuzzy three-way cognitive clustering. Appl Intell https://doi.org/10.1007/s10489-022-03496-2
Ma WC, Tu XM, Luo B, Wang GH (2022) Semantic clustering based deduction learning for image recognition and classification. Pattern Recogn 124:108440
Acknowledgements
The authors are extremely grateful to the editors and five anonymous referees for their valuable comments which helped us improve the presentation of this article.
The research was partially supported by grants from NNSFC (12161036; 61866011; 12271146) and a Discovery Grant from NSERC, Canada.
Author information
Authors and Affiliations
Contributions
Rongtao Zhang: Conceptualization, Methodology, Investigation, Writing-original draft. Xueling Ma: Methodology, Investigation, Writing-original draft. Jianming Zhan: Methodology, Writing-Reviewing and Editing. Yiyu Yao: Methodology, Writing-Reviewing and Editing.
Corresponding authors
Ethics declarations
Conflict of Interests
The authors declared that they have no conflicts of interest to this work.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Zhang, R., Ma, X., Zhan, J. et al. 3WC-D: A feature distribution-based adaptive three-way clustering method. Appl Intell 53, 15561–15579 (2023). https://doi.org/10.1007/s10489-022-04332-3
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10489-022-04332-3