Hybrid clustering of data and vague concepts based on labels semantics

Qin, Zengchang; Wan, Tao; Zhao, Hanqing

doi:10.1007/s10479-017-2541-0

Hybrid clustering of data and vague concepts based on labels semantics

IUKM2015
Published: 10 June 2017

Volume 256, pages 393–416, (2017)
Cite this article

Annals of Operations Research Aims and scope Submit manuscript

228 Accesses
3 Citations
Explore all metrics

Abstract

Data clustering is the process of dividing data elements into clusters so that items in the same cluster are as similar as possible, and items in different clusters are as dissimilar as possible. One of the key features for clustering is how to define a sensible similarity measure. Such measures usually handle data in one modality, but unable to cluster data from different modalities. Based on fuzzy set and prototype theory interpretations of label semantics, two (dis) similarity measures are proposed by which we can automatically cluster data and vague concepts represented by logical expressions of linguistic labels. Experimental results on a toy problem and one in image classification demonstrate the effectiveness of new clustering algorithms. Since our new proposed measures can be extended to measuring distance between any two granularities, the new clustering algorithms can also be extended to cluster data instance and imprecise concepts represented by other granularities.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Clustering Data and Vague Concepts Using Prototype Theory Interpreted Label Semantics

Semi-supervised Fuzzy c-Means Algorithms by Revising Dissimilarity/Kernel Matrices

Fuzzy cluster analysis algorithm for image data based on the extracted feature intervals

Article 21 September 2023

Kim-Ngoc T. Le, Dan Nguyenthihong & Tai Vovan

References

Beg, M. M. S., Thint, M., & Qin, Z. (2007). PNL-enhanced restricted domain question answering system. The Proceedings of IEEE-FUZZ, 1277–1283.
Bezdek, J. (1981). Pattern recognition with fuzzy objective function algorithms. ISBN 0-306-40671-3.
Carneiro, G., Chan, A. B., Moreno, P. J., & Vasconcelos, N. (2006). Supervised learning of semantic classes for image annotation and retrieval. IEEE Transactions on Pattern Analysis and Machine Intelligence, 29(3), 394–410.
Article Google Scholar
Chakraborty, C., & Chakraborty, D. (2006). A theoretical development on a fuzzy distance measure for fuzzy numbers. Mathematical and Computer Modelling, 43, 254–261.
Article Google Scholar
Deng, Z., Jiang, Y., Chung, F.-L., Ishibuchi, H., Choi, K.-S., & Wang, S. (2016). Transfer prototype-based fuzzy clustering. IEEE Transactions on Fuzzy Systems, 24(5), 1210–1232.
Article Google Scholar
Diamond, P. (1988). Fuzzy least squares. Information Sciences, 46, 141–157.
Article Google Scholar
Dunn, J. C. (1973). A fuzzy relative of the ISODATA process and its use in detecting compact well-separated clusters. Journal of Cybernetics, 3, 32–57.
Article Google Scholar
Ghosh, S., & Kumar Dubey, S. (2013). Comparative analysis of K-means and fuzzy C-means algorithms. International Journal of Advanced Computer Science and Applications, 4(4), 35–39.
Article Google Scholar
Hyung, L. K., Song, Y. S., & Lee, K. M. (1994). Similarity measure between fuzzy sets and between elements. Fuzzy Sets and System, 62, 291–293.
Article Google Scholar
Jain, A. K. (2010). Data clustering: 50 years beyond K-means. Pattern Recognition Letters, 31(8), 651–666.
Article Google Scholar
Lawry, J. (2004). A framework for linguistic modeling. Artificial Intelligence, 155, 1–39.
Article Google Scholar
Lawry, J. (2006). Modelling and reasoning with vague concepts. Berlin: Springer.
Google Scholar
Lawry, J., & Tang, Y. (2009). Uncertainty modelling for vague concepts: A prototype theory approach. Artificial Intelligence, 173.18(2009), 1539–1558.
Article Google Scholar
Li, D.-F. (2004). Some measures of dissimilarity in intuitionistic fuzzy structures. Journal of Computer and System Sciences, 8, 115–122.
Article Google Scholar
Lavrenko, V., Manmatha, R., & Jeon, J. (2004). A model for learning the semantics of pictures. Advances in Neural Information Processing Systems, 16, 553–560.
Google Scholar
MacQueen, J. B. (1967). Some methods for classification and analysis of multivariate observations. In Proceedings of 5th Berkeley symposium on mathematical statistics and probability (pp. 281–297). University of California Press.
Miyamoto, S. (1990). Fuzzy sets in information retrieval and cluster analysis. Dordrecht: Kluwer Academic Publishers.
Book Google Scholar
Pedrycz, W. (2005). Knowledge-based clustering. Hoboken: Wiley.
Book Google Scholar
Qin, Z., & Lawry, J. (2005). Decision tree learning with fuzzy labels. Information Sciences, 172(1–2), 91–129.
Article Google Scholar
Qin, Z., & Lawry, J. (2008). LFOIL: Linguistic rule induction in the label semantics framework. Fuzzy Sets and Systems, 159, 435–448.
Article Google Scholar
Qin, Z., & Tang, Y. (2014). Uncertainty modeling for data mining: A label semantics approach. Berlin: Springer.
Book Google Scholar
Qin, Z., Thint, M., & Beg, M. M. S. (2007). Deduction engine designs for PNL-based question answering systems. Foundations of Fuzzy Logic and Soft Computing, LNAI 4529, 253–262.
Article Google Scholar
Talavera, L., & Bejar, J. (2001). Generality-based conceptual clustering with probabilistic concepts. IEEE Transactions on Pattern Analysis and Machine Intelligence, 23(2), 196–206.
Article Google Scholar
Yang, K., & Ko, C.-H. (1997). On cluster-wise fuzzy regression analysis. IEEE Transaction on Systems, Man and Cybernetics B, 27, 1–13.
Article Google Scholar
Yong, Y., Chongxun, Z., & Pan, L. (2004). A novel fuzzy C-means clustering algorithm for image thresholding. Measurement Science Review, 4(1), 11–19.
Google Scholar
Zadeh, L. A. (1975). The concept of linguistic variable and its application to approximate reasoning Part 2. Information Science, 8, 301–357.
Article Google Scholar
Zadeh, L. A. (1996). Fuzzy logic \(=\) computing with words. IEEE Transaction on Fuzzy Systems, 4, 103–111.
Article Google Scholar
Zadeh, L. A. (2012). Computing with words: Principal concepts and ideas. Studies in fuzziness and soft computing. Berlin: Springer.
Book Google Scholar

Download references

Acknowledgements

This work is partially supported by the Natural Science Foundation of China under Grant Nos. 61401012, 61305047 and the NUTP for Innovation and Entrepreneurship of China under No. 201510006143.

Author information

Authors and Affiliations

Intelligent Computing and Machine Learning Lab, School of Automation Science and Electrical Engineering, Beihang University, Beijing, China
Zengchang Qin
School of Biological Science and Medical Engineering, Beihang University, Beijing, China
Tao Wan
École Centrale de Pékin, Beihang University, Beijing, 100191, China
Hanqing Zhao

Authors

Zengchang Qin
View author publications
You can also search for this author in PubMed Google Scholar
Tao Wan
View author publications
You can also search for this author in PubMed Google Scholar
Hanqing Zhao
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Tao Wan.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Qin, Z., Wan, T. & Zhao, H. Hybrid clustering of data and vague concepts based on labels semantics. Ann Oper Res 256, 393–416 (2017). https://doi.org/10.1007/s10479-017-2541-0

Download citation

Published: 10 June 2017
Issue Date: September 2017
DOI: https://doi.org/10.1007/s10479-017-2541-0

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Hybrid clustering of data and vague concepts based on labels semantics

Abstract

Access this article

Similar content being viewed by others

Clustering Data and Vague Concepts Using Prototype Theory Interpreted Label Semantics

Semi-supervised Fuzzy c-Means Algorithms by Revising Dissimilarity/Kernel Matrices

Fuzzy cluster analysis algorithm for image data based on the extracted feature intervals

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Hybrid clustering of data and vague concepts based on labels semantics

Abstract

Access this article

Similar content being viewed by others

Clustering Data and Vague Concepts Using Prototype Theory Interpreted Label Semantics

Semi-supervised Fuzzy c-Means Algorithms by Revising Dissimilarity/Kernel Matrices

Fuzzy cluster analysis algorithm for image data based on the extracted feature intervals

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation