Abstract
Fuzzy co-clustering is a technique for extracting co-clusters of mutually familiar pairs of objects and items from co-occurrence information among them, and has been utilized in document analysis on document-keyword relations and market analysis on purchase preferences of customers with products. Recently, multi-view data clustering attracts much attentions with the goal of revealing the intrinsic features among multi-source data stored over different organizations. In this paper, three-mode document data analysis is considered under multi-view analysis of document-keyword relations in conjunction with semantic information among keywords, where the results of two different approaches are compared. Fuzzy Bag-of-Words (Fuzzy BoW) introduces semantic information among keywords such that co-occurrence degrees are counted supported by fuzzy mapping of semantically similar keywords. On the other hand, three-mode fuzzy co-clustering simultaneously considers the cluster-wise aggregation degree among documents, keywords and semantic similarities. Numerical results with a Japanese novel document demonstrate the different features of these two approaches.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
English translation is also available in Eldritch Press (http://www.ibiblio.org/eldritch/).
References
Oh, C.-H., Honda, K., Ichihashi, H.: Fuzzy clustering for categorical multivariate data. In: Joint 9th IFSA World Congress and 20th NAFIPS International Conference, pp. 2154–2159 (2001)
Kummamuru, K., Dhawale, A., Krishnapuram, R.: Fuzzy co-clustering of documents and keywords. In: Proceedings of the 12th IEEE International Conference on Fuzzy Systems, vol. 2, pp. 772–777 (2003)
Bezdek, J.C.: Pattern Recognition with Fuzzy Objective Function Algorithms. Plenum Press, New York (1981)
Miyamoto, S., Ichihashi, H., Honda, K.: Algorithms for Fuzzy Clustering. Springer, Heidelberg (2008)
Honda, K., Oshio, S., Notsu, A.: Fuzzy co-clustering induced by multinomial mixture models. J. Adv. Comput. Intell. Intell. Inform. 19(6), 717–726 (2015)
Yang, Y., Wang, H.: Multi-view clustering: a survey. Big Data Min. Anal. 1, 83–107 (2018)
Bisson, G., Grimal, C.: Co-clustering of multi-view datasets: a parallelizable approach. In: Proceedings of the 2012 IEEE 12th International Conference on Data Mining, pp. 828–833 (2012)
Nishida, Y., Honda, K.: Visualization of potential technical solutions by self-organizing maps and co-cluster extraction. J. Adv. Comput. Intell. Intell. Inform. 24(1), 65–72 (2020)
Lan, M., Tan, C.L., Su, J., Lu, Y.: Supervised and traditional term weighting methods for automatic text categorization. IEEE Trans. Pattern Anal. Mach. Intell. 31(4), 721–735 (2009)
Zhao, L., Mao, K.: Fuzzy bag-of-words model for document representation. IEEE Trans. Fuzzy Syst. 26(2), 794–804 (2018)
Chen, T.-C.T., Honda, K.: Fuzzy Collaborative Forecasting and Clustering, SpringerBriefs in Applied Sciences and Technology. Springer, Heidelberg (2019)
Honda, K., Hayashi, I., Ubukata, S., Notsu, A.: Three-mode fuzzy co-clustering based on probabilistic concept and comparison with FCM-type algorithms. J. Adv. Comput. Intell. Intell. Inform. 25(4), 478–488 (2021)
Salton, G., Buckley, C.: Term-weighting approaches in automatic text retrieval. Inf. Process. Manag. 24(5), 513–523 (1988)
Mikolov, T., Chen, K., Corrado, G., Dean, J.: Efficient estimation of word representations in vector space. In: International Conference on Learning Representations (2013). https://arxiv.org/pdf/1301.3781.pdf
Honda, K., Notsu, A., Ichihashi, H.: Fuzzy PCA-guided robust \(k\)-means clustering. IEEE Trans. Fuzzy Syst. 13(4), 508–516 (2005)
Suzuki, M., Matsuda, K., Sekine, S., Okazaki, N., Inui, K.: A joint neural model for fine-grained named entity classification of Wikipedia articles. IEICE Trans. Inf. Syst. E101-D(1), 73–81 (2018)
Wikipedia Entity Vectors Homepage. https://github.com/singletongue/WikiEntVec. Accessed 28 Oct 2021
Acknowledgment
This work was supported in part by JSPS KAKENHI Grant Number JP18K11474.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 Springer Nature Switzerland AG
About this paper
Cite this paper
Takahata, Y., Honda, K., Ubukata, S. (2022). A Comparative Study on Utilization of Semantic Information in Fuzzy Co-clustering. In: Honda, K., Entani, T., Ubukata, S., Huynh, VN., Inuiguchi, M. (eds) Integrated Uncertainty in Knowledge Modelling and Decision Making. IUKM 2022. Lecture Notes in Computer Science(), vol 13199. Springer, Cham. https://doi.org/10.1007/978-3-030-98018-4_18
Download citation
DOI: https://doi.org/10.1007/978-3-030-98018-4_18
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-98017-7
Online ISBN: 978-3-030-98018-4
eBook Packages: Computer ScienceComputer Science (R0)