Skip to main content

A Comparative Study on Utilization of Semantic Information in Fuzzy Co-clustering

  • Conference paper
  • First Online:
Integrated Uncertainty in Knowledge Modelling and Decision Making (IUKM 2022)

Abstract

Fuzzy co-clustering is a technique for extracting co-clusters of mutually familiar pairs of objects and items from co-occurrence information among them, and has been utilized in document analysis on document-keyword relations and market analysis on purchase preferences of customers with products. Recently, multi-view data clustering attracts much attentions with the goal of revealing the intrinsic features among multi-source data stored over different organizations. In this paper, three-mode document data analysis is considered under multi-view analysis of document-keyword relations in conjunction with semantic information among keywords, where the results of two different approaches are compared. Fuzzy Bag-of-Words (Fuzzy BoW) introduces semantic information among keywords such that co-occurrence degrees are counted supported by fuzzy mapping of semantically similar keywords. On the other hand, three-mode fuzzy co-clustering simultaneously considers the cluster-wise aggregation degree among documents, keywords and semantic similarities. Numerical results with a Japanese novel document demonstrate the different features of these two approaches.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 54.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 69.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    English translation is also available in Eldritch Press (http://www.ibiblio.org/eldritch/).

References

  1. Oh, C.-H., Honda, K., Ichihashi, H.: Fuzzy clustering for categorical multivariate data. In: Joint 9th IFSA World Congress and 20th NAFIPS International Conference, pp. 2154–2159 (2001)

    Google Scholar 

  2. Kummamuru, K., Dhawale, A., Krishnapuram, R.: Fuzzy co-clustering of documents and keywords. In: Proceedings of the 12th IEEE International Conference on Fuzzy Systems, vol. 2, pp. 772–777 (2003)

    Google Scholar 

  3. Bezdek, J.C.: Pattern Recognition with Fuzzy Objective Function Algorithms. Plenum Press, New York (1981)

    Google Scholar 

  4. Miyamoto, S., Ichihashi, H., Honda, K.: Algorithms for Fuzzy Clustering. Springer, Heidelberg (2008)

    Google Scholar 

  5. Honda, K., Oshio, S., Notsu, A.: Fuzzy co-clustering induced by multinomial mixture models. J. Adv. Comput. Intell. Intell. Inform. 19(6), 717–726 (2015)

    Article  Google Scholar 

  6. Yang, Y., Wang, H.: Multi-view clustering: a survey. Big Data Min. Anal. 1, 83–107 (2018)

    Article  Google Scholar 

  7. Bisson, G., Grimal, C.: Co-clustering of multi-view datasets: a parallelizable approach. In: Proceedings of the 2012 IEEE 12th International Conference on Data Mining, pp. 828–833 (2012)

    Google Scholar 

  8. Nishida, Y., Honda, K.: Visualization of potential technical solutions by self-organizing maps and co-cluster extraction. J. Adv. Comput. Intell. Intell. Inform. 24(1), 65–72 (2020)

    Article  Google Scholar 

  9. Lan, M., Tan, C.L., Su, J., Lu, Y.: Supervised and traditional term weighting methods for automatic text categorization. IEEE Trans. Pattern Anal. Mach. Intell. 31(4), 721–735 (2009)

    Article  Google Scholar 

  10. Zhao, L., Mao, K.: Fuzzy bag-of-words model for document representation. IEEE Trans. Fuzzy Syst. 26(2), 794–804 (2018)

    Article  Google Scholar 

  11. Chen, T.-C.T., Honda, K.: Fuzzy Collaborative Forecasting and Clustering, SpringerBriefs in Applied Sciences and Technology. Springer, Heidelberg (2019)

    Google Scholar 

  12. Honda, K., Hayashi, I., Ubukata, S., Notsu, A.: Three-mode fuzzy co-clustering based on probabilistic concept and comparison with FCM-type algorithms. J. Adv. Comput. Intell. Intell. Inform. 25(4), 478–488 (2021)

    Article  Google Scholar 

  13. Salton, G., Buckley, C.: Term-weighting approaches in automatic text retrieval. Inf. Process. Manag. 24(5), 513–523 (1988)

    Article  Google Scholar 

  14. Mikolov, T., Chen, K., Corrado, G., Dean, J.: Efficient estimation of word representations in vector space. In: International Conference on Learning Representations (2013). https://arxiv.org/pdf/1301.3781.pdf

  15. Honda, K., Notsu, A., Ichihashi, H.: Fuzzy PCA-guided robust \(k\)-means clustering. IEEE Trans. Fuzzy Syst. 13(4), 508–516 (2005)

    Article  Google Scholar 

  16. Suzuki, M., Matsuda, K., Sekine, S., Okazaki, N., Inui, K.: A joint neural model for fine-grained named entity classification of Wikipedia articles. IEICE Trans. Inf. Syst. E101-D(1), 73–81 (2018)

    Google Scholar 

  17. Wikipedia Entity Vectors Homepage. https://github.com/singletongue/WikiEntVec. Accessed 28 Oct 2021

Download references

Acknowledgment

This work was supported in part by JSPS KAKENHI Grant Number JP18K11474.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Katsuhiro Honda .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Takahata, Y., Honda, K., Ubukata, S. (2022). A Comparative Study on Utilization of Semantic Information in Fuzzy Co-clustering. In: Honda, K., Entani, T., Ubukata, S., Huynh, VN., Inuiguchi, M. (eds) Integrated Uncertainty in Knowledge Modelling and Decision Making. IUKM 2022. Lecture Notes in Computer Science(), vol 13199. Springer, Cham. https://doi.org/10.1007/978-3-030-98018-4_18

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-98018-4_18

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-98017-7

  • Online ISBN: 978-3-030-98018-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics