Skip to main content

SOM-Based Visualization of Potential Technical Solutions with Fuzzy Bag-of-Words Utilizing Multi-view Information

  • Conference paper
  • First Online:
Integrated Uncertainty in Knowledge Modelling and Decision Making (IUKM 2020)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 12482))

  • 541 Accesses

Abstract

SOM-based visualization is a promising approach for revealing potential technical solutions varied in patent documents. In this paper, we try to improve the quality of visual representation by constructing Fuzzy Bag-of-Words (FBoW) matrices through utilization of multi-view information. F-term is the special theme code given by the examiners of Japan Patent Office (JPO) for Japanese patent documents and is expected to be a potential candidate of second-view information. Document \(\times \) F-term co-occurrence information is utilized for improving the quality of FBoW representation such that semantical similarities among words are measured by considering F-term semantics. The advantage of utilizing F-term semantics in constructing FBoW is demonstrated through analysis of patent document data.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 79.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 99.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    https://cl.asahi.com/api_data/wordembedding.html.

  2. 2.

    https://aial.shiroyagi.co.jp/2017/02/japanese-word2vec-model-builder/.

  3. 3.

    https://sustainablebrands.com/read/cleantech/trending-beverage-makers-turning-production-waste-into-biomass-carbonation.

  4. 4.

    https://iopscience.iop.org/article/10.1088/1755-1315/395/1/012090/meta.

References

  1. Nishida, Y., Honda, K.: Visualization of potential technical solutions by self-organizing maps and co-cluster extraction. In: Joint 10th International Conference on Soft Computing and Intelligent Systems and 19th International Symposium on Advanced Intelligent Systems, pp. 820–825 (2018)

    Google Scholar 

  2. Nishida, Y., Honda, K.: Visualization of potential technical solutions by self-organizing maps and co-cluster extraction. J. Adv. Comput. Intell. Intell. Inf. 24(1), 65–72 (2020)

    Article  Google Scholar 

  3. Kohonen, T.: Self-organizing Maps, 3rd edn. Springer, Heidelberg (2000). https://doi.org/10.1007/978-3-642-56927-2

    Book  MATH  Google Scholar 

  4. Lan, M., Tan, C.L., Su, J., Lu, Y.: Supervised and traditional term weighting methods for automatic text categorization. IEEE Trans. Pattern Anal. Mach. Intell. 31(4), 721–735 (2009)

    Article  Google Scholar 

  5. Zhao, L., Mao, K.: Fuzzy bag-of-words model for document representation. IEEE Trans. Fuzzy Syst. 26(2), 794–804 (2018)

    Article  Google Scholar 

  6. Mikolov, T., Chen, K., Corrado, G., Dean, J.: Efficient estimation of word representations in vector space. In: International Conference on Learning Representations (2013). https://arxiv.org/pdf/1301.3781.pdf

  7. Nishida, Y., Honda, K.: A comparative study on SOM-based visualization of potential technical solutions using fuzzy bag-of-words and co-occurrence probability of technical words. In: Seki, H., Nguyen, C.H., Huynh, V.-N., Inuiguchi, M. (eds.) IUKM 2019. LNCS (LNAI), vol. 11471, pp. 360–369. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-14815-7_30

    Chapter  Google Scholar 

  8. Bickel, S., Scheffer, T.: Multi-view clustering. In: 4th IEEE International Conference on Data Mining, pp. 19–26 (2004)

    Google Scholar 

  9. Xu, Y.M., Wang, C.D., Lai, J.H.: Weighted multi-view clustering with feature reduction. Pattern Recogn. 53, 25–35 (2016)

    Article  Google Scholar 

  10. Yang, Y., Wang, H.: Multi-view clustering: a survey. Big Data Min. Anal. 1, 83–107 (2018)

    Article  Google Scholar 

  11. Schellner, I.: Japanese file index classification and F-terms. World Patent Inf. 24, 197–201 (2002)

    Article  Google Scholar 

  12. Sato, T.: Neologism dictionary based on the language resources on the Web for mecab (2015). https://github.com/neologd/mecab-ipadic-neologd

  13. Salton, G., Buckley, C.: Term-weighting approaches in automatic text retrieval. Inf. Process. Manage. 24(5), 513–523 (1988)

    Article  Google Scholar 

  14. Wittek, P., Gao, S.-C., Lim, I.-S., Zhao, L.: Somoclu: an efficient parallel library for self-organizing maps. J. Stat. Softw. 78(9), 1–21 (2017)

    Article  Google Scholar 

  15. Honda, K., Sako, T., Ubukata, S., Notsu, A.: Visual co-cluster assessment with intuitive cluster validation through co-occurrence-sensitive ordering. J. Adv. Comput. Intell. Intell. Inf. 22(5), 585–592 (2018)

    Article  Google Scholar 

  16. Honda, K., Yamamoto, N., Ubukata, S., Notsu, A.: Noise rejection in MMMs-induced fuzzy co-clustering. J. Adv. Comput. Intell. Intell. Inf. 21(7), 1144–1151 (2017)

    Article  Google Scholar 

  17. Chen, T.-C.T., Honda, K.: Fuzzy Collaborative Forecasting and Clustering. SpringerBriefs in Applied Sciences and Technology. Springer, Heidelberg (2019). https://doi.org/10.1007/978-3-030-22574-2

    Book  Google Scholar 

Download references

Acknowledgment

This work was achieved through the use of large-scale computer systems at the Cybermedia Center, Osaka University, and was supported in part by JSPS KAKENHI Grant Number JP18K11474.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Katsuhiro Honda .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Nishida, Y., Honda, K. (2020). SOM-Based Visualization of Potential Technical Solutions with Fuzzy Bag-of-Words Utilizing Multi-view Information. In: Huynh, VN., Entani, T., Jeenanunta, C., Inuiguchi, M., Yenradee, P. (eds) Integrated Uncertainty in Knowledge Modelling and Decision Making. IUKM 2020. Lecture Notes in Computer Science(), vol 12482. Springer, Cham. https://doi.org/10.1007/978-3-030-62509-2_16

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-62509-2_16

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-62508-5

  • Online ISBN: 978-3-030-62509-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics