Skip to main content

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 935))

  • 1309 Accesses

Abstract

A measure of term relevancy is important in various applications such as Web search. Although the co-occurrence probability of terms in a database is a simple way to express term relevancy, it suffers from each term having a different co-occurrence tendency. In this paper, we propose a new measure of term relevancy: a ratio of actual and predicted values of co-occurrence (RAP). We construct a model predicting co-occurrence for each query as piecewise approximation lines.

Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the owner/author(s).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

Notes

  1. 1.

    Hatena Bookmark, http://b.hatena.ne.jp/.

  2. 2.

    MeCab: Yet Another Part-of-Speech and Morphological Analyzer, http://taku910.github.io/mecab/.

References

  1. Akamine, S., Kawahara, D., Kato, Y., Nakagawa, T., Inui, K., Kurohashi, S., Kidawara, Y.: WISDOM: a web information credibility analysis systematic. In: Proceedings of the ACL-IJCNLP 2009 Software Demonstrations, pp. 1–4 (2009)

    Google Scholar 

  2. Yamamoto, Y., Tanaka, K.: Enhancing credibility judgment of web search results. In: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, pp. 1235–1244. ACM (2011)

    Google Scholar 

  3. Yumoto, T., Yamanaka, T., Nii, M., Kamiura, N.: Finding rare information from the web using social bookmarks and word co-occurrence. Int. J. Biomed. Soft Comput. Hum. Sci. 22(1), 9–18 (2017)

    Google Scholar 

  4. Church, K.W., Hanks, P.: Word association norms, mutual information, and lexicography. Comput. Linguist. 16(1), 22–29 (1990)

    Google Scholar 

  5. Cilibrasi, R.L., Vitanyi, P.M.: The Google similarity distance. IEEE Trans. Knowl. Data Eng. 19(3), 370–383 (2007)

    Article  Google Scholar 

  6. Mikolov, T., Chen, K., Corrado, G., Dean, J.: Efficient estimation of word representations in vector space. ICLR Workshop (2013)

    Google Scholar 

  7. Kim, Y.: Convolutional neural networks for sentence classification. arXiv preprint arXiv:1408.5882 (2014)

  8. Poria, S., Cambria, E., Gelbukh, A.: Aspect extraction for opinion mining with a deep convolutional neural network. Knowl. Based Syst. 108, 42–49 (2016)

    Article  Google Scholar 

  9. Kudo, T., Yamamoto, K., Matsumoto, Y.: Applying conditional random fields to japanese morphological analysis. In 2004 Conference on Empirical Methods in Natural Language Processing (EMNLP2004), pp. 230–237 (2004)

    Google Scholar 

Download references

Acknowledgements

This work was partially supported by JSPS KAKENHI Grant Number JP17K00429.

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Yuya Koyama or Takayuki Yumoto .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Koyama, Y., Yumoto, T., Isokawa, T., Kamiura, N. (2019). Measuring Term Relevancy Based on Actual and Predicted Co-occurrence. In: Lee, S., Ismail, R., Choo, H. (eds) Proceedings of the 13th International Conference on Ubiquitous Information Management and Communication (IMCOM) 2019. IMCOM 2019. Advances in Intelligent Systems and Computing, vol 935. Springer, Cham. https://doi.org/10.1007/978-3-030-19063-7_78

Download citation

Publish with us

Policies and ethics