Skip to main content

Word Sense Disambiguation Based on Example Sentences in Dictionary and Automatically Acquired from Parallel Corpus

  • Conference paper
Book cover Advances in Natural Language Processing (JapTAL 2012)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 7614))

Included in the following conference series:

Abstract

This paper presents a precision oriented example based approach for word sense disambiguation (WSD) for a reading assistant system for Japanese learners. Our WSD classifier chooses a sense associated with the most similar sentence in a dictionary only if the similarity is high enough, otherwise chooses no sense. We propose sentence similarity measures by exploiting collocations and syntactic dependency relations for a target word. The example based classifier is combined with a Robinson classifier to compensate recall. We further improve WSD performance by automatically acquiring bilingual sentences from a parallel corpus. According to the results of our experiments, the accuracy of automatically extracted sentences was 85%, while the proposed WSD method achieves 65% accuracy which is 7% higher than the baseline.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Carpuat, M., Wu, D.: Improving statistical machine translation using word sense disambiguation. In: The 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL), Prague, Czech Republic, pp. 61–72 (2007)

    Google Scholar 

  2. Chan, Y.S., Ng, H.T.: Word sense disambiguation improves statistical machine translation. In: 45th Annual Meeting of the Association for Computational Linguistics (ACL 2007), Prague, Czech Republic, pp. 33–40 (2007)

    Google Scholar 

  3. Navigli, R.: Word sense disambiguation: A Survey. ACM Computing Surveys 41(2), 1–69 (2009)

    Article  Google Scholar 

  4. Okumura, M., Shirai, K., Komiya, K., Yokono, H.: SemEval-2010 task: Japanese WSD. In: Proceedings of the 5th International Workshop on Semantic Evaluation, pp. 69–74 (2010)

    Google Scholar 

  5. Natural Institute for Japanese Language and Linguistics (ed.): Bunrui Goi Hyo. Dainippon Tosho (2004)

    Google Scholar 

  6. Robinson, G.: A statistical approach to the spam problem. Linux J. 2003(107) (March 2003)

    Google Scholar 

  7. Utiyama, M., Isahara, H.: Reliable measures for aligning Japanese-English news articles and sentences. In: Proceedings of the 41st Annual Meeting on Association for Computational Linguistics, ACL 2003, vol. 1, pp. 72–79 (2003)

    Google Scholar 

  8. Minnen, G., Carroll, J., Pearce, D.: Robust, applied morphological generation. In: Proceedings of the First International Natural Language Generation Conference, 201–208 (2000)

    Google Scholar 

  9. Dagan, I., Itai, A.: Word sense disambiguation using a second language monolingual corpus. Computational Linguistics 20(4), 563–596 (1994)

    Google Scholar 

  10. Lee, H.A., Kim, G.C.: Translation selection through source word sense disambiguation and target word selection. In: Proceedings of the 19th International Conference on Computational Linguistics, COLING 2002, vol. 1, pp. 1–7 (2002)

    Google Scholar 

  11. Fujii, A., Inui, K., Tokunaga, T., Tanaka, H.: Selective sampling for example-based word sense disambiguation. Computational Linguistics 24(4), 573–597 (1998)

    Google Scholar 

  12. Shirai, K., Tamagaki, T.: Word Sense Disambiguation Using Heterogeneous Language Resources. In: Su, K.-Y., Tsujii, J., Lee, J.-H., Kwong, O.Y. (eds.) IJCNLP 2004. LNCS (LNAI), vol. 3248, pp. 377–385. Springer, Heidelberg (2005)

    Chapter  Google Scholar 

  13. Fujita, S., Fujino, A.: Word Sense Disambiguation by Combining Labeled Data Expansion and Semi-Supervised Learning Method. In: Proceedings of 5th International Joint Conference on Natural Language Processing, pp. 676–685 (2011)

    Google Scholar 

  14. de Melo, G., Weikum, G.: Extracting sense-disambiguated example sentences from parallel corpora. In: Proceedings of the 1st Workshop on Definition Extraction, WDE 2009, pp. 40–46 (2009)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Kathuria, P., Shirai, K. (2012). Word Sense Disambiguation Based on Example Sentences in Dictionary and Automatically Acquired from Parallel Corpus. In: Isahara, H., Kanzaki, K. (eds) Advances in Natural Language Processing. JapTAL 2012. Lecture Notes in Computer Science(), vol 7614. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-33983-7_21

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-33983-7_21

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-33982-0

  • Online ISBN: 978-3-642-33983-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics