Skip to main content

Lexical Semantic Ambiguity Resolution with Bigram-Based Decision Trees

  • Conference paper
  • First Online:
Computational Linguistics and Intelligent Text Processing (CICLing 2001)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2004))

Abstract

This paper presents corpus-based approach to word sense disambiguation where decision tree ssigns sense to an ambiguous word based on the bigrams that occur nearby. This approach is evaluated using the sense-tagged corpora from the 1998 SENSEVAL word sense disambiguation exercise. It is more ccurate than the verage results reported for 30 of 36 words, and is more accurate than the best results for 19 of 36 words.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. R. Bruce and J. Wiebe. Word-sense disambiguation using decomposable models. In Proceedings of the 32nd Annual Meeting of the Association for Computational Linguistics pages 139–146, 1994.

    Google Scholar 

  2. K. Church and P. Hanks. Word ssociation norms, mutual information and lexicography. In Proceedings of the 28th Annual Meeting of the Association for Com-putational Linguistics pages 76–83, 1990.

    Google Scholar 

  3. N. Cressie and T. Read. Multinomial goodness of fit tests. Journal of the Royal Statistics Society Series B 46:440–464, 1984.

    MathSciNet  MATH  Google Scholar 

  4. R. Duda and P. Hart. Pattern Classification and Scene Analysis, Wiley, NewYork, NY, 1973.

    Google Scholar 

  5. T. Dunning. Accurate methods for the statistics of surprise nd coincidence. Com-putational Linguistics 19(1):61–74, 1993.

    Google Scholar 

  6. R. Holte. Very simple classification rules perform well on most commonly used datasets. Machine Learning 11:63–91, 1993.

    Article  Google Scholar 

  7. A. Kilgarri and M. Palmer. Special issue on SENSEVAL:Evalu ting word sense disambigu tion programs. Computers and the Humanities 34(1–2), 2000.

    Google Scholar 

  8. R. Mooney. Comp rative experiments on disambiguating word senses:An illustration of the role of bias in machine learning. In Proceedings of the Conference on Empirical Methods in Natural Language Processing, pages 82–91, May 1996.

    Google Scholar 

  9. H.T. Ng and H.B. Lee. Integrating multiple knowledge sources to disambiguate word sense: An exemplar-based approach. In Proceedings of the 34th Annual Meeting of the Society for Computational Linguistics, pages 40–47, 1996.

    Google Scholar 

  10. T. Pedersen. Fishing for exactness. In Proceedings of the South Central SAS User’s Group (SCSUG-96) Conference, pages 188–200, Austin, TX, October 1996.

    Google Scholar 

  11. T. Pedersen and R. Bruce. A newsupervised learning algorithm for word sense dis mbiguation. In Proceedings of the Fourteenth National Conference on Artificial Intelligence, pages 604–609, Providence, RI, July 1997.

    Google Scholar 

  12. F. Smadja, K. McKeown, and V. Hatzivassiloglou. Translating collocations for bilingual lexicons:A statistical pproach. Computational Linguistics 22(1):1–38, 1996.

    Google Scholar 

  13. Y. Wilks and M. Stevenson. Word sense dis mbigu tion using optimised combinations of knowledge sources. In Proceedings of COLING/ACL-98 1998.

    Google Scholar 

  14. I. Witten and E. Frank. Data Mining-Practical Machine Learning Tools and Techniques with Java Implementations, Morgan-Kaufmann, San Francisco, CA, 2000.

    Google Scholar 

  15. D. Yarowsky. Decision lists for lexical ambiguity resolution: Application to ccent restoration in Spanish and French. In Proceedings of the 32nd Annual Meeting of the Association for Computational Linguistics 1994.

    Google Scholar 

  16. D. Yarowsky. Unsupervised word sense disambiguation rivaling supervised methods. In Proceedings of the 33rd Annual Meeting of the Association for Computa-tional Linguistics pages 189–196, Cambridge,MA, 1995.

    Google Scholar 

  17. D. Yarowsky. Hierarchical decision lists for word sense disambiguation. Computers and the Humanities 34(1-2), 2000.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2001 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Pedersen, T. (2001). Lexical Semantic Ambiguity Resolution with Bigram-Based Decision Trees. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2001. Lecture Notes in Computer Science, vol 2004. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44686-9_16

Download citation

  • DOI: https://doi.org/10.1007/3-540-44686-9_16

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-41687-6

  • Online ISBN: 978-3-540-44686-6

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics