Lexical Semantic Ambiguity Resolution with Bigram-Based Decision Trees

Pedersen, Ted

doi:10.1007/3-540-44686-9_16

Ted Pedersen²

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2004))

Included in the following conference series:

International Conference on Intelligent Text Processing and Computational Linguistics

778 Accesses
2 Citations

Abstract

This paper presents corpus-based approach to word sense disambiguation where decision tree ssigns sense to an ambiguous word based on the bigrams that occur nearby. This approach is evaluated using the sense-tagged corpora from the 1998 SENSEVAL word sense disambiguation exercise. It is more ccurate than the verage results reported for 30 of 36 words, and is more accurate than the best results for 19 of 36 words.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

R. Bruce and J. Wiebe. Word-sense disambiguation using decomposable models. In Proceedings of the 32nd Annual Meeting of the Association for Computational Linguistics pages 139–146, 1994.
Google Scholar
K. Church and P. Hanks. Word ssociation norms, mutual information and lexicography. In Proceedings of the 28th Annual Meeting of the Association for Com-putational Linguistics pages 76–83, 1990.
Google Scholar
N. Cressie and T. Read. Multinomial goodness of fit tests. Journal of the Royal Statistics Society Series B 46:440–464, 1984.
MathSciNet MATH Google Scholar
R. Duda and P. Hart. Pattern Classification and Scene Analysis, Wiley, NewYork, NY, 1973.
Google Scholar
T. Dunning. Accurate methods for the statistics of surprise nd coincidence. Com-putational Linguistics 19(1):61–74, 1993.
Google Scholar
R. Holte. Very simple classification rules perform well on most commonly used datasets. Machine Learning 11:63–91, 1993.
Article Google Scholar
A. Kilgarri and M. Palmer. Special issue on SENSEVAL:Evalu ting word sense disambigu tion programs. Computers and the Humanities 34(1–2), 2000.
Google Scholar
R. Mooney. Comp rative experiments on disambiguating word senses:An illustration of the role of bias in machine learning. In Proceedings of the Conference on Empirical Methods in Natural Language Processing, pages 82–91, May 1996.
Google Scholar
H.T. Ng and H.B. Lee. Integrating multiple knowledge sources to disambiguate word sense: An exemplar-based approach. In Proceedings of the 34th Annual Meeting of the Society for Computational Linguistics, pages 40–47, 1996.
Google Scholar
T. Pedersen. Fishing for exactness. In Proceedings of the South Central SAS User’s Group (SCSUG-96) Conference, pages 188–200, Austin, TX, October 1996.
Google Scholar
T. Pedersen and R. Bruce. A newsupervised learning algorithm for word sense dis mbiguation. In Proceedings of the Fourteenth National Conference on Artificial Intelligence, pages 604–609, Providence, RI, July 1997.
Google Scholar
F. Smadja, K. McKeown, and V. Hatzivassiloglou. Translating collocations for bilingual lexicons:A statistical pproach. Computational Linguistics 22(1):1–38, 1996.
Google Scholar
Y. Wilks and M. Stevenson. Word sense dis mbigu tion using optimised combinations of knowledge sources. In Proceedings of COLING/ACL-98 1998.
Google Scholar
I. Witten and E. Frank. Data Mining-Practical Machine Learning Tools and Techniques with Java Implementations, Morgan-Kaufmann, San Francisco, CA, 2000.
Google Scholar
D. Yarowsky. Decision lists for lexical ambiguity resolution: Application to ccent restoration in Spanish and French. In Proceedings of the 32nd Annual Meeting of the Association for Computational Linguistics 1994.
Google Scholar
D. Yarowsky. Unsupervised word sense disambiguation rivaling supervised methods. In Proceedings of the 33rd Annual Meeting of the Association for Computa-tional Linguistics pages 189–196, Cambridge,MA, 1995.
Google Scholar
D. Yarowsky. Hierarchical decision lists for word sense disambiguation. Computers and the Humanities 34(1-2), 2000.
Google Scholar

Download references

Author information

Authors and Affiliations

University of Minnesota Duluth, MN 55812, Duluth, USA
Ted Pedersen

Authors

Ted Pedersen
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

CIC (Centro de Investigación en Computatción IPN (Instituto Politécnico Nacional), Av. Juan Dios Bátiz s/n esq. M. Othon Mendizabal Col. Nuevo Vallejo, CP. 07738, México, Mexico
Alexander Gelbukh (Unidad Profecional “Adolfo López Mateos”) (Unidad Profecional “Adolfo López Mateos”)

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Pedersen, T. (2001). Lexical Semantic Ambiguity Resolution with Bigram-Based Decision Trees. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2001. Lecture Notes in Computer Science, vol 2004. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44686-9_16

Download citation

DOI: https://doi.org/10.1007/3-540-44686-9_16
Published: 16 March 2001
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-41687-6
Online ISBN: 978-3-540-44686-6
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics