Towards Robust High Performance Word Sense Disambiguation of English Verbs Using Rich Linguistic Features

Chen, Jinying; Palmer, Martha

doi:10.1007/11562214_81

Jinying Chen & Martha Palmer

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 3651)

Included in the following conference series:

International Conference on Natural Language Processing


Abstract

This paper shows that our word sense disambiguation (WSD) system using rich linguistic features achieved high accuracy in the classification of English SENSEVAL2 verbs for both fine-grained (64.6%) and coarse-grained (73.7%) senses. We describe three specific enhancements to our treatment of rich linguistic features and present their separate and combined contributions to our system’s performance. Further experiments showed that our system had robust performance on test data without high-quality rich features.



Author information

Authors and Affiliations

Department of Computer and Information Science, University of Pennsylvania, Philadelphia, PA, 19104, USA
Jinying Chen & Martha Palmer


Editor information

Editors and Affiliations

Center for Language Technology, Macquarie University, 2019, Sydney, NSW, Australia
Robert Dale
Department of Systems Engineering and Engineering Management, The Chinese University of Hong Kong, Shatin, N.T., Hong Kong
Kam-Fai Wong
Institute for Infocomm Research, 21, Heng Mui Keng Terrace, 119613, Singapore
Jian Su
Language Information Sciences Research Centre, City University of Hong Kong, Tat Chee Avenue, Kowloon, Hong Kong
Oi Yee Kwong


About this paper

Cite this paper

Chen, J., Palmer, M. (2005). Towards Robust High Performance Word Sense Disambiguation of English Verbs Using Rich Linguistic Features. In: Dale, R., Wong, KF., Su, J., Kwong, O.Y. (eds) Natural Language Processing – IJCNLP 2005. IJCNLP 2005. Lecture Notes in Computer Science, vol 3651. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11562214_81

DOI: https://doi.org/10.1007/11562214_81
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29172-5
Online ISBN: 978-3-540-31724-1
eBook Packages: Computer Science, Computer Science (R0)
