Skip to main content

Automatic Word Sense Disambiguation Using Cooccurrence and Hierarchical Information

  • Conference paper
Book cover Natural Language Processing and Information Systems (NLDB 2010)

Abstract

We review in detail here a polished version of the systems with which we participated in the Senseval-2 competition English tasks (all words and lexical sample). It is based on a combination of selectional preference measured over a large corpus and hierarchical information taken from WordNet, as well as some additional heuristics. We use that information to expand sense glosses of the senses in WordNet and compare the similarity between the contexts vectors and the word sense vectors in a way similar to that used by Yarowsky and Schuetze. A supervised extension of the system is also discussed. We provide new and previously unpublished evaluation over the SemCor collection, which is two orders of magnitude larger than SENSEVAL-2 collections as well as comparison with baselines. Our systems scored first among unsupervised systems in both tasks. We note that the method is very sensitive to the quality of the characterizations of word senses; glosses being much better than training examples.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Church, K.W., Hanks, P.: Word association norms, mutual information and lexicography. In: 27th Annual Conference of the Association of Computational Linguistics, pp. 76–82 (1989)

    Google Scholar 

  2. Cowie, J., Guthrie, J., Guthrie, L.: Lexical disambiguation using simulated annealing. In: International conference in computational linguistics (COLING), Nantes, pp. 359–365 (1992)

    Google Scholar 

  3. Dunning, T.E.: Accurate methods for the statistics of surprise and coincidence. Computational Linguistics 19(1), 61–74 (1993)

    Google Scholar 

  4. Fernández-Amorós, D., Gonzalo, J., Verdejo, F.: The role of conceptual relations in word sense disambiguation. In: Applications of Natural Language to Information Systems (NLDB), Madrid, pp. 87–98 (2001)

    Google Scholar 

  5. Fernández-Amorós, D., Gonzalo, J., Verdejo, F.: The uned systems at senseval-2. In: Proceedings of the 2nd International Workshop on Evaluating Word Sense Disambiguation Systems (SENSEVAL), Toulouse (2001)

    Google Scholar 

  6. Gale, W.A., Church, K.W., Yarowsky, D.: A method for disambiguating word senses in a large corpus. Computers and the Humanities 26(5), 415–439 (1993)

    Article  Google Scholar 

  7. Lesk, M.E.: Automatic sense disambiguation using machine readable dictionaries : How to tell a pine cone from an ice cream cone. In: Proceedings of SIGDOC (Special Interest Group for Documentation Conference), Toronto, Canada (1986)

    Google Scholar 

  8. Schuetze, H., Pedersen, J.: Information retrieval based on word senses. In: Proceedings of the 4th Annual Symposium on Document Analysis and Information Retrieval, Las Vegas, NV, pp. 161–175 (1995)

    Google Scholar 

  9. Wilks, Y., Fass, D., Guo, C., McDonald, J., Plate, T., Slator, B.: Providing machine tractable dictionary tools. Machine Translation 5(2), 99–151 (1990)

    Article  Google Scholar 

  10. Yarowsky, D.: Word-sense disambiguation using statistical models of Roget’s categories trained on large corpora. In: Proceedings of COLING 1992, Nantes, France, pp. 454–460 (1992)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Fernandez-Amoros, D., Gil, R.H., Somolinos, J.A.C., Somolinos, C.C. (2010). Automatic Word Sense Disambiguation Using Cooccurrence and Hierarchical Information. In: Hopfe, C.J., Rezgui, Y., Métais, E., Preece, A., Li, H. (eds) Natural Language Processing and Information Systems. NLDB 2010. Lecture Notes in Computer Science, vol 6177. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-13881-2_6

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-13881-2_6

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-13880-5

  • Online ISBN: 978-3-642-13881-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics