Skip to main content

An Unsupervised WSD Algorithm for a NLP System

  • Conference paper
Natural Language Processing and Information Systems (NLDB 2004)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 3136))

  • 704 Accesses

Abstract

The increasing flow of information requires advanced free text filtering. An important part of this task consists in eliminating word occurrences with an inappropriate sense, which corresponds to a Word Sense Disambiguation operation. In this paper we propose a completely automatic WSD method for Spanish – restricted to nouns – to be used as a module in a Natural Language Processing system for unlimited text. We call it the Commutative Test. This method exploits an adaptation of EuroWordNet, Sense Discriminators, that implicitly keeps all lexical-semantic relations of its nominal hierarchy. The only requirement is the availability of a large corpus and a part-of-speech tagger, without any need of previous sense-tagging. An evaluation of the method has been done on the Senseval test corpus. The method can be easily adapted to other languages that dispose of a corpus, a WordNet component and a part-of-speech tagger.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. Agirre, E., Rigau, G.: Word Sense Disambiguation using Conceptual Density. In: Proceedings of the 16th International Conference on COLING, Copenhagen (1996)

    Google Scholar 

  2. Civit, M.: Criterios de etiquetación y desambiguación morfosintáctica de corpus en español, Ph.D. dissertation, Universidad de Barcelona (2003)

    Google Scholar 

  3. Cowie, J., Guthrie, J., Guthrie, L.: Lexical disambiguation using simulated annealing. In: Proceedings of the DARPA Workshop on Speech and Natural Language, New York (1992)

    Google Scholar 

  4. Edmonds, P., Cotton, S. (eds.): SENSEVAL-2: Overview. In: Proceedings of 2nd International Workshop Evaluating Word Sense Disambiguation Systems, Toulouse (2001)

    Google Scholar 

  5. Hale, M.L.: Ms. A comparison of WordNet and Rogetś taxonomy for measuring semantic similarity

    Google Scholar 

  6. Ide, N., Véronis, J.: Introduction to the Special Issue on Word Sense Disambiguation: The State of the Art. Computational Linguistics 24(1) (1998)

    Google Scholar 

  7. Kilgariff, A.: Bridging the gap between lexicon and corpus: convergence of formalisms. In: Proceedings of LREC 1998, Granada (1998)

    Google Scholar 

  8. Leacock, C., Chodorow, M., Miler, G.A.: Using Corpus Statistics and WordNet Relations for Sense Identification. Computational Linguistics. Special Issue on Word Sense Disambiguation, 24(1) (1998)

    Google Scholar 

  9. Lesk, M.: Automatic sense disambiguation using machine readable dictionaries: How to tell a pine cone from an ice cream cone. In: Proceedings of the 1986 SIGDOC Conference, ACM, New York (1986)

    Google Scholar 

  10. Mihalcea, R., Moldovan, D.: A Method for word sense disambiguation of unrestricted text. In: Proceedings of the 37th Annual Meeting of the ACL, Maryland, USA (1999)

    Google Scholar 

  11. Miller, G.A., Beckwith, R., Fellbaum, C., Gross, D., Miller, K.J.: WordNet: An on-line lexical database. International Journal of Lexicography 3(4) (1990)

    Google Scholar 

  12. Nica, I., Martí, M.A., Montoyo, A.: Colaboración entre información paradigmática y sintagmática en la Desambiguación Semántica Automática. In: XIX SEPLN Conference, Alcalá de Henares-Madrid (2003)

    Google Scholar 

  13. Nica, I., Martí, M.A., Montoyo, A., Vázquez, S.: Combining EWN and sense untagged corpora for WSD. In: Gelbukh, A. (ed.) CICLing 2004. LNCS, vol. 2945, pp. 188–200. Springer, Heidelberg (2004)

    Chapter  Google Scholar 

  14. Resnik, P.: Disambiguating noun groupings with respect to WordNet senses. In: Proceedings of the Third Workshop on Very Large Corpora, Cambridge (1995)

    Google Scholar 

  15. Resnik, P., Yarowsky, D.: A perspective on word sense disambiguation methods and their evaluation. In: Proceedings of the ACL Siglex Wordshop on Tagging Text with Lexical Semantics, why, what and how?, Washington (1997)

    Google Scholar 

  16. Resnik, P.: Semantic similarity in a taxonomy: an information-based measure and its application to problems of ambiguity in natural language. Journal of Artificial Intelligence Research 11 (1999)

    Google Scholar 

  17. Rigau, G., Atserias, J., Agirre, E.: Combining Unsupervised Lexical Knowledge Methods for Word Sense Disambiguation. In: Proceedings of the 35th Annual Meeting of the ACL, Madrid (1997)

    Google Scholar 

  18. Sussna, M.: Word sense disambiguation for free-text indexing using a massive semantic network. In: Proceedings of the Second International CIKM, Airlington, VA (1993)

    Google Scholar 

  19. Voorhees, E.: Using WordNet to disambiguation word senses for text retrieval. In: Proceedings of the 16th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Pittsburgh, PA (1993)

    Google Scholar 

  20. Wilks, Y., Fass, D., Guo, C., McDonal, J., Plate, T., Slator, B.: Providing Machine Tractable Dictionary Tools. In: Pustejovsky, J. (ed.) S emantics and the Lexicon, Kluwer, Dordrecht (1993)

    Google Scholar 

  21. Wilks, Y., Stevenson, M.: The grammar of sense: Is word sense tagging much more than part-of-speech tagging?, Technical Report CS-96-05, University of Sheffield (1996)

    Google Scholar 

  22. Yarowsky, D.: Word Sense disambiguation using statistical models of Rogetś categories trained on large corpora. In: Proceedings of the 14th COLING, Nantes (1992)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2004 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Nica, I., Montoyo, A., Vázquez, S., Martí, M.A. (2004). An Unsupervised WSD Algorithm for a NLP System. In: Meziane, F., Métais, E. (eds) Natural Language Processing and Information Systems. NLDB 2004. Lecture Notes in Computer Science, vol 3136. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-27779-8_25

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-27779-8_25

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-22564-5

  • Online ISBN: 978-3-540-27779-8

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics