Skip to main content

Combining EWN and Sense-Untagged Corpus for WSD

  • Conference paper
Computational Linguistics and Intelligent Text Processing (CICLing 2004)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2945))

  • 981 Accesses

Abstract

In this paper we propose a mixed method for Word Sense Disambiguation, which combines lexical knowledge from EuroWordNet with corpora. The method tries to give a partial solution to the problem of the gap between lexicon and corpus by means of the approximation of the corpus to the lexicon. On the basis of the interaction that holds in natural language between the syntagmatic and the paradigmatic axes, we extract from corpus implicit information of paradigmatic type. On the information thus obtained we work with the information, also paradigmatic, contained in EWN. We evaluate the method and interpret the results.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. Agirre, E., Martinez, D.: Exploring automatic WSD with decision lists and the Web. In: Proceedings of the COLING Workshop on Semantic Annotation and Intelligent Content, Saabrücken (2000)

    Google Scholar 

  2. Agirre, E., Martínez, D.: Learning class-to-class selectional preferences. In: Proceedings of the ACL CONLL 2001 Workshop, Tolouse (2001)

    Google Scholar 

  3. Civit, M.: Criterios de etiquetación y desambiguación morfosintáctica de corpus en español, Ph.D.Thesis, University of Barcelona (2003)

    Google Scholar 

  4. Cruse, A.: Meaning in Language. An Introduction to Semantics and Pragmatics. Oxford University Press, Oxford (2000)

    Google Scholar 

  5. Federici, S., Montemagni, S., Pirelli, V.: ROMANSEVAL: Results for Italian by SENSE. Computers and the Humanities. Special Issue: Evaluating WSD Programs 34(1-2) (2000)

    Google Scholar 

  6. Gale, W.A., Church, K.W., Yarowsky, D.: One sense per discourse. In: Proceedings of DARPA speech and Natural Language Workshop, Harriman, NY (1992)

    Google Scholar 

  7. Hoste, V., Hendrickx, I., Daelemans, W., van den Bosch, A.: Parameter optimisation for machine-learning of WSD. Natural Language Engineering 8(4) (2002)

    Google Scholar 

  8. Ide, N., Véronis, J.: Introduction to the Special Issue on Word Sense Disambiguation. The State of the Art. Computational Linguistics. Special Issue on Word Sense Disambiguation 24(1) (1998)

    Google Scholar 

  9. Kilgariff, A.: Bridging the gap between lexicon and corpus: convergence of formalisms. In: Proceedings of LREC 1998, Granada (1998)

    Google Scholar 

  10. Leacock, C., Chodorow, M., Miller, G.A.: Using Corpus Statistics and WordNet Relations for Sense Identification. Computational Linguistics. Special Issue on Word Sense Disambiguation 24(1) (1998)

    Google Scholar 

  11. Lin, D.: Using Syntactic Dependency as Local Context to Resolve Word Sense Ambiguity. In: Proceedings of ACL and EACL 1997. Morgan Kaufman Publishers, San Francisco (1997)

    Google Scholar 

  12. Manning, C.D., Schütze, H.: Foundations of Statistical Natural Language Processing, cap. 7: Word Sense Disambiguation, 3rd printing, pp. 229–263. The MIT Press, Cambridge (1999)

    Google Scholar 

  13. Martínez, D., Agirre, E., Màrquez, L.: Syntactic Features for High Precision Word Sense Disambiguation. In: Proceedings of the 19th International Conference on Computational Linguistics (COLING 2002), Taipei, Taiwan (2002)

    Google Scholar 

  14. Mihalcea, R.: WSD with pattern learning and feature selection. Natural Language Engineering 8(4) (2002)

    Google Scholar 

  15. Mihalcea, R., Moldovan, D.: An Automatic Method for Generating Sense Tagged Corpora. In: Proceedings of AAAI 1999, Orlando (1999)

    Google Scholar 

  16. Miller, G., Charles, W.: Contextual correlates of semantic similarity. Language and Cognitive Processes 6(1) (1991)

    Google Scholar 

  17. Montoyo, A., Palomar, M.: Word Sense Disambiguation with Specification Marks in Unrestricted Texts. In: Proceedings of the 11th International Workshop on DEXA, Greenwich, London (2000)

    Google Scholar 

  18. Ng, H.T., Lee, H.B.: Integrating Multiple Knowledge Sources to Disambiguate Word Sense: An Exemplar-Based Approach. In: Proceedings of the 34th Annual Meeting of the ACL (1996)

    Google Scholar 

  19. Nica, I., Martí, M.A., Montoyo, A.: Colaboración entre información paradigmática y sintagmática en la Desambiguación Semántica Automática. In: XX Congreso de la SEPLN 2003, Alcalá de Henares, Spain (2003)

    Google Scholar 

  20. Nica, I., Martí, M.A., Montoyo, A.: Automatic sense (pre-)tagging by syntactic patterns. In: Proceedings of International Conference on Recent Advances in Natural Language Processing (RANLP 2003), Borovets, Bulgaria (2003)

    Google Scholar 

  21. Pedersen, T.: A decision tree of bigrams is an accurate predictor of word sense. In: Proceedings of NAACL 2001, Pittsburg (2001)

    Google Scholar 

  22. Pedersen, T.: A Baseline Methodology for Word Sense Disambiguation. In: Proceedings of the Third International Conference on Intelligent Text Processing and Computational Linguistics, Mexico City (February 2002a)

    Google Scholar 

  23. Pedersen, T.: Assessing System Agreement and Instance Difficulty in the Lexical Sample Tasks of SENSEVAL-2. In: Proceedings of the Workshop on Word Sense Disambiguation: Recent Successes and Future Directions, Philadelphia (2002b)

    Google Scholar 

  24. Rigau, G., Taulé, M., Fernández, A., Gonzalo, J.: Framework and results in the Spanish SENSEVAL. In: Preiss, J., Yarowsky, D. (eds.) Proceedings of the SENSEVAL-2 Workshop. In conjunction with ACL 2001/EACL 2001, Toulouse (2001)

    Google Scholar 

  25. Sebastián, N., Martí, M.A., Carreiras, M.F., Cuetos Gómez, F.: Lexesp, léxico informatizado del español, Edicions de la Universitat de Barcelona (2000)

    Google Scholar 

  26. Stetina, J., Kurohashi, S., Nagao, M.: General WSD Method Based on a Full Sentential Context. In: Proceedings of COLING-ACL Workshop, Montreal (1998)

    Google Scholar 

  27. Véronis, J.: Sense tagging: does it make sense? Paper presented at the Corpus Linguistics 2001 Conference, Lancaster, U.K. (2001)

    Google Scholar 

  28. Vossen, P. (ed.): EUROWORDNET. A Multilingual Database with Lexical Semantic Networks. Kluwer Academic Publishers, Dordrecht (1998)

    MATH  Google Scholar 

  29. Yarowsky, D.: Word-Sense Disambiguation Using Statistical Models of Roget’s Categories Trained on Large Corpora. In: Proceedings of COLING 1992, Nantes, France (1992)

    Google Scholar 

  30. Yarowsky, D.: One Sense per Collocation. In: DARPA Workshop on Human Language Technology, Princeton (1993)

    Google Scholar 

  31. Yarowsky, D.: Unsupervised word sense disambiguation rivalising supervised methods. In: Proceedings of ACL 1995, Dublin (1995)

    Google Scholar 

  32. Yarowsky, D., Florian, R.: Evaluating sense disambiguation across diverse parameter spaces. Natural Language Engineering 8(4) (2002)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2004 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Nica, I., Martí, M.A., Montoyo, A., Vázquez, S. (2004). Combining EWN and Sense-Untagged Corpus for WSD. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2004. Lecture Notes in Computer Science, vol 2945. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24630-5_23

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-24630-5_23

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-21006-1

  • Online ISBN: 978-3-540-24630-5

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics