Abstract
In this paper we propose a mixed method for Word Sense Disambiguation (WSD) that combines lexical knowledge from EuroWordNet (EWN) with corpora. The method offers a partial solution to the gap between lexicon and corpus by bringing the corpus closer to the lexicon. Exploiting the interaction that holds in natural language between the syntagmatic and paradigmatic axes, we extract implicit paradigmatic information from the corpus. We then combine this information with the paradigmatic information contained in EWN. We evaluate the method and interpret the results.
© 2004 Springer-Verlag Berlin Heidelberg
Nica, I., Martí, M.A., Montoyo, A., Vázquez, S. (2004). Combining EWN and Sense-Untagged Corpus for WSD. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2004. Lecture Notes in Computer Science, vol 2945. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24630-5_23
DOI: https://doi.org/10.1007/978-3-540-24630-5_23
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-21006-1
Online ISBN: 978-3-540-24630-5
eBook Packages: Springer Book Archive