Abstract
The increasing flow of information requires advanced free text filtering. An important part of this task consists in eliminating word occurrences with an inappropriate sense, which corresponds to a Word Sense Disambiguation operation. In this paper we propose a completely automatic WSD method for Spanish – restricted to nouns – to be used as a module in a Natural Language Processing system for unlimited text. We call it the Commutative Test. This method exploits an adaptation of EuroWordNet, Sense Discriminators, that implicitly keeps all lexical-semantic relations of its nominal hierarchy. The only requirement is the availability of a large corpus and a part-of-speech tagger, without any need of previous sense-tagging. An evaluation of the method has been done on the Senseval test corpus. The method can be easily adapted to other languages that dispose of a corpus, a WordNet component and a part-of-speech tagger.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Agirre, E., Rigau, G.: Word Sense Disambiguation using Conceptual Density. In: Proceedings of the 16th International Conference on COLING, Copenhagen (1996)
Civit, M.: Criterios de etiquetación y desambiguación morfosintáctica de corpus en español, Ph.D. dissertation, Universidad de Barcelona (2003)
Cowie, J., Guthrie, J., Guthrie, L.: Lexical disambiguation using simulated annealing. In: Proceedings of the DARPA Workshop on Speech and Natural Language, New York (1992)
Edmonds, P., Cotton, S. (eds.): SENSEVAL-2: Overview. In: Proceedings of 2nd International Workshop Evaluating Word Sense Disambiguation Systems, Toulouse (2001)
Hale, M.L.: Ms. A comparison of WordNet and Rogetś taxonomy for measuring semantic similarity
Ide, N., Véronis, J.: Introduction to the Special Issue on Word Sense Disambiguation: The State of the Art. Computational Linguistics 24(1) (1998)
Kilgariff, A.: Bridging the gap between lexicon and corpus: convergence of formalisms. In: Proceedings of LREC 1998, Granada (1998)
Leacock, C., Chodorow, M., Miler, G.A.: Using Corpus Statistics and WordNet Relations for Sense Identification. Computational Linguistics. Special Issue on Word Sense Disambiguation, 24(1) (1998)
Lesk, M.: Automatic sense disambiguation using machine readable dictionaries: How to tell a pine cone from an ice cream cone. In: Proceedings of the 1986 SIGDOC Conference, ACM, New York (1986)
Mihalcea, R., Moldovan, D.: A Method for word sense disambiguation of unrestricted text. In: Proceedings of the 37th Annual Meeting of the ACL, Maryland, USA (1999)
Miller, G.A., Beckwith, R., Fellbaum, C., Gross, D., Miller, K.J.: WordNet: An on-line lexical database. International Journal of Lexicography 3(4) (1990)
Nica, I., Martí, M.A., Montoyo, A.: Colaboración entre información paradigmática y sintagmática en la Desambiguación Semántica Automática. In: XIX SEPLN Conference, Alcalá de Henares-Madrid (2003)
Nica, I., Martí, M.A., Montoyo, A., Vázquez, S.: Combining EWN and sense untagged corpora for WSD. In: Gelbukh, A. (ed.) CICLing 2004. LNCS, vol. 2945, pp. 188–200. Springer, Heidelberg (2004)
Resnik, P.: Disambiguating noun groupings with respect to WordNet senses. In: Proceedings of the Third Workshop on Very Large Corpora, Cambridge (1995)
Resnik, P., Yarowsky, D.: A perspective on word sense disambiguation methods and their evaluation. In: Proceedings of the ACL Siglex Wordshop on Tagging Text with Lexical Semantics, why, what and how?, Washington (1997)
Resnik, P.: Semantic similarity in a taxonomy: an information-based measure and its application to problems of ambiguity in natural language. Journal of Artificial Intelligence Research 11 (1999)
Rigau, G., Atserias, J., Agirre, E.: Combining Unsupervised Lexical Knowledge Methods for Word Sense Disambiguation. In: Proceedings of the 35th Annual Meeting of the ACL, Madrid (1997)
Sussna, M.: Word sense disambiguation for free-text indexing using a massive semantic network. In: Proceedings of the Second International CIKM, Airlington, VA (1993)
Voorhees, E.: Using WordNet to disambiguation word senses for text retrieval. In: Proceedings of the 16th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Pittsburgh, PA (1993)
Wilks, Y., Fass, D., Guo, C., McDonal, J., Plate, T., Slator, B.: Providing Machine Tractable Dictionary Tools. In: Pustejovsky, J. (ed.) S emantics and the Lexicon, Kluwer, Dordrecht (1993)
Wilks, Y., Stevenson, M.: The grammar of sense: Is word sense tagging much more than part-of-speech tagging?, Technical Report CS-96-05, University of Sheffield (1996)
Yarowsky, D.: Word Sense disambiguation using statistical models of Rogetś categories trained on large corpora. In: Proceedings of the 14th COLING, Nantes (1992)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Nica, I., Montoyo, A., Vázquez, S., Martí, M.A. (2004). An Unsupervised WSD Algorithm for a NLP System. In: Meziane, F., Métais, E. (eds) Natural Language Processing and Information Systems. NLDB 2004. Lecture Notes in Computer Science, vol 3136. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-27779-8_25
Download citation
DOI: https://doi.org/10.1007/978-3-540-27779-8_25
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22564-5
Online ISBN: 978-3-540-27779-8
eBook Packages: Springer Book Archive