Abstract
The need for advanced free-text filtering in multilingual environments is increasing. When searching for specific keywords in a multilingual information space, it is desirable to eliminate occurrences in which the word or words of each language are used in an inappropriate sense. This capability could be exploited in Internet browsers, resource discovery systems, relational databases containing free-text fields, electronic document management systems, data warehouse and data mining systems, etc. To address this problem, we present a Word Sense Disambiguation interface that returns word senses in different languages and could be employed in multilingual natural language applications. The interface resolves the lexical ambiguity of nouns and verbs in input texts in some European languages (English, Spanish), using the taxonomy of the EuroWordNet lexical knowledge database, and returns a multilingual output of the word senses (English, Spanish, Catalan and Basque). In addition to the relations in WordNet 1.5, EuroWordNet includes cross-language and cross-category relations, which are directly useful for multilingual Word Sense Disambiguation. The interface has been implemented in C++ and provides a visual framework.
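To make the idea concrete, here is a minimal sketch of taxonomy-based disambiguation followed by a cross-language lookup. It is not the authors' algorithm or their C++ interface, neither of which is described in this abstract. It uses a simple hypernym-overlap heuristic as a stand-in. The sense IDs, the small taxonomy, the `ILI` table and all function names are hypothetical, hand-written for illustration and not taken from EuroWordNet.

```python
# Toy data only: hypothetical sense IDs and a hand-written taxonomy,
# NOT records from EuroWordNet.
HYPERNYM = {
    "plant.organism": "organism",
    "tree.organism": "plant.organism",
    "plant.factory": "building",
    "organism": "entity",
    "building": "entity",
}

# English lemma -> candidate sense IDs (assumed inventory).
SENSES = {
    "plant": ["plant.organism", "plant.factory"],
    "tree": ["tree.organism"],
}

# Sense ID -> lemma per language; stands in for cross-language links.
ILI = {
    "plant.organism": {"en": "plant", "es": "planta", "ca": "planta", "eu": "landare"},
    "plant.factory": {"en": "plant", "es": "fábrica", "ca": "fàbrica", "eu": "lantegi"},
    "tree.organism": {"en": "tree", "es": "árbol", "ca": "arbre", "eu": "zuhaitz"},
}


def ancestors(sense):
    """Return the sense followed by all of its hypernyms up to the root."""
    chain = [sense]
    while chain[-1] in HYPERNYM:
        chain.append(HYPERNYM[chain[-1]])
    return chain


def disambiguate(word, context):
    """Pick the sense of `word` that shares the most taxonomy nodes
    with the candidate senses of the context words."""
    def score(sense):
        own = set(ancestors(sense))
        return sum(
            len(own & set(ancestors(other)))
            for ctx in context
            for other in SENSES.get(ctx, [])
        )
    return max(SENSES[word], key=score)


def multilingual_sense(word, context):
    """Disambiguate `word`, then return its sense ID and per-language lemmas."""
    sense = disambiguate(word, context)
    return sense, ILI[sense]


if __name__ == "__main__":
    print(multilingual_sense("plant", ["tree"]))
```

In this toy setting, "plant" in the context of "tree" resolves to the organism sense because the two share most of their hypernym chain, and the lookup table then supplies the Spanish, Catalan and Basque lemmas for that sense.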
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Montoyo, A., Romero, R., Vázquez, S., Calle, C., Soler, S. (2002). The Role of WSD for Multilingual Natural Language Applications. In: Sojka, P., Kopeček, I., Pala, K. (eds) Text, Speech and Dialogue. TSD 2002. Lecture Notes in Computer Science(), vol 2448. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-46154-X_6
DOI: https://doi.org/10.1007/3-540-46154-X_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44129-8
Online ISBN: 978-3-540-46154-8
eBook Packages: Springer Book Archive