Abstract
Word Sense Disambiguation (WSD) aims at determining the appropriate sense of a word in a particular context. Although it is a highly relevant task for Natural Language Processing, there are few works for Portuguese, which are tailored to specific applications, such as translation and information retrieval. In this work, we report our investigation of some general purpose WSD methods for nouns in Portuguese, tackling two additional challenges: using Princeton Wordnet (for English) as the sense repository and applying/customizing a WSD method for multi-document applications, which, to the best of our knowledge, has not been addressed before. In this paper, we also report our efforts on building a sense annotated corpus (for nouns, only), which was used for evaluating the investigated WSD methods.
The corresponding MSc dissertation is available at www.teses.usp.br/teses/disponiveis/55/55134/tde-28082013-145948
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Agirre, E., Edmonds, P.: Introduction. In: Word Sense Disambiguation: Algorithms and Applications. Springer (2006)
Agirre, E., Soroa, A.: Personalizing pagerank for word sense disambiguation. In: Proceedings of 12th Conference of the European Chapter of the ACL, pp. 33–41 (2009)
Banerjee, S.: Adapting the Lesk algorithm for word sense disambiguation to wordnet. Master’s thesis. Department of Computer Science, University of Minnesota (2002)
Brin, S., Page, L.: The anatomy of a large-scale hypertextual web search engine. In: Proceedings of 17th International World-Wide Web Conference, WWW 1998 (1998)
Cardoso, P.C.F., Maziero, E.G., Jorge, M.L.R.C., Seno, E.M.R., Di Felippo, A., Rino, L.H.M., Nunes, M.D.G.V., Pardo, T.A.S.: CSTNews – a discourse-annotated corpus for single and multi-document summarization of news texts in brazilian portuguese. In: Anais do III Workshop “A RST e os Estudos do Texto”, pp. 88–105. Sociedade Brasileira de Computação, Cuiabá (2011)
Carletta, J.: Assessing agreement on classification tasks: The kappa statistic. Computational Linguistics 22, 249–254 (1996)
Kilgarriff, A., England, B., Rosenzweig, J.: English senseval: Report and results. In: Proceedings of 2nd International Conference on Language Resources and Evaluation, pp. 1239–1244 (2000)
Lesk, M.: Automatic sense disambiguation using machine readable dictionaries: How to tell a pine cone from an ice cream cone. In: Proceedings of 5th Annual International Conference on Systems Documentation, pp. 24–26. Association for Computing Machinery, New York (1986)
Machado, I.M., de Alencar, R.O., de, O.C., Junior, R., Davis, C.A.: An ontological gazetteer and its application for place name disambiguation in text. Journal of the Brazilian Computer Society 17, 267–279 (2011)
Mihalcea, R., Moldovan, D.I.: A method for word sense disambiguation of unrestricted text. In: Proceedings of 37th Annual Meeting of the Association for Computational Linguistics, pp. 152–158. Association for Computational Linguistics, College Park (1999)
Mihalcea, R., Radev, D.: Graph-based natural language processing and information retrieval. Cambridge University Press (2011)
Miller, G.A.: Wordnet: A lexical database for english. Communications of the ACM 38, 39–41 (1995)
Plaza, L., Diaz, A.: Using semantic graphs and word sense disambiguation techniques to improve text summarization. Proceedings of Procesamiento del Lenguaje Natural 47, 97–105 (2011)
Specia, L.: Uma Abordagem Híbrida Relacional para a Desambiguação Lexical de Sentido na Tradução Automática. PhD thesis. Instituto de Ciências Matemáticas e de Computação–ICMC–USP (2007)
Vasilescu, F., Langlais, P., Lapalme, G.: Evaluating variants of the Lesk approach for disambiguating words. In: Proceedings of Language Resources and Evaluation (LREC 2004), Lisbon, Portugal, pp. 633–636 (2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Nóbrega, F.A.A., Pardo, T.A.S. (2014). General Purpose Word Sense Disambiguation Methods for Nouns in Portuguese. In: Baptista, J., Mamede, N., Candeias, S., Paraboni, I., Pardo, T.A.S., Volpe Nunes, M.d.G. (eds) Computational Processing of the Portuguese Language. PROPOR 2014. Lecture Notes in Computer Science(), vol 8775. Springer, Cham. https://doi.org/10.1007/978-3-319-09761-9_9
Download citation
DOI: https://doi.org/10.1007/978-3-319-09761-9_9
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-09760-2
Online ISBN: 978-3-319-09761-9
eBook Packages: Computer ScienceComputer Science (R0)