Abstract
A promising approach to cope with the challenges that Word Sense Disambiguation brings is to use knowledge-intensive methods. Typically they rely on Wikipedia for supporting automatic concept identification. The exclusive use of Wikipedia as a knowledge base for word disambiguation and therefore the general identification of topics, however, have low accuracy vis-à-vis texts with diverse topics, as can be the case with blogs. This motivated us to propose a method for word disambiguation that, in addition to the use of Wikipedia, uses a common sense database. Use of this base enriches the definition of the concepts previously identified with the help of Wikipedia, and permits the definition of a similarity measure between concepts, which is characterized by verifying the similarity of two concepts from the viewpoint of conceptual proximity in the Wikipedia hierarchy, in addition to the proximity between such concepts in terms of the inferences that they can make. We show that by doing this, we improved the accuracy of automatic disambiguation of words compared with methods that do not use a common sense base.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Stevenson, M., Wilks, Y.: Word Sense Disambiguation. In: Mitkov, R. (ed.) Oxford Handbook of Computational Linguistics, pp. 249–265. Oxford University Press (2003)
Strube, M., Ponzetto, S.: WikiRelate! Computing semantic relatedness using Wikipedia. In: Proceedings of the National Conference on Artificial Intelligence, pp. 14–19. AAAI Press, London (2006)
Witten, I., Milne, D.: An effective, low-cost measure of semantic relatedness obtained from Wikipedia links. In: Proceeding of AAAI Workshop on Wikipedia and Artificial Intelligence: An Evolving Synergy, pp. 25–30. AAAI Press, Chicago (2008)
Pinheiro, V., Pequeno, T., Furtado, V., Franco, W.: InferenceNet.Br: Expression of Inferentialist Semantic Content of the Portuguese Language. In: Pardo, T.A.S., Branco, A., Klautau, A., Vieira, R., de Lima, V.L.S. (eds.) PROPOR 2010. LNCS (LNAI), vol. 6001, pp. 90–99. Springer, Heidelberg (2010a)
Pinheiro, V., Pequeno, T., Furtado, V., Nogueira, D.: Natural Language Processing Based on Semantic Inferentialism for Extracting Crime Information from Text. In: Proceeding of the IEEE Intelligence and Security Informatics (ISI), pp. 19–24 (2010b)
Brandom, R.B.: Articulating Reasons: An Introduction to Inferentialism. Harvard University Press, Cambridge (2000)
Mihalcea, R., Tarau, P.: TextRank: Bringing order into texts, pp. 404–411. Association for Computational Linguistics, Barcelona (2004)
Mihalcea, R., Csomai, A.: Wikify!: Linking documents to encyclopedic knowledge. CIKM 7, 233–242 (2007)
Li, C., Sun, A., Datta, A.: A Generalized Method for Word Sense Disambiguation Based on Wikipedia. In: Clough, P., Foley, C., Gurrin, C., Jones, G.J.F., Kraaij, W., Lee, H., Mudoch, V. (eds.) ECIR 2011. LNCS, vol. 6611, pp. 653–664. Springer, Heidelberg (2011)
Milne, D., Witten, I.: Learning to link with Wikipedia, pp. 509–518. ACM (2008)
Milne, D., Witten, I.: An open-source toolkit for mining Wikipedia. In: Proc. of New Zealand Computer Science Research Student Conference, vol. 9 (2009)
Lesk, M.: Automatic sense disambiguation using machine readable dictionaries: how to tell a pine cone from an ice cream cone, pp. 24–26. ACM (1986)
Medelyan, O., Witten, I., Milne, D.: Topic indexing with Wikipedia. In: AAAI WikiAI workshop (2008)
Santos, H., Furatdo, V., Pinheiro, V., Ferreira, C., Vasconcelos, J.E., Shiki, G.: Widgets baseados em conhecimento advindo de dados referenciados e abertos na Web. In: Proceeding of the XVII WebMedia (2011)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Pinheiro, V., Furtado, V., Freire, L.M., Ferreira, C. (2012). Knowledge-Intensive Word Disambiguation via Common-Sense and Wikipedia. In: Barros, L.N., Finger, M., Pozo, A.T., Gimenénez-Lugo, G.A., Castilho, M. (eds) Advances in Artificial Intelligence - SBIA 2012. SBIA 2012. Lecture Notes in Computer Science(), vol 7589. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-34459-6_19
Download citation
DOI: https://doi.org/10.1007/978-3-642-34459-6_19
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-34458-9
Online ISBN: 978-3-642-34459-6
eBook Packages: Computer ScienceComputer Science (R0)