Abstract
This paper gives a full presentation of a word sense disambiguation method that was introduced in Hristea and Popescu (Fundam Inform 91(3–4):547–562, 2009) and has so far been tested on adjectives (Hristea and Popescu in Fundam Inform 91(3–4):547–562, 2009) and verbs (Hristea in Int Rev Comput Softw 4(1):58–67, 2009). We extend the method to nouns and draw conclusions about its performance across all three parts of speech. The method lies at the border between unsupervised and knowledge-based techniques: it performs unsupervised word sense disambiguation with an underlying Naïve Bayes model, while using WordNet as the knowledge source for feature selection. Its performance is compared with that of previous approaches that rely on completely different feature sets. Test results for all three parts of speech show that feature selection based on a knowledge source such as WordNet is more effective for disambiguation than local features (such as part-of-speech tags).
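As a rough illustration of the kind of model the abstract describes, the following sketch clusters the contexts of a target word with a Naïve Bayes mixture fitted by EM, keeping only the context words that belong to a given feature vocabulary. It is a minimal sketch under stated assumptions, not the authors' implementation: the function name `em_naive_bayes`, the toy contexts, and the hand-written `wordnet_features` set (a stand-in for words that would be selected from WordNet) are all hypothetical, and the binary-feature encoding and add-one smoothing are choices made here for brevity.

```python
import math
import random


def em_naive_bayes(contexts, vocabulary, n_senses, n_iter=50, seed=0):
    """Cluster contexts into n_senses with a Naive Bayes mixture fitted by EM.

    contexts   : list of token lists, one per occurrence of the target word
    vocabulary : feature words to keep (hypothetical stand-in for words
                 selected from WordNet)
    Returns the index of the most probable sense for each context.
    """
    vocab = sorted(vocabulary)
    # Binary features: does feature word w occur in the context?
    X = []
    for ctx in contexts:
        tokens = set(ctx)
        X.append([1 if w in tokens else 0 for w in vocab])

    # Start from a random soft assignment of contexts to senses.
    rng = random.Random(seed)
    resp = []
    for _ in X:
        row = [rng.random() + 1e-3 for _ in range(n_senses)]
        total = sum(row)
        resp.append([r / total for r in row])

    for _ in range(n_iter):
        # M-step: re-estimate P(sense) and P(feature = 1 | sense),
        # with add-one smoothing so no probability is exactly 0 or 1.
        mass = [sum(r[s] for r in resp) for s in range(n_senses)]
        prior = [(mass[s] + 1.0) / (len(X) + n_senses) for s in range(n_senses)]
        theta = [
            [
                (sum(r[s] * x[j] for r, x in zip(resp, X)) + 1.0) / (mass[s] + 2.0)
                for j in range(len(vocab))
            ]
            for s in range(n_senses)
        ]
        # E-step: posterior P(sense | context), assuming features are
        # conditionally independent given the sense.
        new_resp = []
        for x in X:
            logp = []
            for s in range(n_senses):
                lp = math.log(prior[s])
                for j, xj in enumerate(x):
                    lp += math.log(theta[s][j] if xj else 1.0 - theta[s][j])
                logp.append(lp)
            top = max(logp)
            weights = [math.exp(lp - top) for lp in logp]
            z = sum(weights)
            new_resp.append([w / z for w in weights])
        resp = new_resp

    return [max(range(n_senses), key=lambda s: r[s]) for r in resp]


# Toy, hand-written contexts for the target word "line" (illustrative only).
contexts = [
    ["the", "phone", "line", "was", "busy", "during", "the", "call"],
    ["he", "cut", "the", "phone", "line", "before", "the", "call"],
    ["she", "read", "each", "line", "of", "the", "printed", "text"],
    ["the", "last", "line", "of", "the", "text", "was", "printed"],
]
# Hypothetical feature vocabulary; in a WordNet-based setting these words
# would come from the knowledge source rather than being written by hand.
wordnet_features = {"phone", "call", "text", "printed"}

labels = em_naive_bayes(contexts, wordnet_features, n_senses=2)
print(labels)
```

The cluster indices are arbitrary labels, so mapping them to dictionary senses is a separate step. Restricting the vocabulary keeps the number of model parameters small, which is one plausible reason a knowledge-based feature set can help an unsupervised model of this kind.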
References
Agirre E, Edmonds P (eds) (2006) Word sense disambiguation. Algorithms and applications. Springer, The Netherlands
Banerjee S, Pedersen T (2002) An adapted Lesk algorithm for word sense disambiguation using WordNet. In: Proceedings of the third international conference on intelligent text processing and computational linguistics, Mexico City, February 17–23, pp 136–145
Banerjee S, Pedersen T (2003) Extended gloss overlaps as a measure of semantic relatedness. In: Proceedings of the eighteenth international joint conference on artificial intelligence, Acapulco, Mexico, pp 805–810
Bruce R, Wiebe J (1994) Word sense disambiguation using decomposable models. In: Proceedings of the 32nd meeting of the Association for Computational Linguistics, Las Cruces, New Mexico, pp 139–146
Bruce R, Wiebe J, Pedersen T (1996) The measure of a model. In: Proceedings of the conference on empirical methods in natural language processing, Philadelphia, PA, pp 101–112
Dempster A, Laird N, Rubin D (1977) Maximum likelihood from incomplete data via the EM algorithm. J Royal Stat Soc B 39(1): 1–38
Fellbaum C (ed) (1998) WordNet: an Electronic Lexical Database. The MIT Press, Cambridge, MA
Gale WA, Church KW, Yarowsky D (1992) A method for disambiguating word senses in a large corpus. Comput Humanit 26(5–6): 415–439
Gale WA, Church KW, Yarowsky D (1995) Discrimination decisions for 100,000-dimensional space. Ann Oper Res 55(2): 323–344
Hristea F (2009) Recent advances concerning the usage of the Naïve Bayes Model in unsupervised word sense disambiguation. Int Rev Comput Softw 4(1): 58–67
Hristea F, Popescu M (2009) Adjective sense disambiguation at the border between unsupervised and knowledge-based techniques. Fundam Inform 91(3–4): 547–562
Leacock C, Towell G, Voorhees E (1993) Corpus-based statistical sense resolution. In: Proceedings of the ARPA workshop on human language technology, Princeton, New Jersey, pp 260–265
Lesk M (1986) Automatic sense disambiguation: how to tell a pine cone from an ice cream cone. In: Proceedings of the 1986 SIGDOC conference, New York, Association for Computing Machinery, pp 24–26
Miller GA (1990) Nouns in WordNet: a lexical inheritance system. Int J Lexicography 3(4): 245–264
Miller GA, Beckwith R, Fellbaum C, Gross D, Miller K (1990) WordNet: an on-line lexical database. Int J Lexicography 3(4): 234–244
Miller GA (1995) WordNet: a lexical database. Commun ACM 38(11): 39–41
Miller GA, Hristea F (2006) WordNet nouns: classes and instances. Comput Linguist 32(1): 1–3
Ng H, Lee H (1996) Integrating multiple knowledge sources to disambiguate word sense: an exemplar-based approach. In: Proceedings of the 34th annual meeting of the Association for Computational Linguistics, Santa Cruz, California, pp 40–47
Pedersen T, Bruce R (1997) Distinguishing word senses in untagged text. In: Proceedings of the second conference on empirical methods in natural language processing (EMNLP-2), Providence, Rhode Island, pp 197–207
Pedersen T, Bruce R (1998) Knowledge lean word-sense disambiguation. In: Proceedings of the 15th National conference on artificial intelligence, Madison, Wisconsin, pp 800–805
Pedersen T (2006) Unsupervised corpus-based methods for WSD. In: Agirre E, Edmonds P (eds) Word sense disambiguation. Algorithms and applications. Springer, The Netherlands, pp 133–166
Schütze H (1998) Automatic word-sense discrimination. Comput Linguist 24(1): 97–123
Cite this article
Hristea, F., Popescu, M. & Dumitrescu, M. Performing word sense disambiguation at the border between unsupervised and knowledge-based techniques. Artif Intell Rev 30, 67 (2008). https://doi.org/10.1007/s10462-009-9117-6