Skip to main content
Log in

Performing word sense disambiguation at the border between unsupervised and knowledge-based techniques

  • Published:
Artificial Intelligence Review Aims and scope Submit manuscript

Abstract

This paper aims to fully present a new word sense disambiguation method that has been introduced in Hristea and Popescu (Fundam Inform 91(3–4):547–562, 2009) and so far tested in the case of adjectives (Hristea and Popescu in Fundam Inform 91(3–4):547–562, 2009) and verbs (Hristea in Int Rev Comput Softw 4(1):58–67, 2009). We hereby extend the method to the case of nouns and draw conclusions regarding its performance with respect to all these parts of speech. The method lies at the border between unsupervised and knowledge-based techniques. It performs unsupervised word sense disambiguation based on an underlying Naïve Bayes model, while using WordNet as knowledge source for feature selection. The performance of the method is compared to that of previous approaches that rely on completely different feature sets. Test results for all involved parts of speech show that feature selection using a knowledge source of type WordNet is more effective in disambiguation than local type features (like part-of-speech tags) are.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

Explore related subjects

Discover the latest articles, news and stories from top researchers in related subjects.

References

  • Agirre E, Edmonds P (eds) (2006) Word sense disambiguation. Algorithms and applications. Springer, The Netherlands

    Google Scholar 

  • Banerjee S, Pedersen T (2002) An adapted Lesk algorithm for word sense disambiguation using WordNet. In: Proceedings of the third international conference on intelligent text processing and computational linguistics, Mexico City, February 17–23, pp 136–145

  • Banerjee S, Pedersen T (2003) Extended gloss overlaps as a measure of semantic relatedness. In: Proceedings of the eighteenth international joint conference on artificial intelligence, Acapulco, Mexico, pp 805–810

  • Bruce R, Wiebe J (1994). Word sense disambiguation using decomposable models. In: Proceedings of the 32nd meeting of the Association for Computational Linguistics, Las Cruces, New Mexico, pp 139–146

  • Bruce R, Wiebe J, Pedersen T (1996). The measure of a model. In: Proceedings of the conference on empirical methods in natural language processing, Philadelphia, PA, pp 101–112

  • Dempster A, Laird N, Rubin D (1977) Maximum likelihood from incomplete data via the EM algorithm. J Royal Stat Soc B 39(1): 1–38

    MATH  MathSciNet  Google Scholar 

  • Fellbaum C (ed) (1998) WordNet: an Electronic Lexical Database. The MIT Press, Cambridge, MA

    MATH  Google Scholar 

  • Gale WA, Church KW, Yarowsky D (1992) A method for disambiguating word senses in a large corpus. Comp Humanit 26(5–6): 415–439

    Article  Google Scholar 

  • Gale WA, Church KW, Yarowsky D (1995) Discrimination decisions for 100,000—dimensional space. Ann Oper Res 55(2): 323–344

    Article  MATH  Google Scholar 

  • Hristea F (2009) Recent advances concerning the usage of the Naïve Bayes Model in unsupervised word sense disambiguation. Int Rev Comput Softw 4(1): 58–67

    Google Scholar 

  • Hristea F, Popescu M (2009) Adjective sense disambiguation at the border between unsupervised and knowledge-based techniques. Fundam Inform 91(3–4): 547–562

    MATH  MathSciNet  Google Scholar 

  • Leacock C, Towell G, Voorhees E (1993) Corpus-based statistical sense resolution. In: Proceedings of the ARPA workshop on human language technology, Princeton, New Jersey, pp 260–265

  • Lesk M (1986) Automatic sense disambiguation: how to tell a pine cone from an ice cream cone. In: Proceedings of the 1986 SIGDOC conference, New York, Association for Computing Machinery, pp 24–26

  • Miller GA (1990) Nouns in WordNet: a lexical inheritance system. Int J Lexicography 3(4): 245–264

    Article  Google Scholar 

  • Miller GA, Beckwith R, Fellbaum C, Gross D, Miller K (1990) WordNet: an on-line lexical database. J Lexicography 3(4): 234–244

    Google Scholar 

  • Miller GA (1995) WordNet: a lexical database. Commun ACM 38(11): 39–41

    Article  Google Scholar 

  • Miller GA, Hristea F (2006) WordNet nouns: classes and instances. Comput Linguist 32(1): 1–3

    Article  Google Scholar 

  • Ng H, Lee H (1996) Integrating multiple knowledge sources to disambiguate word sense: an exemplar-based approach. In: Proceedings of the 34th annual meeting of the Society for Computational Linguistics, Santa Cruz, California, pp 40–47

  • Pedersen T, Bruce R (1997) Distinguishing word senses in untagged text. In: Proceedings of the second conference on empirical methods in natural language processing (EMNLP-2), Providence, Rhode Island, pp 197–207

  • Pedersen T, Bruce R (1998) Knowledge lean word-sense disambiguation. In: Proceedings of the 15th National conference on artificial intelligence, Madison, Wisconsin, pp 800–805

  • Pedersen T (2006) Unsupervised corpus-based methods for WSD. In: Agirre E, Edmonds P (eds) Word sense disambiguation Algorithms and Applications. Springer, The Netherlands, pp 133–166

    Chapter  Google Scholar 

  • Schütze H (1998) Automatic word-sense discrimination. Comput Linguist 24(1): 97–123

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Florentina Hristea.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Hristea, F., Popescu, M. & Dumitrescu, M. Performing word sense disambiguation at the border between unsupervised and knowledge-based techniques. Artif Intell Rev 30, 67 (2008). https://doi.org/10.1007/s10462-009-9117-6

Download citation

  • Published:

  • DOI: https://doi.org/10.1007/s10462-009-9117-6

Keywords