Skip to main content
Log in

A Topical/Local Classifier for Word Sense Identification

  • Published:
Computers and the Humanities Aims and scope Submit manuscript

Abstract

TLC is a supervised training (S) system that uses a Bayesianstatistical model and features of a word's context to identifyword sense. We describe the classifier's operation and how itcan be configured to use only topical context cues, only localcues, or a combination of both. Our results on Senseval'sfinal run are presented along with a comparison to theperformance of the best S system and the average for S systems.We discuss ways to improve TLC by enriching its featureset and by substituting other decision procedures for the Bayesianmodel. Future development of supervised training classifiers willdepend on the availability of tagged training data. TLC canassist in the hand-tagging effort by helping human taggers locateinfrequent senses of polysemous words.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Similar content being viewed by others

References

  • Brill, E. “Some advances in rule-based part of speech tagging” Proceedings of the Twelfth National Conference on Artificial Intelligence, Seattle: AAAI. 1994.

    Google Scholar 

  • Chiang T-H., Y-C. Lin and K-Y Su. “Robust learning, smoothing, and parameter tying on syntactic ambiguity resolution”, Computational Linguistics, Vol. no. 21–3, 1995, pp. 321–349.

  • Fellbaum, C. (ed). WordNet: An Electronic Lexical Database, Cambridge: MIT Press. 1998.

    Google Scholar 

  • Good, I. F. “The population frequencies of species and the estimation of population parameters”, Biometrica, Vol. no. 40, 1953, pp. 237–264.

    Google Scholar 

  • Joshi, A.K., B. Srinivas. “Disambiguation of Super Parts of Speech (or Supertags): Almost parsing”, Proceedings of COLING 1994, 1994, pp. 154–160.

  • Leacock, C., M. Chodorow and G. A. Miller. “Using corpus statistics and WordNet relations for sense identification”, Computational Linguistics, Vol. no. 24–1, 1998, pp. 147–165.

    Google Scholar 

  • Miller, G. A., R. Tengi and S. Landes (submitted for publication). “Matching the Tagging to the Task”.

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

About this article

Cite this article

Chodorow, M., Leacock, C. & Miller, G.A. A Topical/Local Classifier for Word Sense Identification. Computers and the Humanities 34, 115–120 (2000). https://doi.org/10.1023/A:1002463121011

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1023/A:1002463121011

Navigation