Skip to main content

An Evaluation of a Lexicographer’s Workbench Incorporating Word Sense Disambiguation

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2588))

Abstract

NLP system developers and corpus lexicographers would both benefit from a tool for finding and organizing the distinctive patterns of use of words in texts. Such a tool would be an asset for both language research and lexicon development, particularly for lexicons for Machine Translation. We have developed the waspbench, a tool that (1) presents a “word sketch”, a summary of the corpus evidence for a word, to the lexicographer; (2) supports the lexicographer in analysing the word into its distinct meanings and (3) uses the lexicographer’s analysis as the input to a state-of-the-art word sense disambiguation (WSD) algorithm, the output of which is a “word expert” for the word which can then disambiguate new instances of the word. In this paper we describe a set of evaluation experiments, designed to establish whether waspbench can be used to save time and improve performance in the development of a lexicon for Machine Translation or other NLP application.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Kilgarri., A.: The hard parts of lexicography. International Journal of Lexicography 11 (1998) 51–54

    Article  Google Scholar 

  2. Sinclair, J.M., ed.: Looking Up: An Account of the COBUILD Project in Lexical Computing. Collins, London (1987)

    Google Scholar 

  3. COBUILD: The Collins COBUILD English Language Dictionary. Edited by John McH. Sinclair et al., London. (1987)

    Google Scholar 

  4. Gale, W., Church, K., Yarowsky, D.: A method for disambiguating word senses in a large corpus. Computers and the Humanities 26 (1993) 415–539

    Article  Google Scholar 

  5. Kilgarri., A., Tugwell, D.: Word sketch: Extraction and display of significant collocations for lexicography. In: Proc. Collocations workshop, ACL 2001, Toulouse, France (2001) 32–38

    Google Scholar 

  6. Yarowsky, D.: One sense per collocation. In: Proc. ARPA Human Language Technology Workshop, Princeton (1993)

    Google Scholar 

  7. Kilgarri., A., Tugwell, D.: Wasp-bench: an MT lexicographer’s workstation supporting state-of-the-art lexical disambiguation. In: Proc. MT Summit VIII, Santiago de Compostela, Spain (2001)187–190

    Google Scholar 

  8. Fillmore, C.J.: Two dictionaries. International Journal of Lexicography 2 (1989) 57–83

    Article  Google Scholar 

  9. Atkins, B.T.S., Levin, B.: Admitting impediments. In Zernik, U., ed.: Lexical Acquisition: Exploiting On-Line Resources to Build a Lexicon. Lawrence Erlbaum, Hillsdale, New Jersey (1991) 233–262

    Google Scholar 

  10. Atkins, B.T.S.: Then and now: Competence and performance in 35 years of lexicography. In: 10th EURALEX, Proceedings, Copenhagen (2002) 1–28

    Google Scholar 

  11. Edmonds, P., Kilgarri., A.: Introduction to the special issue on evaluating word sense disambiguation systems. Natural Language Engineering (2002, forthcoming)

    Google Scholar 

  12. Magnini, B., Strapparava, C., Pezzulo, G., Gliozzo, A.: Using domain information for wsd. In: Proc. senseval-2: Second InternationalWorkshop on Evaluating WSD Systems, Toulouse, ACL (2001) 111–114

    Google Scholar 

  13. Vossen, P.: Extending, trimming and fusing wordnet for technical documents. In: Proceedings of the NAACL 2001 Workshop on WordNet and Other Lexical Resources, Pittsburgh (2001) http://www.seas.smu.edu/rada/mwnw/papers/WNWNAACL-105.pdf.

  14. Buitelaar, P., Sacaleanu, B.: Ranking and selecting synsets by domain relevance. In: Proceedings of the NAACL 2001 Workshop on WordNet and Other Lexical Resources, Pittsburgh (2001)

    Google Scholar 

  15. Tugwell, D., Kilgarri., TA.: Waspbench: a lexicographic tool supporting wsd. In: Proc. senseval-2: Second International Workshop on Evaluating WSD Systems, Toulouse, ACL (2001) 151–154

    Google Scholar 

  16. Rundell, M., ed.: Macmillan English Dictionary for Advanced Learners. Macmillan, London (2002)

    Google Scholar 

  17. Kilgarri., A., Rundell, M.: Lexical profiling software and its lexicographical applications-a case study. In: EURALEX 02, Copenhagen (2002)

    Google Scholar 

  18. McEnery, T., ed.: Language Engineering for South Asian Languages: workshop proceedings, University of Lancaster (2001) http://www.emille.lancs.ac.uk/lesal.htm.

  19. Kurohashi, S.: Senseval-2 japanese translation task. In: Proceedings of Second International Workshop of Evaluating Word Sense Disambiguation Systems (SENSEVAL-2), Toulouse (2001) 37–40

    Google Scholar 

  20. Yarowsky, D., Florian, R.: Evaluating sense disambiguation performance across diverse parameter spaces. Journal of Natural Language Engineering (2002) In press Special Issue on Evaluating Word Sense Disambiguation Systems.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2003 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Kilgarriff, A., Koeling, R. (2003). An Evaluation of a Lexicographer’s Workbench Incorporating Word Sense Disambiguation. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2003. Lecture Notes in Computer Science, vol 2588. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36456-0_23

Download citation

  • DOI: https://doi.org/10.1007/3-540-36456-0_23

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-00532-2

  • Online ISBN: 978-3-540-36456-6

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics