Abstract
NLP system developers and corpus lexicographers would both benefit from a tool for finding and organizing the distinctive patterns of use of words in texts. Such a tool would be an asset for both language research and lexicon development, particularly for lexicons for Machine Translation. We have developed the waspbench, a tool that (1) presents a “word sketch”, a summary of the corpus evidence for a word, to the lexicographer; (2) supports the lexicographer in analysing the word into its distinct meanings and (3) uses the lexicographer’s analysis as the input to a state-of-the-art word sense disambiguation (WSD) algorithm, the output of which is a “word expert” for the word which can then disambiguate new instances of the word. In this paper we describe a set of evaluation experiments, designed to establish whether waspbench can be used to save time and improve performance in the development of a lexicon for Machine Translation or other NLP application.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Kilgarri., A.: The hard parts of lexicography. International Journal of Lexicography 11 (1998) 51–54
Sinclair, J.M., ed.: Looking Up: An Account of the COBUILD Project in Lexical Computing. Collins, London (1987)
COBUILD: The Collins COBUILD English Language Dictionary. Edited by John McH. Sinclair et al., London. (1987)
Gale, W., Church, K., Yarowsky, D.: A method for disambiguating word senses in a large corpus. Computers and the Humanities 26 (1993) 415–539
Kilgarri., A., Tugwell, D.: Word sketch: Extraction and display of significant collocations for lexicography. In: Proc. Collocations workshop, ACL 2001, Toulouse, France (2001) 32–38
Yarowsky, D.: One sense per collocation. In: Proc. ARPA Human Language Technology Workshop, Princeton (1993)
Kilgarri., A., Tugwell, D.: Wasp-bench: an MT lexicographer’s workstation supporting state-of-the-art lexical disambiguation. In: Proc. MT Summit VIII, Santiago de Compostela, Spain (2001)187–190
Fillmore, C.J.: Two dictionaries. International Journal of Lexicography 2 (1989) 57–83
Atkins, B.T.S., Levin, B.: Admitting impediments. In Zernik, U., ed.: Lexical Acquisition: Exploiting On-Line Resources to Build a Lexicon. Lawrence Erlbaum, Hillsdale, New Jersey (1991) 233–262
Atkins, B.T.S.: Then and now: Competence and performance in 35 years of lexicography. In: 10th EURALEX, Proceedings, Copenhagen (2002) 1–28
Edmonds, P., Kilgarri., A.: Introduction to the special issue on evaluating word sense disambiguation systems. Natural Language Engineering (2002, forthcoming)
Magnini, B., Strapparava, C., Pezzulo, G., Gliozzo, A.: Using domain information for wsd. In: Proc. senseval-2: Second InternationalWorkshop on Evaluating WSD Systems, Toulouse, ACL (2001) 111–114
Vossen, P.: Extending, trimming and fusing wordnet for technical documents. In: Proceedings of the NAACL 2001 Workshop on WordNet and Other Lexical Resources, Pittsburgh (2001) http://www.seas.smu.edu/rada/mwnw/papers/WNWNAACL-105.pdf.
Buitelaar, P., Sacaleanu, B.: Ranking and selecting synsets by domain relevance. In: Proceedings of the NAACL 2001 Workshop on WordNet and Other Lexical Resources, Pittsburgh (2001)
Tugwell, D., Kilgarri., TA.: Waspbench: a lexicographic tool supporting wsd. In: Proc. senseval-2: Second International Workshop on Evaluating WSD Systems, Toulouse, ACL (2001) 151–154
Rundell, M., ed.: Macmillan English Dictionary for Advanced Learners. Macmillan, London (2002)
Kilgarri., A., Rundell, M.: Lexical profiling software and its lexicographical applications-a case study. In: EURALEX 02, Copenhagen (2002)
McEnery, T., ed.: Language Engineering for South Asian Languages: workshop proceedings, University of Lancaster (2001) http://www.emille.lancs.ac.uk/lesal.htm.
Kurohashi, S.: Senseval-2 japanese translation task. In: Proceedings of Second International Workshop of Evaluating Word Sense Disambiguation Systems (SENSEVAL-2), Toulouse (2001) 37–40
Yarowsky, D., Florian, R.: Evaluating sense disambiguation performance across diverse parameter spaces. Journal of Natural Language Engineering (2002) In press Special Issue on Evaluating Word Sense Disambiguation Systems.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kilgarriff, A., Koeling, R. (2003). An Evaluation of a Lexicographer’s Workbench Incorporating Word Sense Disambiguation. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2003. Lecture Notes in Computer Science, vol 2588. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36456-0_23
Download citation
DOI: https://doi.org/10.1007/3-540-36456-0_23
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-00532-2
Online ISBN: 978-3-540-36456-6
eBook Packages: Springer Book Archive