A Methodology for a Semi-Automatic Evaluation of the Lexicons of Machine Translation Systems

Guessoum, Ahmed; Zantout, Rached

doi:10.1023/A:1014504808954

A Methodology for a Semi-Automatic Evaluation of the Lexicons of Machine Translation Systems

Published: June 2001

Volume 16, pages 127–149, (2001)
Cite this article

Machine Translation

Ahmed Guessoum¹ &
Rached Zantout²

83 Accesses
3 Citations
Explore all metrics

Abstract

The lexicon is a major part of any Machine Translation (MT) system. If the lexicon of an MT system is not adequate, this will affect the quality of the whole system. Building a comprehensive lexicon, i.e., one with a high lexical coverage, is a major activity in the process of developing a good MT system. As such, the evaluation of the lexicon of an MT system is clearly a pivotal issue for the process of evaluating MT systems. In this paper, we introduce a new methodology that was devised to enable developers and users of MT Systems to evaluate their lexicons semi-automatically. This new methodology is based on the idea of the importance of a specific word or, more precisely, word sense, to a given application domain. This importance, or weight, determines how the presence of such a word in, or its absence from, the lexicon affects the MT system's lexical quality, which in turn will naturally affect the overall output quality. The method, which adopts a black-box approach to evaluation, was implemented and applied to evaluating the lexicons of three commercialEnglish–Arabic MT systems. A specific domain was chosen in which the various word-sense weights were determined by feeding sample texts from the domain into a system developed specifically for that purpose. Once this database of word senses and weights was built, test suites were presented to each of the MT systems under evaluation and their output rated by a human operator as either correct or incorrect. Based on this rating, an overall automated evaluation of the lexicons of the systems was deduced.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

VERTa: a linguistic approach to automatic machine translation evaluation

Article 15 October 2018

Elisabet Comelles & Jordi Atserias

Machine Translation Evaluation: Manual Versus Automatic—A Comparative Study

Plausibility Testing for Lexical Resources

References

Anon: 1996, [The Machine Translator: Al-Wafy], (Arabuter) 8, 27–28.
Google Scholar
Arnold, Doug, R. Lee Humphreys and Louisa Sadler (eds): 1993, ‘Special Issue on Evaluation of MT Systems’, Machine Translation 8(1–2).
Dorr, Bonnie J. and Judith Klavans: 1994/5, ‘Special Issue: Building Lexicons for Machine Translation’, Machine Translation 9(3–4), 10(1–2).
Dyson, Mary C. and Jean Hannah: 1987, ‘Towards a Methodology for the Evaluation of Machine-Assisted Translation Systems’, Computers and Translation 2, 163–176.
Google Scholar
Eagles (Expert Advisory Group on Language Engineering Standards): 1995, ‘Evaluation of Natural Language Processing Systems’, Eagles Document EAG-EWG-PR.2.
Eagles: 1998, Proceedings of the Second Eagles II Workshop on Evaluation in Human Language Technology, Geneva.
Hutchins, W. John and Harold L. Somers: 1992, An Introduction to Machine Translation, London: Academic Press.
Google Scholar
Jihad, A. : 1996, [Has the Arabic Machine Translation Era Started?], (Byte Middle East), November 1996, pp. 36–48.
King, M. and K. Falkedal: 1990, ‘Using Test Suites in Evaluation of Machine Translation Systems’, in COLING-90: Papers presented to the 13th International Conference on Computational Linguistics, Helsinki, Finland, Vol. 2, pp. 211–216.
Google Scholar
Klein, J., S. Lehmann, K. Netter and T. Wegst: 1999, ‘DiET in the Context of MT Evaluation’, in Rita Nübel and Uta Seewald-Heeg (eds), Evaluation of the Linguistic Performance of Machine Translation Systems, Proceedings of the Konvens'98, St. Augustin: Gardezi Verlag, pp. 107–126.
Google Scholar
Lehmann, Sabine, Stephan Oepen, Sylvie Regnier-Prost, Klaus Netter, Veronika Lux, Judith Klein, Kirsten Falkedal, Frederik Fouvry, Dominique Estival, Eva Dauphin, Herv, Compagnion, Judith Baur, Lorna Balkan and Doug Arnold: 1996, ‘TSNLP — Test Suites for Natural Language Processing’, COLING-96: The 16th International Conference on Computational Linguistics, Copenhagen, Denmark, pp. 711–716.
Lehrberger, John and Laurent Bourbeau: 1987, Machine Translation: Linguistic Characteristics of MT Systems and General Methodology of Evaluation, Amsterdam: John Benjamins.
Google Scholar
Mason Jane and Adriane Rinsche: 1995, Ovum Evaluates: Translation Technology Products, London: OVUM Ltd.
Google Scholar
Melby, A. K.: 1988, ‘Lexical Transfer: Between a Source Rock and a Hard Target’, Coling Budapest: Proceedings of the 12th International Conference on Computational Linguistics, Budapest, pp. 411–419.
Nagao, M.: 1985, ‘Evaluation of the Quality of Machine-Translated Sentences and the Control of Language’, Journal of the Information Processing Society of Japan 26, 1197–1202.
Google Scholar
Nyberg, Eric H. and Teruko Mitamura: 1992, ‘The KANT System: Fast, Accurate, High-Quality Translation in Practical Domains’, Proceedings of the fifteenth [sic] International Conference on Computational Linguistics, COLING-92, Nantes, pp. 1069–1073.
Nyberg, Eric H., Teruko Mitamura and Jaime G. Carbonell: 1994, ‘Evaluation Metrics for Knowledge-Based Machine Translation’, COLING 94: The 15th International Conference on Computational Linguistics, Kyoto, Japan, pp. 95–99.
Qendelft, G. : 1997, [The Translation Program Al-Wafy Is Useful for Getting a General Understanding of a Letter Written in English], (Al-Hayat) (Saturday, 25 October).
Sinaiko, H. W. and G. R. Klare: 1972, ‘Further Experiments in Language Translation: Readability of Computer Translations’, ITL 15, 1–29.
Google Scholar
Sinaiko, H. W. and G. R. Klare: 1973, ‘Further Experiments in Language Translation: A Second Evaluation of the Readability of Computer Translations’, ITL 19, 29–52.
Google Scholar
van Slype, G.: 1979, ‘Systran: Evaluation of the 1978 Version of the Systran English-French Automatic System of the Commission of the European Communities’, The Incorporated Linguist 18, 86–89.
Google Scholar
Vasconcellos, Muriel (ed.): 1988. Technology as Translation Strategy, Binghamton, NY: State University of New York at Binghamton (SUNY).
Google Scholar
White, John S., Theresa O'Connell and Francis O'Mara: 1994, ‘The ARPA MT Evaluation Methodologies: Evolution, Lessons, and Future Approaches’, Technology Partnerships for Crossing the Language Barrier: Proceedings of the First Conference of the Association for Machine Translation in the Americas, Columbia, Maryland, pp. 193–205.
Wilks Y.: 1992, ‘Systran: It Obviously Works, But How Much Can It Be Improved?’, in John Newton (ed.), Computers and Translation: A Practical Appraisal, London: Routledge, pp. 166–188.
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, University of Sharjah, P.O. Box, 27272, Sharjah, UAE
Ahmed Guessoum
Department of Computer Science, Faculty of Sciences, University of Balamand, P.O. Box, 100, Tripoli, Lebanon
Rached Zantout

Authors

Ahmed Guessoum
View author publications
You can also search for this author in PubMed Google Scholar
Rached Zantout
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

About this article

Cite this article

Guessoum, A., Zantout, R. A Methodology for a Semi-Automatic Evaluation of the Lexicons of Machine Translation Systems. Machine Translation 16, 127–149 (2001). https://doi.org/10.1023/A:1014504808954

Download citation

Issue Date: June 2001
DOI: https://doi.org/10.1023/A:1014504808954

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

A Methodology for a Semi-Automatic Evaluation of the Lexicons of Machine Translation Systems

Abstract

Access this article

Similar content being viewed by others

VERTa: a linguistic approach to automatic machine translation evaluation

Machine Translation Evaluation: Manual Versus Automatic—A Comparative Study

Plausibility Testing for Lexical Resources

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Navigation

A Methodology for a Semi-Automatic Evaluation of the Lexicons of Machine Translation Systems

Abstract

Access this article

Similar content being viewed by others

VERTa: a linguistic approach to automatic machine translation evaluation

Machine Translation Evaluation: Manual Versus Automatic—A Comparative Study

Plausibility Testing for Lexical Resources

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation