Abstract
Lay people discussing machine translation systems often perform a round trip translation, that is translating a text into a foreign language and back, to measure the quality of the system. The idea behind this is that a good system will produce a round trip translation that is exactly (or perhaps very close to) the original text. However, people working with machine translation systems intuitively know that round trip translation is not a good evaluation method. In this article we will show empirically that round trip translation cannot be used as a measure of the quality of a machine translation system. Even when using translations of multiple machine translation systems into account, to reduce the impact of errors of a single system, round trip translation cannot be used to measure machine translation quality.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Papineni, K., Roukos, S., Ward, T., Zhu, W.-J.: Bleu: a method for automatic evaluation of machine translation. In: Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics (ACL), July 2002, pp. 311–318 (2002), http://www1.cs.columbia.edu/nlp/sgd/bleu.pdf
Turian, J.P., Shen, L., Melamed, I.D.: Evaluation of machine translation and its evaluation. In: Proceedings of MT Summit IX (2003)
Banerjee, S., Lavie, A.: Meteor: An automatic metric for MT Evaluation with improved correlation with human judgements. In: Proceedings of Workshop on Intrinsic and Extrinsic Evaluation Measures for MT and/or Summarization at the 43th Annual Meeting of the Association of Computational Linguistics (ACL 2005) (June 2005)
Somers, H.: Round-trip translation: What is it good for. In: Proceedings of the Australasian Language Technology Workshop (2005)
Koehn, P.: Europarl: A multilingual corpus for evaluation of machine translation (2002), http://people.csail.mit.edu/~koehn/publications/europarl.ps
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
van Zaanen, M., Zwarts, S. (2006). Unsupervised Measurement of Translation Quality Using Multi-engine, Bi-directional Translation. In: Sattar, A., Kang, Bh. (eds) AI 2006: Advances in Artificial Intelligence. AI 2006. Lecture Notes in Computer Science(), vol 4304. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11941439_149
Download citation
DOI: https://doi.org/10.1007/11941439_149
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-49787-5
Online ISBN: 978-3-540-49788-2
eBook Packages: Computer ScienceComputer Science (R0)