Abstract
Machine Translation of arbitrary input is difficult,but the output quality can be improved significantlyif writers create documents with MT in mind. Thisarticle deals with ``MTranslatability'' – translatabilityof texts by MT systems. It identifies characteristics oftext that decrease MTranslatability and suggests waysto improve them. It also illustrates the effect of writingfor MTranslatability by showing before-and-after picturesof output from various commercially available MT systems,and gives an overview of tools that help identify and correctthe problems.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
AECMA: 1995, ‘A Guide for the Preparation of Aircraft Maintenance Documentation in the Aerospace Maintenance Language. AECMA Simplified English’. Brussels. AECMA Document: PSC-85-16598, Issue 1.
Anonymous: 1995, ‘Why Spell Check Does not Work — A Linguistic Odyssey’, Available on the Web at http://www.linguistlist.org/issues/6/6-407.html (as of May 9, 2001). Linguist List 6.407.
APS: 1997, ‘Dear Plant Doctor...’. Available on the Web at http://www.scisoc.org/visitors/plantdoc2.htm (as of April 4, 2002). American Phytopathological Society.
Becker, M., A. Bredenkamp, B. Crysmann, and J. Klein: forthcoming. ‘Annotations of Error Types for German USENET News Corpus’. In: A. Abeillé (ed.): Treebanks. Building and Using Syntactically Annotated Corpora. Dordrecht, Kluwer. To appear.
Bernth, A.: 1997, ‘EasyEnglish: A Tool for Improving Document Quality’. In: Fifth Conference on Applied Natural Language Processing. Washington, DC, pp. 159–165.
Bernth, A.: 1998, ‘EasyEnglish: Preprocessing for MT’. In: Proceedings of the Second International Workshop on Controlled Language Applications, CLAW 98. Pittsburgh, PA, pp. 30–41.
Bernth, A.: 1999a, ‘A Confidence Index for Machine Translation’. In: Proceedings of the 8th International Conference on Theoretical and Methodological Issues in Machine Translation (TMI 99). Chester, England, pp. 120–127.
Bernth, A.: 1999b, ‘Controlling Input and Output of MT for Greater User Acceptance’. In: Translating and the Computer 21. London, no page numbering.
Bernth, A.: 1999c, ‘Tools for Improving E-G MT Quality’. In: Workshop on Problems and Potential of English-to-German MT Systems. Chester, England, no page numbering. Held in conjunction with the 8th International Conference on Theoretical and Methodological Issues in Machine Translation.
Bernth, A. and C. Gdaniec: 2000, ‘A Translation Confidence Index for English-German MT’. Technical Report RC 22403, IBM T.J. Watson Research Center, Yorktown Heights, NY.
Bernth, A. and M. C. McCord: 2000, ‘The Effect of Source Analysis on Translation Confidence’. In: J. S. White (ed.): Envisioning Machine Translation in the Information Future, 4th Conference of the Association for Machine Translation in the Americas. Berlin, Springer, pp. 89–99.
Bredenkamp, A., B. Crysmann, and M. Petrea: 2000, ‘Looking for Errors: A Declarative Formalism for Resource-Adaptive Language Checking’. In: Proceedings of the 2nd International Conference on Language Resources and Evaluation. Athens, Greece, pp. 667–673.
DFKI: 2000, ‘FLAG: Flexible Language and Grammar Checking’, Available on the Web at http://flag.dfki.de (as of May 17, 2001).
Ducrot, D. M.: 1989, ‘Le système Titus IV: système de traduction automatique et simultanée en quatre langues’ [The Titus IV system: a system for simultaneous machine translation in four languages]. In: A. Abbou (ed.): Traduction assistée par ordinateur: perspectives technologiques, industrielles et économiques envisageables à l'horizon 1990: l'offre, la demande, les marchés et les évolution en cours. Paris, Editions Daicadif.
Elliston, J. S. G.: 1979, ‘Computer Aided Translation: A Business Viewpoint’. In: B. M. Snell (ed.): Translating and the Computer. Amsterdam: North-Holland, pp. 149–158.
Farwell, D., L. Gerber, and E. Hovy (eds): 1998, Machine Translation and the Information Soup, Third Conference of the Association for Machine Translation in the Americas. Berlin, Springer.
Gdaniec, C.: 1994, ‘The Logos Translatability Index’. In: Technology Paratnerships for Crossing the Language Barrier: Proceedings of the First Conference of the Association for Machine Translation in The Americas. Columbia, Maryland, pp. 97–105.
Godden, K.: 1998a, ‘Controlling the Business Environment for Controlled Language’. In: Proceedings of the Second International Workshop on Controlled Language Applications, CLAW 98. Pittsburgh, PA, pp. 185–190.
Godden, K.: 1998b, ‘Machine Translation in Context’. In: E. Hovy (eds): 1998, Machine Translation and the Information Soup, Third Conference of the Association for Machine Translation in the Americas. Berlin, Springer Farwell et al. (1998), pp. 158–163.
Godden, K.: 2000, ‘The Evolution of CASL Controlled Authoring at General Motors’. In: Proceedings of the Third International Workshop on Controlled Language Applications, CLAW 2000. Seattle, WA, pp. 14–19.
Godden, K.: 2002. Personal communication.
Haller, J.: 2000, ‘MULTIDOC. Authoring Aids for Multilingual Technical Documentation’. In: Proceedings of the 1st Congress of Specialized Translation. Barcelona, pp. 143–147.
Halliday, M. A. K. and R. Hasan: 1976, Cohesion in English. London, Longman Group.
Harkus, S.: 2000, ‘Writing for Translation’. In: Proceedings of the Australasian Online Documentation Conference, Brisbane, Australia, pp. 154–166.
Hashida, K.: 1997, ‘Global Document Annotation’. In: 4th Natural Language Processing Pacific Rim Symposium '97, NLPRS-97. Phuket, Thailand.
Hashida, K.: 2000, ‘GDA: Semantically Annotated Documents as Intelligent Content’. In: Coling 2000 Workshop on Semantic Annotation and Intelligent Content. Luxembourg. Oral presentation only.
Hoard, J., R. Wojcik, and K. Holzhauser: 1992, ‘An Automated Grammar and Style Checker for Writers of Simplified English’. In: P. Holt and N. Williams (eds): Computers and Writing. State of the Art. Oxford, Intellect, pp. 278–296.
Holmback, H., L. Duncan, and P. Harrison: 2000, ‘A Word Sense Checking Application for Simplified English’. In: Proceedings of the Third International Workshop on Controlled Language Applications, CLAW 2000. Seattle, WA, pp. 120–133.
Huijsen, A. S.-O.: 1998, ‘Controlled Language — An Introduction’. In: Proceedings of the Second International Workshop on Controlled Language Applications, CLAW 98. Pittsburgh, PA, pp. 1–15.
Hutchins, W. J.: 1998, ‘Twenty years of Translating and the Computer’. In: Translating and the Computer 20. London, no page numbering.
Hutchins, W. J. and H. Somers: 1992, An Introduction to Machine Translation. London, Academic Press.
Hydroplant AG: 2001, ‘Frage von Jonas 2001.12.20 [Question from Jonas]’. Available on the Web at http://www.hydroplant.ch/pflanzendoktor/ (as of April 4, 2002).
Jackendoff, R. S.: 1972, Semantic Interpretation in Generative Grammar. Cambridge, Massachusetts, MIT Press.
Kamprath, C., E. Adolphson, T. Mitamura, and E. Nyberg: 1998, ‘Controlled Language for Multilingual Document Production: Experience with Caterpillar Technical English’. In: Proceedings of the Second International Workshop on Controlled Language Applications, CLAW 98. Pittsburgh, PA, pp. 51–61.
Kay, M.: 1973, ‘The MIND System’. In: R. Rustin (ed.): Natural Language Processing. New York, Algorithmics Press, pp. 155–188.
Kohl, J. R.: 1999, ‘Improving Translatability and Readability with Syntactic Cues’. Technical COMMUNICATION, pp. 149–166.
Korpela, J.: 1998, ‘Translation-friendly authoring, especially in HTML for the WWW’. Available on the Web at http://www.malibutelecom.com/yucca/transl/ and http://www.cs.tut.fi/jkorpela/transl/ (as of May 17, 2001).
Kurohashi, S. and M. Nagao: 1994, ‘A Syntactic Analysis Method of Long Japanese Sentences Based on the Detection of Conjunctive Structures’. Computational Linguistics 20, 507–534.
Kurohashi, S. and M. Nagao: 1998, ‘Building a Japanese Parsed Corpus while Improving the Parsing System’. In: Proceedings of the First International Conference on Language Resources and Evaluation. Granada, Spain, pp. 719–724.
Langlais, P., G. Foster, and G. Lapalme: 2000, ‘Unit Completion for a Computer-aided Translation Typing System’. Machine Translation 15, 267–294.
Langlais, P., G. Lapalme, and M. Loranger: 2002, ‘TransType: from an Idea to a System’. Machine Translation (Special Issue on Embedded Machine Translation Systems). To appear.
Language Partners International: 2001, ‘Writing for Translation’. Available on the Web at http://www.languagepartners.com/reference-center/wri4tr.htm (as of May 17, 2001).
Maruyama, H., H. Watanabe, and S. Ogino: 1990, ‘An Interactive Japanese Parser for Machine Translation’. In: COLING-90: Papers presented to the 13th International Conference on Computational Linguistics, Helsinki, Vol. 2, pp. 257–262.
Mason, J. and R. Rinsche: 1995, Translation Technology Products. London, OVUM Ltd.
McCord, M. C.: 1980, ‘Slot Grammars’. Computational Linguistics 6, 31–43.
McCord, M. C.: 1990, ‘Slot Grammar: A System for Simpler Construction of Practical Natural Language Grammars’. In: R. Studer (ed.): Natural Language and Logic: International Scientific Symposium. Berlin: Springer Verlag, pp. 118–145.
McCord, M. C. and A. Bernth: 1998, ‘The LMT Transformational System’. In: E. Hovy (eds): 1998, Machine Translation and the Information Soup, Third Conference of the Association for Machine Translation in the Americas. Berlin, Springer Farwell et al. (1998), pp. 344–355.
Means, L. and K. Godden: 1996, ‘The Controlled Automotive Service Language (CASL) Project’. In: Proceedings of the First International Workshop on Controlled Language Applications, CLAW-96. Belgium, Leuven, pp. 106–114.
Means, L. G., P. Chapman, and A. Liu: 2000, ‘Training for Controlled Language Processes’. In: Proceedings of the Third International Workshop on Controlled Language Applications, CLAW 2000. Seattle, WA, pp. 1–13.
Mitamura, T.: 1999, ‘Controlled Language for Multilingual Machine Translation’. In: Machine Translation Summit VII: MT in the Great Translation Era. Singapore, pp. 46–52.
Mitamura, T. and E. Nyberg: 1995, ‘Controlled English for Knowledge-Based MT: Experience with the KANT System’. In: Proceedings of Sixth International Conference on Theoretical and Methodological Issues in Machine Translation, TMI 95. Leuven, Belgium, pp. 158–172.
Mitamura, T. and E. Nyberg: 2000, ‘Controlled Languages’. Technical report, Language Technologies Institute, Carnegie Mellon University, Pittsburgh, PA. AMTA 2000 Tutorial.
Multilint: n.d. Available on the Web at http://www.iai.uni-sb.de/en/multien.html (as of May 17, 2001).
Nyberg, E. H. and T. Mitamura: 1996, ‘Controlled Language and Knowledge-Based Machine Translation: Principles and Practice’. In: Proceedings of the First International Workshop on Controlled Language Applications, CLAW 96. Leuven, Belgium, pp. 137–142.
Quirk, R., S. Greenbaum, G. Leech, and J. Svartvik: 1972, A Grammar of Contemporary English. London, Longman.
Reuther, U.: 1998, ‘Controlling Language in an Industrial Application’. In: Proceedings of the Second International Workshop on Controlled Language Applications, CLAW 98. Pittsburgh, PA, pp. 174–184.
Reuther, U. and A. Schmidt-Wigger: 2000, ‘Designing a Multi-Purpose CL Application’. In: Proceedings of the Third International Workshop on Controlled Language Applications, CLAW 2000. Seattle, WA, pp. 72–82.
Schmidt-Wigger, A.: 1998, ‘Grammar and Style Checking for German’. In: Proceedings of the Second International Workshop on Controlled Language Applications, CLAW 98. Pittsburgh, PA, USA, pp. 76–85.
Schütz, J.: 2001, ‘Ontologies in Terminology Work. Enabling Controlled Authoring’. In: F. Steurs (ed.): Terminology in Advanced Microcomputer Applications; Sharing Terminological Knowledge; Terminology for Multilingual Content; (TAMA-2001). TermNet Publisher, Vienna, no page numbering.
Shehadeh, C. M. E. and J. B. Strother: 1994, ‘The Use of Computerized Readability Formulas: Bane or Blessing?’. In: Proceedings of the Society for Technical Communication Annual Conference. Minneapolis, MN. pp. 225–227.
Smart Communication Inc.: 1998, ‘MAXit: The SMART Expert Editor’. In: Proceedings of the Second International Workshop on Controlled Language Applications, CLAW 98. Pittsburgh, PA, p. 196.
Thurmair, G.: 1997, ‘Exchange Interfaces for Translation Tools’. In: MT Summit VI: Machine Translation Past Present Future. San Diego, CA, pp. 74–94.
Thurmair, G.: 2000a, ‘Text Handling Standard: OTEXT V323’. Translation Quality for Professionals (TQPro). Available on the Web at http://www.tqpro.de/dorsexternal/Otext-V323.pdf (as of May 18, 2001).
Thurmair, G.: 2000b, ‘TQPro: Quality Tools for the Translation Process’. In: Translating and the Computer 22. London, no page numbering.
Tomita, M.: 1986, ‘Sentence Disambiguation by Asking’. Computers and Translation 1, 39–52.
Underwood, N. L. and B. Jongejan: 2001, ‘Translatability Checker: A Tool to Help Decide Whether to Use MT’. In: MT Summit VIII: Machine Translation in the Information Age. Santiago, de Compostela, Spain, pp. 363–368.
Watanabe, H., K. Nagao, M. McCord, and A. Bernth: 2000, ‘Improving Natural Language Processing by Linguistic Document Annotation’. In: Coling 2000 Workshop on Semantic Annotation and Intelligent Content. Luxembourg, pp. 20–27.
Whitelock, P., M. M. Wood, B. Chandler, N. Holden, and H. Horsfall: 1986, ‘Strategies for Interactive Machine Translation: the experience and implications of the UMIST Japanese project’. In: 11th International Conference on Computational Linguistics, Proceedings of Coling 86. Bonn, pp. 329–334.
Winograd, T.: 1972, Understanding Natural Language. New York, NY, Academic Press.
Wojcik, R. H. and J. Hoard: 1997, ‘Controlled Languages in Industry’. In: R. A. Cole (ed.): Survey of the State of the Art in Human Language Technology. Cambridge, Cambridge University Press, pp. 238–239.
Wojcik, R. H., J. Hoard, and K. Holzhauser: 1990, ‘The Boeing Simplified English Checker’. In: Proceedings of International Conference. Human Machine Interaction and Artificial Intelligence in Aeronautics and Space. Toulouse, France, pp. 43–57.
Wojcik, R. H. and H. Holmback: 1996, ‘Getting a Controlled Language Off the Ground at Boeing’. In: Proceedings of the First International Workshop on Controlled Language Applications, CLAW 96. Leuven, Belgium, pp. 22–31.
Wojcik, R. H., H. Holmback, and J. Hoard: 1998, ‘Boeing Technical English: An Extension of AECMA beyond the Aircraft Maintenance Domaín’. In: Proceedings of the Second International Workshop on Controlled Language Applications, CLAW 98. Pittsburgh, PA, pp. 114–113.
Zar, Jerrold H.: 1994, ‘Candidate for a Pullet Surprise’, Journal of Irreproducible Results, Jan/Feb, p. 13; see also http://tenderbytes.net/rhymeworld/feeder/teacher/pullet.htm (as of May 13, 2002).
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Bernth, A., Gdaniec, C. MTranslatability. Machine Translation 16, 175–218 (2001). https://doi.org/10.1023/A:1019867030786
Issue Date:
DOI: https://doi.org/10.1023/A:1019867030786