MT model space: statistical versus compositional versus example-based machine translation

  • Example-Based MT
Machine Translation

Abstract

We offer a perspective on EBMT from a statistical MT standpoint, by developing a three-dimensional MT model space based on three pairs of definitions: (1) logical versus statistical MT, (2) schema-based versus example-based MT, and (3) lexical versus compositional MT. Within this space we consider the interplay of three key ideas in the evolution of transfer, example-based, and statistical approaches to MT. We depict how all translation models face these issues in one way or another, regardless of the school of thought, and suggest where the real questions for the future may lie.

Author information

Corresponding author

Correspondence to Dekai Wu.

About this article

Cite this article

Wu, D. MT model space: statistical versus compositional versus example-based machine translation. Machine Translation 19, 213–227 (2005). https://doi.org/10.1007/s10590-006-9009-3

  • DOI: https://doi.org/10.1007/s10590-006-9009-3
