Abstract
We describe OpenMaTrEx, a free/open-source example-based machine translation (EBMT) system based on the marker hypothesis. It comprises a marker-driven chunker, a collection of chunk aligners, and two engines: one based on a simple proof-of-concept monotone EBMT recombinator and the other on a Moses-based statistical decoder. OpenMaTrEx is a free/open-source release of the basic components of MaTrEx, the Dublin City University machine translation system.
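To make the idea of marker-driven chunking concrete, the following is a minimal sketch and not the OpenMaTrEx implementation. It assumes a common reading of the marker hypothesis: closed-class "marker" words (determiners, prepositions, conjunctions, pronouns) signal chunk boundaries, and a new chunk is opened at a marker word only once the current chunk already contains a non-marker word. The `MARKERS` set and the `marker_chunk` function are hypothetical names introduced purely for illustration.

```python
# Hypothetical, deliberately tiny marker-word list (an assumption for this
# sketch); a real system would use full closed-class word lists per language.
MARKERS = {"the", "a", "an", "in", "on", "of", "to", "with", "and", "he", "she", "it"}


def marker_chunk(tokens, markers=MARKERS):
    """Split a token list into chunks that begin at marker words.

    A new chunk is opened at a marker word only if the current chunk
    already holds at least one non-marker word, so consecutive markers
    stay together and every closed chunk contains some content word.
    """
    chunks = []
    current = []
    has_content = False  # does `current` contain a non-marker word yet?
    for tok in tokens:
        is_marker = tok.lower() in markers
        if is_marker and has_content:
            # Close the current chunk and start a new one at this marker.
            chunks.append(current)
            current = []
            has_content = False
        current.append(tok)
        if not is_marker:
            has_content = True
    if current:
        chunks.append(current)
    return chunks


if __name__ == "__main__":
    for chunk in marker_chunk("the cat sat on the mat with a hat".split()):
        print(" ".join(chunk))
```

In a setup like the one the abstract describes, chunks produced this way on the source and target sides of a parallel corpus would then be passed to a chunk aligner; that step is not sketched here.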
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Dandapat, S., Forcada, M.L., Groves, D., Penkale, S., Tinsley, J., Way, A. (2010). OpenMaTrEx: A Free/Open-Source Marker-Driven Example-Based Machine Translation System. In: Loftsson, H., Rögnvaldsson, E., Helgadóttir, S. (eds) Advances in Natural Language Processing. NLP 2010. Lecture Notes in Computer Science, vol 6233. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-14770-8_15
DOI: https://doi.org/10.1007/978-3-642-14770-8_15
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-14769-2
Online ISBN: 978-3-642-14770-8
eBook Packages: Computer Science (R0)