Abstract
This paper describes an approach to enhance statistical machine translation. This approach uses a two-phase decoder system; the first decoder which we call the initial decoder translates from English to Arabic and the second is a post-processing decoder that re-translates the initial’s decoder Arabic output to Arabic again to fix some of the translation errors. This new technique showed to be useful when trying to translate corpus from a different context other than the original corpus used in the training of the initial decoder. We recorded a BLEU score enhancement on out-of-context corpus close to 10 BLEU points on UN corpus and 2 BLEU points on TED 2013 corpus.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Bisazza, A., Federico, M.: Chunk-based verb reordering in VSO sentences for Arabic-English statistical machine translation. In: Proceedings of the Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR, Stroudsburg, PA, USA, pp. 235–243 (2010)
Carpuat, M., Marton, Y., Habash, N.: Improving Arabic-to-English statistical machine translation by reordering post-verbal subjects for alignment. In: Proceedings of the ACL 2010 Conference Short Papers, pp. 178–183 (2010)
Salameh, M., Cherry, C., Kondrak, G.: Reversing morphological tokenization in English-to-Arabic SMT. In: HLT-NAACL, pp. 47–53 (2013)
Khemakhem, I.T., Jamoussi, S., Hamadou, A.B.: Arabic-English semantic class alignment to improve statistical machine translation. In: Recent Advances in Natural Language Processing, p. 663, September 2015
Och, F.J., Ney, H.: A systematic comparison of various statistical alignment models. Comput. Linguist. 29(1), 19–51 (2003)
Almahairi, A., Cho, K., Habash, N., Courville, A.: First result on Arabic neural machine translation. arXiv:1606.02680 [cs], June 2016
Xiao, T., Zhu, J., Liu, T.: Bagging and Boosting statistical machine translation systems. Artif. Intell. 195, 496–527 (2013)
El Kholy, A., Habash, N.: Orthographic and morphological processing for English-Arabic statistical machine translation. Mach. Transl. 26(1–2), 25–45 (2012)
Kholy, A.E., Habash, N.: Techniques for Arabic morphological detokenization and orthographic denormalization (2011)
Manning, C.D., Surdeanu, M., Bauer, J., Finkel, J.R., Bethard, S., McClosky, D.: The stanford CoreNLP natural language processing toolkit. In: ACL (System Demonstrations), pp. 55–60 (2014)
Federico, M., Bertoldi, N., Cettolo, M.: IRSTLM: an open source toolkit for handling large scale language models. In: Interspeech, pp. 1618–1621 (2008)
Koehn, P., et al.: Moses: open source toolkit for statistical machine translation. In: Proceedings of the 45th Annual Meeting of the ACL on Interactive Poster and Demonstration Sessions, Stroudsburg, PA, USA, pp. 177–180 (2007)
Spence Green, D.C., Manning, C.D.: Phrasal: a toolkit for new directions in statistical machine translation. In: Proceedings of the Ninth Workshop on Statistical Machine Translation, pp. 114–121 (2014)
Ziemski, M., Junczys-Dowmunt, M., Pouliquen, B.: The United Nations Parallel Corpus v1. 0. In: LREC (2016)
Tiedemann, J.: Parallel data, tools and interfaces in OPUS. In: Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC 2012) (2012)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
ElMaghraby, A., Rafea, A. (2019). Enhancing Translation from English to Arabic Using Two-Phase Decoder Translation. In: Arai, K., Kapoor, S., Bhatia, R. (eds) Intelligent Systems and Applications. IntelliSys 2018. Advances in Intelligent Systems and Computing, vol 868. Springer, Cham. https://doi.org/10.1007/978-3-030-01054-6_39
Download citation
DOI: https://doi.org/10.1007/978-3-030-01054-6_39
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-01053-9
Online ISBN: 978-3-030-01054-6
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)