Abstract
Neural Machine Translation (NMT) has brought promising improvements in translation quality, but these models depend on large-scale parallel corpora. Since such corpora exist for only a handful of language pairs, translation performance falls far short of expectations for the majority of low-resource languages. Developing low-resource translation techniques is therefore crucial, and it has become a popular research area in neural machine translation. In this article, we present a comprehensive review of existing deep learning techniques for low-resource NMT. We first describe the current state of research and some widely used low-resource datasets. We then categorize the existing methods and discuss representative works in detail. Finally, we summarize the characteristics they share and outline future directions in this field.
Index Terms
- Low-resource Neural Machine Translation: Methods and Trends