Neural Machine Translation: Assamese–Bengali

Laskar, Sahinur Rahman; Pakray, Partha; Bandyopadhyay, Sivaji

doi:10.1007/978-981-15-9829-6_45

Neural Machine Translation: Assamese–Bengali

Sahinur Rahman Laskar⁷,
Partha Pakray⁷ &
Sivaji Bandyopadhyay⁷

Conference paper
First Online: 18 March 2021

721 Accesses
3 Citations

Part of the book series: Smart Innovation, Systems and Technologies ((SIST,volume 206))

Abstract

Neural machine translation (NMT) is a state-of-the-art technique in the task of machine translation (MT), where a source-language text is converted into a target language text while preserving its meaning. NMT attracts attention because it handles sequence to sequence learning problems for variable-length source and target sentences. With the attention mechanism, the NMT system performs well in the context-analyzing ability. But it needs sufficient parallel training corpus, which is a challenge in low resource language scenario. To overcome the bar of a handy parallel corpus, there is an increase in demand for direct translation among similar language pairs. This paper investigates the NMT system for direct translation of low resource similar language pair: Assamese–Bengali. The main contribution of this work is Assamese–Bengali parallel corpus. The NMT system has achieved a bilingual evaluation understudy (BLEU) score of 7.20 for Assamese to Bengali translation and BLEU score 10.10 for Bengali to Assamese translation, respectively.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 259.00; Price excludes VAT (USA)

Softcover Book: USD 329.99; Price excludes VAT (USA)

Hardcover Book: USD 329.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

References

Bahdanau, D., Cho, K., Bengio, Y.: Neural machine translation by jointly learning to align and translate. In: Bengio, Y., LeCun, Y. (eds.) 3rd International Conference on Learning Representations, ICLR 2015, San Diego, CA, USA, May 7-9, 2015, Conference Track Proceedings (2015). http://arxiv.org/abs/1409.0473
Cho, K., van Merriënboer, B., Bahdanau, D., Bengio, Y.: On the properties of neural machine translation: Encoder–decoder approaches. In: Proceedings of SSST-8, Eighth Workshop on Syntax, Semantics and Structure in Statistical Translation, pp. 103–111. Association for Computational Linguistics, Doha, Qatar (2014). https://doi.org/10.3115/v1/W14-4012
Cho, K., van Merriënboer, B., Gulcehre, C., Bahdanau, D., Bougares, F., Schwenk, H., Bengio, Y.: Learning phrase representations using RNN encoder–decoder for statistical machine translation. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 1724–1734. Association for Computational Linguistics, Doha, Qatar (2014). https://doi.org/10.3115/v1/D14-1179
Ghosh, S., Girish, K.V.V., Sreenivas, T.: Relationship between indian languages using long distance bi-gram language models. In: Proceedings of ICON 2011, 9th International Conference on Natural Language Processing, pp. 104–113. Macmillan (2011)
Google Scholar
Josan, G., Lehal, G.: A Punjabi to Hindi machine translation system, pp. 157–160 (2008)
Google Scholar
Klein, G., Kim, Y., Deng, Y., Senellart, J., Rush, A.: OpenNMT: Open-source toolkit for neural machine translation. In: Proceedings of ACL 2017, System Demonstrations, pp. 67–72. Association for Computational Linguistics, Vancouver, Canada (2017). https://www.aclweb.org/anthology/P17-4012
Koehn, P.: Statistical Machine Translation. Cambridge University Press (2009). https://doi.org/10.1017/CBO9780511815829
Koehn, P., Hoang, H., Birch, A., Callison-Burch, C., Federico, M., Bertoldi, N., Cowan, B., Shen, W., Moran, C., Zens, R., Dyer, C., Bojar, O., Constantin, A., Herbst, E.: Moses: Open source toolkit for statistical machine translation. In: Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics Companion Volume Proceedings of the Demo and Poster Sessions, pp. 177–180. Association for Computational Linguistics, Prague, Czech Republic (2007). https://www.aclweb.org/anthology/P07-2045
Laskar, S.R., Dutta, A., Pakray, P., Bandyopadhyay, S.: Neural machine translation: English to Hindi. In: 2019 IEEE Conference on Information and Communication Technology, pp. 1–6 (2019)
Google Scholar
Laskar, S.R., Pakray, P., Bandyopadhyay, S.: Neural machine translation: Hindi-Nepali. In: Proceedings of the Fourth Conference on Machine Translation (Volume 3: Shared Task Papers, Day 2), pp. 202–207. Association for Computational Linguistics, Florence, Italy (2019). https://doi.org/10.18653/v1/W19-5427
Luong, T., Pham, H., Manning, C.D.: Effective approaches to attention-based neural machine translation. In: Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, pp. 1412–1421. Association for Computational Linguistics, Lisbon, Portugal (2015). https://doi.org/10.18653/v1/D15-1166
Papineni, K., Roukos, S., Ward, T., Zhu, W.J.: Bleu: A method for automatic evaluation of machine translation. In: Proceedings of the 40th Annual Meeting on Association for Computational Linguistics, ACL ’02, pp. 311–318. Association for Computational Linguistics, Stroudsburg, PA, USA (2002). https://doi.org/10.3115/1073083.1073135
Pathak, A., Pakray, P.: Neural machine translation for Indian languages. J. Intell. Syst. pp. 1–13 (2018). https://doi.org/10.1515/jisys-2018-0065
Pathak, A., Pakray, P., Bentham, J.: English–Mizo machine translation using neural and statistical approaches. Neur. Comput. Appl. 30, 1–17 (2018). https://doi.org/10.1007/s00521-018-3601-3
Saini, S., Sahula, V.: Neural machine translation for English to Hindi, pp. 1–6 (2018). https://doi.org/10.1109/INFRKM.2018.8464781
Sutskever, I., Vinyals, O., Le, Q.V.: Sequence to sequence learning with neural networks. In: Ghahramani, Z., Welling, M., Cortes, C., Lawrence, N.D., Weinberger K.Q. (eds.) Advances in Neural Information Processing Systems, vol. 27, pp. 3104–3112. Curran Associates, Inc. (2014). http://papers.nips.cc/paper/5346-sequence-to-sequence-learning-with-neural-networks.pdf

Download references

Acknowledgements

We would like to thank Department of Computer Science and Engineering and Center for Natural Language Processing (CNLP) at National Institute of Technology Silchar for providing the requisite support and infrastructure to execute this work.

Author information

Authors and Affiliations

Department of Computer Science and Engineering, National Institute of Technology Silchar, Assam, India
Sahinur Rahman Laskar, Partha Pakray & Sivaji Bandyopadhyay

Authors

Sahinur Rahman Laskar
View author publications
You can also search for this author in PubMed Google Scholar
Partha Pakray
View author publications
You can also search for this author in PubMed Google Scholar
Sivaji Bandyopadhyay
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Sahinur Rahman Laskar .

Editor information

Editors and Affiliations

Department of Mechanical Engineering, National Institute of Technology, Silchar, Assam, India
Biplab Das
Department of Computer Science and Engineering, National Institute of Technology Silchar, Silchar, Assam, India
Ripon Patgiri
National Institute of Technology Silchar, Silchar, India
Sivaji Bandyopadhyay
Aurel Vlaicu University of Arad, Arad, Romania
Valentina Emilia Balas

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Laskar, S.R., Pakray, P., Bandyopadhyay, S. (2021). Neural Machine Translation: Assamese–Bengali. In: Das, B., Patgiri, R., Bandyopadhyay, S., Balas, V.E. (eds) Modeling, Simulation and Optimization. Smart Innovation, Systems and Technologies, vol 206. Springer, Singapore. https://doi.org/10.1007/978-981-15-9829-6_45

Download citation

DOI: https://doi.org/10.1007/978-981-15-9829-6_45
Published: 18 March 2021
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-9828-9
Online ISBN: 978-981-15-9829-6
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics