Abstract
Machine Translation (MT) being an emerging area of research has currently gained significant attention from researchers primarily because of incredible features offered by contemporary data scientific approaches such as machine learning, deep learning & transfer learning, etc. Machine Translation is the automatic conversion of a text while preserving its underlying information content form and across various languages. Various approaches having different levels of efficiency and accuracy that can be applied to perform machine translation from one language to another include Direct, Rule-based, and Corpus-based approaches. India being a diverse country in terms of its cultural heritage with a huge population, plenty of information related to different spheres of life is being exchanged between people of various regions & communities in different languages on daily basis. It is therefore imperative to come up with automated translation tools capable of translating among these languages. In this paper, a study of various languages spoken in India with special reference to those listed among scheduled languages and machine translation systems designed for them is presented. The languages with low translation status have also been highlighted. The work presented in this paper will be of great benefit to prospective researchers working in the area of machine translation. It will certainly help them in gaining a better insight about the previous research developments in this area and lead them towards the design, development & implementation of improved and effective translation systems in the future.
Similar content being viewed by others
Data availability
Data sharing not applicable to this article as no datasets were generated or analysed during the current study.
References
Ali A, Siddiq S, Malik MK (2010) Development of parallel corpus and english to urdu statistical machine translation. Int J Eng Technol 10:31–33
Andrabi SAB, Wahid A (2022) Machine translation system using deep learning for English to Urdu. Comput Intell Neurosci 2022:1–11
Anju ES, Manoj Kumar KV (2014) Malayalam to English machine translation: an EBMT system. IOSR Journal of Engineering (IOSRJEN) 4(1):18–23
ANUVADAKSH – An expert English to Indian language machine translation system. http://www.tdil-dc.in/tdildcMain/IPR/Anuvaadaaksh.pdf. Accessed 9 Oct 2021
Ashrafi SS, Kabir MH, Anwar MM, Noman AKM (2013) English to Bangla machine translation system using context-free grammars. Int J Comput Sci Issues 10(3):144
Ata N, Jawaid B, Kamran A (2007) Rule based English to Urdu machine translation. In: Proceedings of Conference on Language and Technology (CLT’07)
Bahadur P, Jain AK, Chauhan DS (2012) EtranS-A complete framework for English to Sanskrit machine translation. In: International Journal of Advanced Computer Science and Applications (IJACSA) from International Conference and workshop on Emerging Trends in Technology
Bal S, Mahanta S, Mandal L, Parekh R (2019) Bilingual machine translation: English to bengali. In: Proceedings of international ethical hacking conference 2018. Springer, Singapore, pp 247–259
Bansal M, Lobiyal DK (2021) Multilingual sequence to sequence convolutional machine translation. Multimed Tools Appl 80(25):33701–33726
Baruah KK, Das P, Hannan A, Sarma SK (2014) Assamese-English bilingual machine translation. arXiv preprint arXiv:1407.2019
Basavaraddi MCCS, Shashirekha DH (2014) A typical machine translation system for English to Kannada. Int J Sci Eng Res 5(4)
Batra KK, Lehal GS (2010) Rule based machine translation of noun phrases from Punjabi to English. Int J Comput Sci Issues 7(5):409
Behera P, Singh R, Jha GN (2016) Evaluation of Anuvadaksh (EILMT) English-Odia machine-assisted translation tool. WILDRE: LREC
Bharati A, Chaitanya V, Kulkarni AP, Sangal R (1997) Anusaaraka: Machine translation in stages. VIVEK-BOMBAY 10:22–25
Bharati A, Chaitanya V, Kulkarni AP, Sangal R, Rao GU (2003) ANUSAARAKA: overcoming the language barrier in India. arXiv preprint cs/0308018
Bharati RM, Reddy P, Sankar B, Sharma DM, Sangal R (2003) Machine translation: the Shakti approach. Pre-conference tutorial. ICON
Brown PF, Cocke J, Della Pietra SA, Della Pietra VJ, Jelinek F, Lafferty J … Roossin PS (1990) A statistical approach to machine translation. Comput linguistics 16(2):79–85
Chatterji S, Roy D, Sarkar S, Basu A (2009) A hybrid approach for Bengali to Hindi machine translation. In: Proceedings of ICON-2009: 7th international conference on natural language processing, pp 81–91
Choudhary A, Singh M (2009) GB theory based Hindi to English translation system. In: 2009 2nd IEEE International Conference on Computer Science and Information Technology. IEEE, pp 293–297
Choudhary H, Pathak AK, Saha RR, Kumaraguru P (2018) Neural machine translation for English-Tamil. In: Proceedings of the Third Conference on Machine Translation: Shared Task Papers, pp 770–775
Cieślak M (2017) The scope and limits of machine translation. ElingUP: Revista eletrónica de
Computational Linguistics R & D. http://sanskrit.jnu.ac.in/index.jsp. Accessed 11 Nov 2021
Constitutional provisions relating to Eighth Schedule. https://www.mha.gov.in/sites/default/files/EighthSchedule_19052017.pdf. Accessed 7 Nov 2021
Daimary M, Sarma SK, Rahman M (2019) Bodo to english statistical machine translation system
Das AK, Pradhan M, Dash AK, Pradhan C, Das H (2018) A constructive machine translation system for English to Odia translation. In 2018 International Conference on Communication and Signal Processing (ICCSP). IEEE, pp 0854–0857
Dave S, Parikh J, Bhattacharyya P (2001) Interlingua-based English-Hindi machine translation and language divergence. Mach Transl 16(4):251–304
Dhar AC, Roy A, Akhand MAH, Kamal MAS, Siddique N (2021) Bangla↔English machine translation using attention-based multi-headed transformer model. J Comput Sci 17(10):1000–1010. https://doi.org/10.3844/jcssp.2021.1000.1010
Dubey P (2013) Machine translation system for Hindi-Dogri language pair. In: 2013 international conference on machine intelligence and research advancement. IEEE, pp 422–425
Dubey P (2020) The Hindi to Dogri machine translation system. In: Proceedings of the 17th International Conference on Natural Language Processing (ICON): system demonstrations, pp 19–20
Dwivedi SK, Sukhadeve PP (2010) Machine translation system in indian perspectives. J Comput Sci 6(10):1111
Garje GV, Kharate GK, Harshad K (2014) Transmuter: an approach to rule-based English to Marathi machine translation. Int J Comput Appl 975:8887
George A (2013) English to Malayalam statistical machine translation system. Int J Eng Res Technol 2(7):640–647
Ghosh S, Thamke S (2014) Translation of Telugu-Marathi and vice-versa using rule based machine translation. arXiv preprint arXiv:1406.3969
Godase A, Govilkar S (2015) A novel approach for rule based translation of English to Marathi. Advanced Computational Intelligence: An International Journal 2(4)
Gore L, Patil N (2002) English to Hindi-translation system. In: Proceedings of symposium on translation support systems, pp 178–184
Goyal V, Lehal GS (2009) Advances in machine translation systems. Language In India 9(11):138–150
Goyal V, Lehal GS (2009) Evaluation of Hindi to Punjabi machine translation system. arXiv preprint arXiv:0910.1868
Goyal V, Lehal GS (2010) Web based Hindi to Punjabi machine translation system. J Emerg Technol Web Intell 2(2):148–151
Goyal V, Lehal GS (2011) Hindi to Punjabi machine translation system. In: international conference on information systems for Indian languages. Springer, Berlin, pp 236–241
Goyal V, Sharma DM (2019) The IIIT-H Gujarati-English machine translation system for WMT19. In: Proceedings of the fourth conference on machine translation, volume 2: shared task papers, day 1, pp 191–195
Haroon RP, Shaharban TA (2016) Malayalam machine translation using hybrid approach. In: 2016 International Conference on Electrical, Electronics, and Optimization Techniques (ICEEOT). IEEE, pp 1013–1017
Hasan MA, Alam F, Chowdhury SA, Khan N (2019) Neural machine translation for the Bangla-English language pair. In: 2019 22nd International Conference on Computer and Information Technology (ICCIT). IEEE, pp 1–6
Hybrid Approach to English-Konkani Machine Translation. https://www.slideshare.net/sunayanagawde/hybrid-approach-to-englishkonkani-machine-translation. Accessed 2 Nov 2021
Ishfaq M, Bansal K (2015) English to Kashmiri translation systemle based machine translation. Int J Adv Res Comput Sci Technol 3(2):193–194
Ishfaq M, Bansal K (2015) English to Kashmiri translation system: using example based machine translation. Int J Innov Res Comput Commun Eng 3(10):193–194
Islam MS, Purkayastha BS (2018) English to Bodo phrase-based statistical machine translation. In: Advanced computing and communication technologies. Springer, Singapore, pp 207–217
Jindal S, Goyal V, Bhullar JS (2018) English to Punjabi statistical machine translation using moses (Corpus Based). J Stat Manag Syst 21(4):553–560
Josan GS, Lehal GS (2008) A Punjabi to Hindi machine translation system. In: Coling 2008: companion volume: demonstrations, pp 157–160
Karamat N (2006) Verb transfer for English to Urdu machine translation (Using Lexical Functional Grammar (LFG)). Unpublished MS Thesis, National University of Computer & Emerging Sciences, Lahore, Pakistan
Kaur H, Laxmi DV (2013) A web based English to Punjabi MT system for news headlines. Int J Adv Res Comput Sci Softw Eng 3(6):1092–1094
Khan N, Anwar W, Bajwa UI, Durrani N (2013) English to urdu hierarchical phrase-based statistical machine translation. In: Proceedings of the 4th Workshop on South and Southeast Asian Natural Language Processing, pp 72–76
Khan S, Mishra RB (2011) Translation rules and ANN based model for English to Urdu machine translation. INFOCOMP 10(3):36–47
Khan SN, Usman I (2019) A model for English to Urdu and Hindi machine translation system using translation rules and artificial neural network. Int Arab J Inf Technol 16(1):125–131
Kshirsagar SS (n.d.) Machine translation projects in India: current status and future prospects
Kumar A, Goyal V (2018) Hindi to Punjabi machine translation system based on statistical approach. J Stat Manag Syst 21(4):547–552
Lakshmikanth MG, Tech M (2016) An approach for Telugu to English phrase based statistical machine translation system
Laskar SR, Pakray P, Bandyopadhyay S (2019) Neural machine translation: Hindi-Nepali. In: Proceedings of the fourth conference on machine translation, volume 3: shared task papers, day 2, pp 202–207
Laskar SR, Paul B, Adhikary PK, Pakray P, Bandyopadhyay S (2021) Neural Machine Translation for Tamil–Telugu Pair. In: Proceedings of the sixth conference on machine translation, pp 284–287
Lone NA, Giri KJ, Bashir R (2022) Machine intelligence for language translation from Kashmiri to English. J Inf Knowl Manag 2250074. https://doi.org/10.1142/S0219649222500745
Machine Translation. https://sites.google.com/site/profrmksinha/research-projects/machine-translation. Accessed 18 Nov 2021
Machine translation: a short overview. https://towardsdatascience.com/machine-translation-a-short-overview-91343ff39c9f. Accessed 8 Nov 2021
Machine Translation - The saara Approach. http://languagetechnologies.uohyd.ac.in/saara-ppt.pdf. Accessed 18 Nov 2021
Machine Translation System for English to Konkani. https://technirmitisoftwares.com/blog/index.php/2019/11/19/machine-translation-system-for-english-to-konkani/. Accessed 11 Nov 2021
Mahesh R, Sinha K (2009) Developing english-urdu machine translation via hindi. In: Proceedings of the Third Workshop on Computational Approaches to Arabic-Script-based Languages (CAASL3)
MANTRA–Rajbhasha. https://www.cdac.in/index.aspx?id=mc_mat_mantra_rajbhasha. Accessed 12 Nov 2020
Mishra V, Mishra RB (2010) ANN and Rule based model for English to Sanskrit machine translation. INFOCOMP J Comput Sci 9(1):80–89
Moudgil MR (n.d.) Development of the dogri hindi statistical machine translation system
Mujadia V, Sharma DM (2021). English-Marathi neural machine translation for LoResMT 2021. In: Proceedings of the 4th workshop on technologies for MT of Low Resource Languages (LoResMT2021), pp 151–157
Mumbai CDAC (2008) MaTra: an English to Hindi machine translation system. a report by CDAC Mumbai formerly NCST
Murthy K (2002) MAT: a machine assisted translation system. In Proceedings of symposium on translation support system (STRANS-2002). IIT Kanpur, pp 134–139
Murthy BK, Deshpande WR (1998) Language technology in India: past, present, and the future. In: Proceedings of the SAARC Conference on extending the use of Multilingual and Multimedia Information Technology (EMMIT’98), Pune
Nair LR, Peter D, Ravindran RP (2012) Design and development of a Malayalam to English translator-a transfer based approach. Int J Comput Linguist 3(1):1–11
Narzary S, Brahma M, Singha B, Brahma R, Dibragede B, Barman S, … Som B (2019) Attention based English-bodo neural machine translation system for tourism domain. In: 2019 3rd International Conference on Computing Methodologies and Communication (ICCMC). IEEE, pp 335–343
Naskar SK, Bandyopadhyay S (2005) Use of machine translation in India: current status. In: Proceedings of Machine Translation Summit X: Posters, pp 465–470
Nidhi R, Singh T (2018) English-Maithili machine translation and divergence. In: 2018 7th International Conference on Reliability, Infocom Technologies and Optimization (Trends and Future Directions)(ICRITO). IEEE, pp 775–778
Nidhi R, Singh T (2020) SMT Algorithms for Indian languages-a case study of Moses and MT Hub for English-Maithili language pair. In: Proceedings of ICETIT 2019. Springer, Cham, pp 269–279
Pandey RK, Jha GN (2016) Error analysis of sahit-a statistical sanskrit-hindi translator. Procedia Comput Sci 96:495–501
Pandian LS, Kadhirvelu K (2012) Machine translation from English to Tamil using hybrid technique. Int J Comput Appl 46:36–42
Pathak GR, Godse SP (2010) English to Sanskrit machine translation using transfer based approach. In AIP conference proceedings, vol 1324, no 1. American Institute of Physics, pp 122–126
Patil SP, Prasad JR (2020) deep learning approach in machine translation for indian context: a survey and discussion. Int J Adv Sci Technol 29(05):364–370. Retrieved from http://sersc.org/journals/index.php/IJAST/article/view/8985
Paul A, Purkayastha BS (2016) Current status of machine translation: Indian perspective. Int J Curr Eng Sci Res (IJCESR) 3(8):62–67
Paul A, Purkayastha BS (2018) English to Nepali statistical machine translation system. In: Proceedings of the international conference on computing and communication systems. Springer, Singapore, pp 423–431
PM Modi Advocates for MBBS In Mother-Tongue, Doctors Oppose Move. https://medicaldialogues.in/news/education/pm-modi-advocates-for-mbbs-in-mother-tongue-doctors-oppose-move-89657. Accessed 21 Nov 2022
Prasad TV, Muthukumaran GM (2013) Telugu to English translation using direct machine translation approach. Int J Sci Eng Investig 2(12):25–32
Rathod SG, Sondur S (2012) English to Sanskrit Translator and synthesizer (ETSTS). Int J Emerg Technol Adv Eng 2(12):379–383
Raulji JK, Saini JR (2019) Sanskrit-Gujarati constituency mapper for machine translation system. In: 2019 IEEE Bombay Section Signature Conference (IBSSC). IEEE, pp 1–8
Rawat DS (2015) Survey on machine translation approaches used in India. Int J Eng Tech Res 3(6):36–39
Reddy MV, Hanumanthappa DM (n.d.) Machine translation for English to Kannada
Saini S, Sahula V (2015) A survey of machine translation techniques and systems for indian languages. In: 2015 IEEE international conference on computational intelligence & communication technology. IEEE, pp 676–681
Sampark: Indian Language MT System. http://tdildc.in/index.php?option=com_vertical&parentid=74. Accessed 1 Nov 2021
Sebastian MP, Kumar GS (2010) English to Malayalam translation: a statistical approach. In: Proceedings of the 1st Amrita ACM-W Celebration on Women in Computing in India, pp 1–5
Seven decades after Independence, many small languages in India face extinction threat. https://economictimes.indiatimes.com/news/politics-and-nation/seven-decades-after-independence-many-small-languages-in-india-facing-extinction-threat/articleshow/60038323.cms. Accessed 6 Nov 2021
Shah P, Bakrola V (2019) Neural machine translation system of indic languages-an attention based approach. In: 2019 Second International Conference on Advanced Computational and Communication Paradigms (ICACCP). IEEE
Sindhu DV, Sagar BM (2017) Dictionary based machine translation from Kannada to Telugu. In: IOP conference series: materials science and engineering, vol 12182
Singh TD, Bandyopadhyay S (2010) Manipuri-English example based machine translation system. Int J Comput Linguistics Appl 1(1–2):201–216
Singh TD, Bandyopadhyay S (2010) Manipuri-English bidirectional statistical machine translation systems using morphology and dependency relations. In: Proceedings of the 4th workshop on syntax and structure in statistical translation, pp 83–91
Singh SM, Singh TD (2022) Low resource machine translation of english–manipuri: a semi-supervised approach. Exp Syst Appl 209:118187
Singh MT, Borgohain R, Gohain S (2014) An English-assamese machine translation system. Int J Comput Appl 93(4)
Singh U, Goyal V, Lehal GS (2016) Urdu to Punjabi machine translation: an incremental training approach. Int J Adv Comput Sci Appl 7(4):227–237
Singh M, Kumar R, Chana I (2019) GA-based machine translation system for Sanskrit to Hindi language. In: Recent trends in communication, computing, and electronics. Springer, Singapore, pp 419–427
Singh M, Kumar R, Chana I (2020) Corpus based machine translation system with deep neural network for Sanskrit to Hindi translation. Procedia Comput Sci 167:2534–2544
Sinha RMK, Jain A (2003) AnglaHindi: an English to Hindi machine-aided translation system. MT Summit IX, New Orleans, pp 494–497
Sinha RMK, Thakur A (2005) Machine translation of bi-lingual hindi-english (hinglish) text. 10th Machine Translation summit (MT Summit X), Phuket, Thailand, pp 149–156
Sinhal RA, Gupta KO (2014) A pure EBMT approach for English to Hindi sentence translation system. Int J Mod Educ Comput Sci 6(7):1–8
Sreedeepa HS, Idicula SM (2017) Interlingua based Sanskrit-English machine translation. In: 2017 International Conference on Circuit, Power and Computing Technologies (ICCPCT). IEEE, pp 1–5
Sreelekha S, Bhattacharyya P (2018) Morphology injection for English-Malayalam statistical machine translation. In: Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018).
Sridhar R, Sethuraman P, Krishnakumar K (2016) English to Tamil machine translation system using universal networking language. Sādhanā 41(6):607–620
Statmt. http://kbcs.in/projects.html#Statmt. Accessed 1 Nov 2021
Sutskever I, Vinyals O, Le QV (2014) Sequence to sequence learning with neural networks. Adv Neural Inf Process Syst 27:3104–3112
Tathe NS, Kriplani JS, Deshmukh SS, Sadrani NK, Panjwani MA (2018) EBMT Sindhi to Hindi sentence translation system
The official language policy of the union. https://rajbhasha.gov.in/en/official-language-policy-union. Accessed 5 Nov 2021
The saara Machine Translation Systems. http://languagetechnologies.uohyd.ac.in/mt.html. Accessed 12 Oct 2021
Thenmozhi D, Kumar BS, Aravindan C (2018) Deep learning approach to English-Tamil and Hindi-Tamil verb phrase translations. In: FIRE (Working Notes), pp 323–331
Upadhyay P, Jaiswal UC, Ashish K (2014) Transish: translator from Sanskrit to English-a rule based machine translation. Int J Curr Eng Technol E-ISSN, 2277-4106
Vasuki DG, Sankaravelayuthan PR (2019) English to Tamil machine translation system using parallel corpus. Language in India www.languageinindia.com, 19(5).
Vijayanand K, Choudhury SI, Ratna P (2002) Vaasaanubaada: automatic machine translation of bilingual bengali-assamese news texts. In: Language engineering conference, 2002. Proceedings. IEEE, pp 183–188
Warhade SR, Patil SH, Devale PR (2012) English-to-sanskrit statistical machine translation with ubiquitous application. Int J Comput Appl 51(1)
Xuan HW, Li W, Tang GY (2012) An advanced review of hybrid machine translation (HMT). Procedia Eng 29:3017–3022
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
There is no conflict of interest to disclose.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Lone, N.A., Giri, K.J. & Bashir, R. Machine translation status of Indian scheduled languages: A survey. Multimed Tools Appl 82, 45145–45173 (2023). https://doi.org/10.1007/s11042-023-15287-z
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-023-15287-z