Speech translation system for english to dravidian languages

Sangeetha, J.; Jothilakshmi, S.

doi:10.1007/s10489-016-0846-3

Published: 17 September 2016

Volume 46, pages 534–550, (2017)
Applied Intelligence

J. Sangeetha¹ &
S. Jothilakshmi²

Abstract

This paper proposes a Speech-to-Speech Translation (SST) system that translates from English into two Dravidian languages, Tamil and Malayalam. The system has three major components: automatic continuous speech recognition, machine translation, and text-to-speech synthesis. Automatic Continuous Speech Recognition (CSR) was developed using an Auto Associative Neural Network (AANN), a Support Vector Machine (SVM), and a Hidden Markov Model (HMM). The HMM yielded better results than the SVM and the AANN, so the HMM-based recognizer was adopted for English speech. We propose a hybrid Machine Translation (MT) system, combining rule-based and statistical approaches, to convert English text into Dravidian-language text. We also propose a syllable-based concatenative Text-To-Speech (TTS) synthesis system for Tamil and Malayalam. For Tamil, AANN-based prosody prediction is used to improve the naturalness and intelligibility of the synthesized speech. The domain is restricted to announcements made at railway stations, bus stops, and airports. The work presents a novel translation method for English to Dravidian languages. The improved performance of each module (HMM-based CSR, hybrid MT, and concatenative TTS) increases the overall speech translation performance. The proposed system can be extended from English to any other Indian language, provided a parallel corpus is created for that language and the system is trained on it.
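
The three-stage cascade described in the abstract (CSR, then MT, then TTS) can be pictured with a minimal Python sketch. This is only an illustration of how the stages chain together, not the authors' implementation: the class `SpeechTranslator`, the stage signatures, and the three placeholder functions are all hypothetical, and the placeholders merely tag or pass through their input rather than performing real recognition, translation, or synthesis.

```python
from dataclasses import dataclass
from typing import Callable

# Hypothetical stage signatures; the abstract does not specify the real interfaces.
Recognizer = Callable[[bytes], str]        # English audio -> English text (CSR)
Translator = Callable[[str, str], str]     # (English text, target language) -> target text (MT)
Synthesizer = Callable[[str, str], bytes]  # (target text, target language) -> audio (TTS)

# The two target languages named in the abstract.
SUPPORTED_TARGETS = ("tamil", "malayalam")


@dataclass(frozen=True)
class SpeechTranslator:
    """Chains the three stages so each one consumes the previous stage's output."""

    recognize: Recognizer
    translate: Translator
    synthesize: Synthesizer

    def run(self, english_audio: bytes, target: str) -> bytes:
        if target not in SUPPORTED_TARGETS:
            raise ValueError(f"unsupported target language: {target}")
        english_text = self.recognize(english_audio)        # stage 1: CSR
        target_text = self.translate(english_text, target)  # stage 2: MT
        return self.synthesize(target_text, target)         # stage 3: TTS


# Placeholder stages (assumptions for illustration only, not real models).
def fake_recognize(audio: bytes) -> str:
    # Pretends the audio bytes already hold the spoken English text.
    return audio.decode("utf-8")


def fake_translate(text: str, target: str) -> str:
    # Tags the text with the target language instead of translating it.
    return f"[{target}] {text}"


def fake_synthesize(text: str, target: str) -> bytes:
    # Returns the text as bytes instead of a synthesized waveform.
    return text.encode("utf-8")


pipeline = SpeechTranslator(fake_recognize, fake_translate, fake_synthesize)
result = pipeline.run(b"the train is late", "tamil")
print(result)  # b'[tamil] the train is late'
```

Because each stage is passed in as a plain callable, one module can be swapped (for example, a different recognizer) without touching the other two, which mirrors the abstract's point that improving each module improves the overall system.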

Author information

Authors and Affiliations

Department of Information Technology, School of Computing, SASTRA University, Thanjavur, Tamil Nadu, India
J. Sangeetha
Department of Computer Science, Annamalai University, Chidambaram, Tamil Nadu, India
S. Jothilakshmi

Corresponding author

Correspondence to J. Sangeetha.

About this article

Cite this article

Sangeetha, J., Jothilakshmi, S. Speech translation system for english to dravidian languages. Appl Intell 46, 534–550 (2017). https://doi.org/10.1007/s10489-016-0846-3

Issue Date: April 2017
DOI: https://doi.org/10.1007/s10489-016-0846-3
