Skip to main content
Log in

Speech translation system for english to dravidian languages

  • Published:
Applied Intelligence Aims and scope Submit manuscript

Abstract

In this paper the Speech-to-Speech Translation (SST) system, which is mainly focused on translation from English to Dravidian languages (Tamil and Malayalam) has been proposed. Three major techniques involved in SST system are Automatic continuous speech recognition, machine translation, and text-to-speech synthesis system. In this paper automatic Continuous Speech Recognition (CSR) has been developed based on the Auto Associative Neural Network (AANN), Support Vector Machine (SVM) and Hidden Markov Model (HMM). The HMM yields better results compared with SVM and AANN. Hence the HMM based Speech recognizer for English language has been taken. We propose a hybrid Machine Translation (MT) system (combination of Rule based and Statistical) for converting English to Dravidian languages text. A syllable based concatenative Text To Speech Synthesis (TTS) for Tamil and Malayalam has been proposed. AANN based prosody prediction has been done for the Tamil language which is used to improve the naturalness and intelligibility. The domain is restricted to sentences that cover the announcements in the railway station, bus stop and airport. This work is framed a novel translation method for English to Dravidian languages. The improved performance of each module HMM based CSR, Hybrid MT and concatenative TTS increases the overall speech translation performance. This proposed speech translation system can be applied to English to any Indian languages if we train and create a parallel corpus for those languages.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

References

  1. Arora K, Arora S, Roy MK (2013) Speech to speech translation: a communication boon. CSI Trans ICT 1(3):207–213

    Article  Google Scholar 

  2. Condon S, Arehart M, Parvaz D, Sanders G, Doran C, Aberdeen J (2012) Evaluation of 2-way iraqi arabic - english speech translation systems using automated metrics. Mach Transl Springer 26(1):159–176

    Article  Google Scholar 

  3. Cui J, Deng Y, Zhou B (2009) Reinforcing language model for speech translation with auxiliary data. ASRU IEEE502–506

  4. Dutoit T (1997) An introduction to text-to-speech synthesis. Kulwer Academic Publishers

  5. Dyer C, Muresan S, Resnik P (2008) Generalizing word lattice translation. In: Proceedings of ACL 08: HLT, pp 1012–1020

  6. Ettelaie E, Georgiou PG, Narayanan SS (2013) Unsupervised data processing for classifier-based speech translator. Comput Speech Lang 27(2):438–454

    Article  Google Scholar 

  7. Gao Y, Gu L, Zhou B, Sarikaya R, Afity M, Juo HK, Zhong W (2006) IBMmaster system: Multilingual automatic speech to speech translator. In: Proceedings of the workshop on medical speech translation at HLT-NAACL, pp 57–60

  8. Guan Y, Zheng L, Tian J (2010) Real-time speaker adapted speech to speech translation system in mobile environment. In: 10th international conference on signal processing (ICSP). IEEE, pp 577–580

  9. Juang BH, Rabiner LR (2005) Automatic speech recognition – a brief history of the technology development

  10. Kalyanasundaram K A brief History and Features of the Tamil Language. http://www.tamilelibrary.org/teli/tamil7.html, Accessed April 5, 2015

  11. Kao CL, Saleem S, Prasad R, Choi F, Natrajan P, Stallard D, Krstovski K, Kamali M (2008) Rapid development of an english/farsi speech to speech translation system. In: Proceedings of IWSLT, pp 166–173

  12. Lewis MP, Simons GF, Fennig CD Ethnologue: languages of the World. http://www.ethnologue.com/country/IN, Accessed April 5, 2015

  13. Matusov E, Ney H (2011) Lattice-based ASR-MT interface for speech translation. IEEE Trans Audio Speech Lang Process 19(4):721–732

    Article  Google Scholar 

  14. Nakamura S (2009) Overcoming the language barriers with speech translation technology. Sci Technol Q Rev 31:35–48

    Google Scholar 

  15. Nakamura S, Markov K, Nakaiwa H, Kikui G, Kawai H, Jitsuhiro T, Zhang JS (2006) The ATR multilingual speech to speech translation system. IEEE Trans Acoust Speech Signal Process 14(2):365–376

    Google Scholar 

  16. Papineni K, Roukos S, Ward T, Zhu WJ (2002) BLEU: A method for automatic evaluation of machine translation. In: Proceedings of the 40th annual meeting on association for computational linguistics, ACL ’02. Association for Computational Linguistics, Stroudsburg, pp 311–318

  17. Rabiner L, Juang BH (1993) Fundamentals of speech recognition. Prentice-Hall, Inc., Upper Saddle River

    MATH  Google Scholar 

  18. Rajaram BSR, Ramakrishnan AG, Kumar HRS (2013) An accessible translation system between simple kannada and tamil sentences. In: Proceedings of 6th language and technology conference

  19. Sakti PS, Maia M, Sakai R, Kimura S, Ashikari N, Sumita Y, Nakamura E (2009) Toward translating indonesian spoken utterances to/from other languages. In: International conference on oriental COCOSDA, pp 137–142

  20. Saraswathi S, Geetha TV (2010) Design of language models at various phases of tamil speech recognition system. Int J Eng Sci Technol 2(5):244–257

    Article  Google Scholar 

  21. Szekely E, Steiner I, Ahmed Z, Carson-Berndsen J (2013) Facial expression based affective speech translation. J Multimodal User Interfaces 8(1):87–96

    Article  Google Scholar 

  22. Weaver W (1955) Machine translation of languages, chap. translation. The MIT Press, pp 15–23

  23. Yegnanarayana B (1999) Artificial neural networks. Prentice-Hall, New Delhi

    Google Scholar 

  24. Zhou B, Cui X, Huang S, Cmejrek M, Zhang W (2013) The IBM speech-to-speech translation system for smartphone: Improvements for resource-constrained tasks. Comput Speech Lang 27(2):397–418

    Article  Google Scholar 

  25. Zhou B, Gao Y, Sorensen J, Diao Z, Picheny M (2002) Statistical natural language generation for speech to speech machine translation systems. In: International conference on spoken language processing, vol 3. Denver, pp 1897–1900

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to J. Sangeetha.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Sangeetha, J., Jothilakshmi, S. Speech translation system for english to dravidian languages. Appl Intell 46, 534–550 (2017). https://doi.org/10.1007/s10489-016-0846-3

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10489-016-0846-3

Keywords

Navigation