A Comparative Study of Text-to-Speech Systems in LabVIEW

Panoiu, Manuela; Rat, Cezara-Liliana; Panoiu, Caius

doi:10.1007/978-3-319-18296-4_1

Manuela Panoiu⁵,
Cezara-Liliana Rat⁶ &
Caius Panoiu⁵

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 356))

Included in the following conference series:

International Workshop Soft Computing Applications

780 Accesses
2 Citations

Abstract

In this paper we propose to study the possibilities of transforming the written language into speech (text-to-speech) using the LabVIEW programming environment. To this aim we studied the text-to-speech interfaces provided by the Microsoft Speech SDK for TTS applications. The number and diversity of languages that can be used through these interfaces are also taken into consideration. Emphasis is on the advantages and the limitations of each class while analyzing the possibility of rendering languages that use special characters. It is well known that LabVIEW offers little support for special characters, special language characters being no exception, hence finding a functional method of correctly rendering speech for special character languages is an arduous task we proceeded to undertake. We have also researched the use of a speech synthesizer called MBROLA that provides support for a wide range of international languages. Together with open-source software, namely eSpeak, MBROLA becomes a complete text-to-speech (TTS) system. We have also analyzed the possibility of interfacing eSpeak with LabVIEW.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Azis NA, Hikmah RM, Tjahja TV, Nugroho AS (2011) Evaluation of text-to-speech synthesizer for indonesian language using semantically unpredictable sentences test: IndoTTS, eSpeak, and Google Translate TTS. In: Proceedings of international conference on advanced computer science & information systems
Google Scholar
Rafieee MS, Jafari S, Ahmadi HS, Jafari M (2011) Considerations to spoken language recognition for text-to-speech applications. In: 2011 UkSim 13th international conference on computer modelling and simulation (UKSim), pp 304–309
Google Scholar
Falk TH, Mölle S (2008) Towards signal-based instrumental quality diagnosis for text-to-speech systems. IEEE Signal Process Lett 15:781–784
Article Google Scholar
Hunt AJ, Black AW (1996) Unit selection in a concatenative speech synthesis system using a large speech database. In: Proceedings of the IEEE international conference on acoustics, speech, and signal processing, ICASSP-96, vol 1, pp 373–376
Google Scholar
Dutoit T, Pagel V, Pierret N, Bataille F, van der Vrecken O (1996) The Mbrola project: towards a set of high quality speech synthesizers free of use for non commercial purposes. In: Proceedings of the fourth international conference on spoken language, ICSLP 96, vol 3, pp 1393–1396
Google Scholar
Microsoft Speech SDK 5.1 Help
Google Scholar
http://en.wikipedia.org/wiki/Lernout_%26_Hauspie
Bigorgne D, Boeffard O, Cherbonnel B, Emerard F, Larreur D, Le Saint-Milon JL, Metayer I, Sorin C, White S (1993) Multilingual PSOLA text-to-speech system. In: Proceedings of IEEE international conference on acoustics, speech, and signal processing, ICASSP-93, vol 2, pp 187–190
Google Scholar
Dutoit T, Leich H (1993) MBR-PSOLA: text-to-speech synthesis based on an MBE re-synthesis of the segments database. Speech Commun 435–440
Google Scholar
Moulines E, Charpentier F (1989) Pitch synchronous waveform processing techniques for text-to-speech synthesis using diphones. Speech Commun 9:5–6
Google Scholar
http://espeak.sourceforge.net/
https://decibel.ni.com/content/docs/DOC-2263
http://en.wikipedia.org/wiki/Microsoft_text-to-speech_voices
http://www.blong.com/Conferences/DCon2002/Speech/SAPI4HighLevel/SAPI4.htm
https://decibel.ni.com/content/docs/DOC-10153

Download references

Author information

Authors and Affiliations

Electrical Engineering and Industrial Informatics Department, University Polytechnic Timisoara, Timisoara, Romania
Manuela Panoiu & Caius Panoiu
University Polytechnic Timisoara, Timisoara, Romania
Cezara-Liliana Rat

Authors

Manuela Panoiu
View author publications
You can also search for this author in PubMed Google Scholar
Cezara-Liliana Rat
View author publications
You can also search for this author in PubMed Google Scholar
Caius Panoiu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Manuela Panoiu .

Editor information

Editors and Affiliations

Department of Automation and Applied Informatics, Aurel Vlaicu University of Arad, Arad, Romania
Valentina Emilia Balas
Faculty of Science and Technology, Data Science Institute, Bournemouth University, Poole, United Kingdom
Lakhmi C. Jain
University of Belgrade, Belgrade, Serbia
Branko Kovačević

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Panoiu, M., Rat, CL., Panoiu, C. (2016). A Comparative Study of Text-to-Speech Systems in LabVIEW. In: Balas, V., C. Jain, L., Kovačević, B. (eds) Soft Computing Applications. SOFA 2014. Advances in Intelligent Systems and Computing, vol 356. Springer, Cham. https://doi.org/10.1007/978-3-319-18296-4_1

Download citation

DOI: https://doi.org/10.1007/978-3-319-18296-4_1
Published: 03 November 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-18295-7
Online ISBN: 978-3-319-18296-4
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics