Abstract
In this paper we propose to study the possibilities of transforming the written language into speech (text-to-speech) using the LabVIEW programming environment. To this aim we studied the text-to-speech interfaces provided by the Microsoft Speech SDK for TTS applications. The number and diversity of languages that can be used through these interfaces are also taken into consideration. Emphasis is on the advantages and the limitations of each class while analyzing the possibility of rendering languages that use special characters. It is well known that LabVIEW offers little support for special characters, special language characters being no exception, hence finding a functional method of correctly rendering speech for special character languages is an arduous task we proceeded to undertake. We have also researched the use of a speech synthesizer called MBROLA that provides support for a wide range of international languages. Together with open-source software, namely eSpeak, MBROLA becomes a complete text-to-speech (TTS) system. We have also analyzed the possibility of interfacing eSpeak with LabVIEW.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Azis NA, Hikmah RM, Tjahja TV, Nugroho AS (2011) Evaluation of text-to-speech synthesizer for indonesian language using semantically unpredictable sentences test: IndoTTS, eSpeak, and Google Translate TTS. In: Proceedings of international conference on advanced computer science & information systems
Rafieee MS, Jafari S, Ahmadi HS, Jafari M (2011) Considerations to spoken language recognition for text-to-speech applications. In: 2011 UkSim 13th international conference on computer modelling and simulation (UKSim), pp 304–309
Falk TH, Mölle S (2008) Towards signal-based instrumental quality diagnosis for text-to-speech systems. IEEE Signal Process Lett 15:781–784
Hunt AJ, Black AW (1996) Unit selection in a concatenative speech synthesis system using a large speech database. In: Proceedings of the IEEE international conference on acoustics, speech, and signal processing, ICASSP-96, vol 1, pp 373–376
Dutoit T, Pagel V, Pierret N, Bataille F, van der Vrecken O (1996) The Mbrola project: towards a set of high quality speech synthesizers free of use for non commercial purposes. In: Proceedings of the fourth international conference on spoken language, ICSLP 96, vol 3, pp 1393–1396
Microsoft Speech SDK 5.1 Help
Bigorgne D, Boeffard O, Cherbonnel B, Emerard F, Larreur D, Le Saint-Milon JL, Metayer I, Sorin C, White S (1993) Multilingual PSOLA text-to-speech system. In: Proceedings of IEEE international conference on acoustics, speech, and signal processing, ICASSP-93, vol 2, pp 187–190
Dutoit T, Leich H (1993) MBR-PSOLA: text-to-speech synthesis based on an MBE re-synthesis of the segments database. Speech Commun 435–440
Moulines E, Charpentier F (1989) Pitch synchronous waveform processing techniques for text-to-speech synthesis using diphones. Speech Commun 9:5–6
http://en.wikipedia.org/wiki/Microsoft_text-to-speech_voices
http://www.blong.com/Conferences/DCon2002/Speech/SAPI4HighLevel/SAPI4.htm
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Panoiu, M., Rat, CL., Panoiu, C. (2016). A Comparative Study of Text-to-Speech Systems in LabVIEW. In: Balas, V., C. Jain, L., Kovačević, B. (eds) Soft Computing Applications. SOFA 2014. Advances in Intelligent Systems and Computing, vol 356. Springer, Cham. https://doi.org/10.1007/978-3-319-18296-4_1
Download citation
DOI: https://doi.org/10.1007/978-3-319-18296-4_1
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-18295-7
Online ISBN: 978-3-319-18296-4
eBook Packages: EngineeringEngineering (R0)