Abstract
Quality of service (QoS) evaluation is vital for text-to-speech (TTS) web service applications. Most of the current solutions focus on either evaluating functional or nonfunctional attributes of the TTS. In this paper, we propose a QoS framework to evaluate and analyze the perceived QoS that combines general and specific mechanisms for measuring both functional and nonfunctional requirements of speech quality. General mechanism measures the response time of TTS services while specific mechanism measures intelligibility and naturalness through subjective quality measurements, which are mapped onto mean opinion score (MOS). The result shows the workability of the framework, tested by predetermined users to three services: service1 (Fromtexttospeech) resulting 47.84%; service2 and service3 (NaturalReader and Yakitome) are 31.62 and 21.53% respectively. The TTS services evaluation can be to enhance the user experience.
Similar content being viewed by others
References
Patil, M., Kawitkar, R.S.: “Syllable” concatenation for text to speech synthesis for Devnagari script. Int. J. Adv. Res. Eng. Comput. Sci. Softw. 2(9), 180–184 (2012)
Md Fudzee, M.F., Abawajy, J.: A protocol for discovering content adaptation services. In: Xiang, Y., Cuzzocrea, A., Hobbs, M., Zhou, W. (eds.) ICA3PP 2011. LNCS, vol. 7017, pp. 235–244. Springer, Heidelberg (2011). doi:10.1007/978-3-642-24669-2
Wang, L., et al.: Evaluating text-to-speech intelligibility using template constrained generalized posterior probability. U.S. Patent Application (2012)
Remes, U., Reima, K., Mikko, K.: Objective evaluation measures for speaker adaptive HMM-TTS systems. In: Proceedings of 8th ISCA Speech Synthesis Workshop (2013)
Möller, S., Wai, Y.C., Cote, N., Falk, T., Raake, A., Waltermann, A.: Speech quality estimation: models and trends. IEEE Sign. Process. Mag. 28, 18–28 (2011)
Egger, S., et al.: Waiting times in quality of experience for web based services. In: 2012 Fourth International Workshop on Quality of Multimedia Experience (QoMEX). IEEE (2012)
Streijl, C.R., Winkler, S., Hands, D.S.: Mean Opinion Score (MOS) revisited: methods and applications, limitations and alternatives. Multimedia Syst. 22, 213–227 (2014)
Md Fudzee, M.F., Abawajy, J.: Request-driven cross-media content adaptation technique. In: Ragab, K., Helmy, T., Hassanien, A.E. (eds.) Developing Advanced Web Services Through P2P Computing and Autonomous Agents: Trends and Innovations, chap. 6, pp. 91–113. IGI Global (2010)
Eyben, F., et al.: Unsupervised clustering of emotion and voice styles for expressive TTS. In: International Conference on IEEE Acoustics, Speech and Signal Processing (ICASSP) (2012)
Md Fudzee, M.F., Abawajy, J.: Management of Service level agreement for service-oriented content adaptation platform. In: Network and Traffic Engineering in Emerging Distributed Computing Applications, pp. 21–42 (2012)
Acknowledgments
The authors would like to acknowledge the Malaysian Ministry of Higher Education for the Fundamental Research Grant Scheme vot 1238. This research also supported by GATES IT Solution Sdn. Bhd under its publication scheme.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Md Fudzee, M.F., Hassan, M., Mahdin, H., Kasim, S., Abawajy, J. (2017). A Framework to Analyze Quality of Service (QoS) for Text-To-Speech (TTS) Services. In: Herawan, T., Ghazali, R., Nawi, N.M., Deris, M.M. (eds) Recent Advances on Soft Computing and Data Mining. SCDM 2016. Advances in Intelligent Systems and Computing, vol 549. Springer, Cham. https://doi.org/10.1007/978-3-319-51281-5_59
Download citation
DOI: https://doi.org/10.1007/978-3-319-51281-5_59
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-51279-2
Online ISBN: 978-3-319-51281-5
eBook Packages: EngineeringEngineering (R0)