Abstract
The mixed excitation linear prediction (MELP) vocoder is widely used for low bit-rate speech coding, where current work focuses on the overall coding scheme, reducing the coding rate, and improving robustness. The unvoiced/voiced/silence (UVS) detection algorithm offers a degree of robustness and noise resistance, while a voiced excitation model based on the maximum voicing frequency (MVF) algorithm is closer to the characteristics of the original speech. In this paper, the original excitation model of the MELP vocoder is replaced with the MVF-based model and UVS detection is added, yielding an improved vocoder with a coding rate of 2.4 kbps. Compared with the federal standard MELP vocoder, the improved vocoder achieves better synthesized speech quality and robustness.
Copyright information
© 2017 ICST Institute for Computer Sciences, Social Informatics and Telecommunications Engineering
About this paper
Cite this paper
Lu, T., Zhao, X. (2017). An MELP Vocoder Based on UVS and MVF. In: Xin-lin, H. (eds) Machine Learning and Intelligent Communications. MLICOM 2016. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 183. Springer, Cham. https://doi.org/10.1007/978-3-319-52730-7_5
DOI: https://doi.org/10.1007/978-3-319-52730-7_5
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-52729-1
Online ISBN: 978-3-319-52730-7
eBook Packages: Computer Science, Computer Science (R0)