An MELP Vocoder Based on UVS and MVF

Lu, Tangle; Zhao, Xiaoqun

doi:10.1007/978-3-319-52730-7_5

Part of the book series: Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering (LNICST, volume 183)

Included in the following conference series:

International Conference on Machine Learning and Intelligent Communications

Abstract

The mixed excitation linear prediction (MELP) vocoder is widely used for low bit-rate speech coding, where current work focuses on the overall coding scheme, reducing the coding rate, and improving robustness. The unvoiced/voiced/silence (UVS) detection algorithm offers a degree of robustness and noise immunity, while a voiced excitation model based on the maximum voicing frequency (MVF) algorithm is closer to the characteristics of the original speech. In this paper, the original excitation model of the MELP vocoder is replaced with the MVF-based model and UVS detection is added, yielding an improved vocoder with a coding rate of 2.4 kbps. Compared with the federal-standard MELP, the improved vocoder achieves better synthesized speech quality and robustness.
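To make the idea of unvoiced/voiced/silence detection concrete, here is a minimal sketch of a textbook three-way frame classifier based on short-time energy and zero-crossing rate. It is not the adaptive anti-noise UVS algorithm the abstract refers to. The function names, the thresholds `silence_energy` and `zcr_threshold`, and the 8 kHz sampling rate with 180-sample frames are all assumptions chosen for illustration.

```python
import math

def frame_features(frame):
    """Return (short-time energy, zero-crossing rate) for one frame of samples."""
    energy = sum(x * x for x in frame) / len(frame)
    # Count sign changes between consecutive samples.
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0))
    zcr = crossings / (len(frame) - 1)
    return energy, zcr

def classify_frame(frame, silence_energy=1e-4, zcr_threshold=0.25):
    """Label a frame 'silence', 'voiced', or 'unvoiced'.

    Both thresholds are hypothetical values for this sketch; a practical
    detector would adapt them to the background noise level.
    """
    energy, zcr = frame_features(frame)
    if energy < silence_energy:
        return "silence"
    # Voiced speech is dominated by low-frequency harmonics (few crossings);
    # unvoiced speech is noise-like (many crossings).
    return "unvoiced" if zcr > zcr_threshold else "voiced"

# Assumed framing for the demo: 8 kHz sampling, 180 samples per frame.
FS, N = 8000, 180
low_tone = [0.5 * math.sin(2 * math.pi * 100 * n / FS) for n in range(N)]
high_tone = [0.1 * math.sin(2 * math.pi * 3000 * n / FS) for n in range(N)]
quiet = [0.0] * N

for name, frame in [("low_tone", low_tone), ("high_tone", high_tone), ("quiet", quiet)]:
    print(name, classify_frame(frame))
```

The pure tones only stand in for voiced and noise-like frames. Fixed thresholds such as these tend to fail in background noise, which is the weakness a robust detector has to address.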

Author information

Authors and Affiliations

Tongji University, Shanghai, China
Tangle Lu & Xiaoqun Zhao

Corresponding author

Correspondence to Xiaoqun Zhao.

Editor information

Editors and Affiliations

Tongji University, Shanghai, China
Huang Xin-lin

About this paper

Cite this paper

Lu, T., Zhao, X. (2017). An MELP Vocoder Based on UVS and MVF. In: Xin-lin, H. (eds) Machine Learning and Intelligent Communications. MLICOM 2016. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 183. Springer, Cham. https://doi.org/10.1007/978-3-319-52730-7_5

DOI: https://doi.org/10.1007/978-3-319-52730-7_5
Published: 03 February 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-52729-1
Online ISBN: 978-3-319-52730-7
eBook Packages: Computer Science, Computer Science (R0)
