Abstract
The high frequency part of the voiced speech signal beyond 4 kHz is very difficult to study and to decompose into harmonics. In the HNM this spectrum part is assumed to be noise. In this paper it is shown that the main problem is numerical. Faster harmonics have faster trends. It is necessary to implement precise estimation technique to estimate a high frequency complex amplitude on a short time interval. An illustrative example is supplied. In the second part of the paper a new modification technique is proposed for interpolation of the complex amplitudes in the case of intonation modification. Reliable estimates of harmonic complex amplitudes are necessary as inputs. Then a nonlinear rule is formulated that incorporates specific features of formants and their slopes.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Stylianou, Y.: Harmonic plus Noise Models for speech, combined with statistical methods, for speech and speaker modification. Ph.D. Thesis. Ecole Nationale Superieure des Telecommunications. Paris (1996)
Degottex, G., Stylianou, Y.: Analysis and synthesis of speech using an adaptive full-band harmonic model. IEEE Trans. Audio Speech Lang. Process. 21(10), 2085–2095 (2013)
Petrovsky, A., Azarov, E., Petrovsky, A.: Hybrid signal decomposition based on instantaneous harmonic parameters and perceptually motivated wavelet packets for scalable audio coding. Signal Process. 91(6), 1489–1504 (2011)
Petrovsky, A., Azarov, E.: Instantaneous harmonic analysis: techniques and applications to speech signal processing. In: Ronzhin, A., Potapova, R., Delic, V. (eds.) SPECOM 2014. LNCS, vol. 8773, pp. 24–33. Springer, Heidelberg (2014)
Barabanov, A., Melnikov, A., Magerkin, V., Vikulov, E.: Fast algorithm for precise estimation of fundamental frequency on short time intervals. In: Ronzhin, A., Potapova, R., Fakotakis, N. (eds.) SPECOM 2015. LNCS, vol. 9319, pp. 217–225. Springer, Heidelberg (2015)
Shipilo, A., Barabanov, A., Lipkovich, M.: Parametric speech synthesis and user interface for speech modification. In: Železný, M., Habernal, I., Ronzhin, A. (eds.) SPECOM 2013. LNCS, vol. 8113, pp. 249–256. Springer, Heidelberg (2013)
Acknowledgments
The work was supported by Saint Petersburg State University, project 6.37.349.2015.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Barabanov, A., Magerkin, V., Vikulov, E. (2016). Precise Estimation of Harmonic Parameter Trend and Modification of a Speech Signal. In: Ronzhin, A., Potapova, R., Németh, G. (eds) Speech and Computer. SPECOM 2016. Lecture Notes in Computer Science(), vol 9811. Springer, Cham. https://doi.org/10.1007/978-3-319-43958-7_66
Download citation
DOI: https://doi.org/10.1007/978-3-319-43958-7_66
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-43957-0
Online ISBN: 978-3-319-43958-7
eBook Packages: Computer ScienceComputer Science (R0)