A Novel Source Analysis Method by Matching Spectral Characters of LF Model with STRAIGHT Spectrum

Ling, Zhen-Hua; Hu, Yu; Wang, Ren-Hua

doi:10.1007/11573548_57

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 3784)

Included in the following conference series:

International Conference on Affective Computing and Intelligent Interaction

Abstract

This paper presents a voice source analysis method based on the spectral characteristics of the LF model and their representation in the output speech signal. The source features are estimated as the set of LF parameters whose spectrum is most similar, in frequency-domain characteristics including the glottal formant and spectral tilt, to the STRAIGHT spectrum of the speech signal under analysis. In addition, the concept of an analyzable frame is introduced to ensure the feasibility and improve the reliability of the proposed method. Evaluation with synthetic speech shows that the method can estimate the LF parameters with satisfactory precision. Furthermore, an experiment with emotional speech shows the effectiveness of the proposed method in describing the variation in voice quality among speech with different emotions.
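
The matching idea in the abstract can be illustrated with a minimal Python sketch. This is not the authors' algorithm: the abstract does not give the LF spectrum formulas, the distance measure, or the definition of an analyzable frame, so everything below is an assumption made for illustration. The function `toy_source_spectrum` is a hypothetical stand-in for the LF model spectrum (a rise of 6 dB per octave up to a peak, then a linear fall), the peak frequency is used as a crude proxy for the glottal formant, and the cost in `best_match` is an arbitrary choice. In practice the target would be a STRAIGHT spectral envelope rather than a synthetic one.

```python
import math

def spectral_tilt(freqs_hz, mags_db):
    """Least-squares slope of magnitude (dB) against log2(frequency), in dB/octave."""
    xs = [math.log2(f) for f in freqs_hz]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(mags_db) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, mags_db))
    return sxy / sxx

def features(freqs_hz, mags_db):
    """Return (peak frequency, tilt above the peak) for one spectrum.

    The peak frequency is only a crude proxy for the glottal formant.
    """
    i = max(range(len(mags_db)), key=mags_db.__getitem__)
    tilt = spectral_tilt(freqs_hz[i + 1:], mags_db[i + 1:])
    return freqs_hz[i], tilt

def toy_source_spectrum(freqs_hz, fg_hz, tilt_db_per_oct):
    """Hypothetical source spectrum (NOT the LF model): rises 6 dB/octave
    up to fg_hz, then falls at tilt_db_per_oct beyond it."""
    spectrum = []
    for f in freqs_hz:
        octaves = math.log2(f / fg_hz)
        spectrum.append(6.0 * octaves if f <= fg_hz else tilt_db_per_oct * octaves)
    return spectrum

def best_match(freqs_hz, target_db, candidates):
    """Pick the candidate (fg_hz, tilt) whose toy spectrum has features
    closest to those of the target spectrum. The cost is an assumption."""
    target_peak, target_tilt = features(freqs_hz, target_db)

    def cost(candidate):
        fg_hz, tilt = candidate
        peak, slope = features(freqs_hz, toy_source_spectrum(freqs_hz, fg_hz, tilt))
        # Peak mismatch in octaves plus tilt mismatch scaled by 6 dB/octave.
        return abs(math.log2(peak / target_peak)) + abs(slope - target_tilt) / 6.0

    return min(candidates, key=cost)

# Usage: quarter-octave frequency grid and a synthetic target spectrum.
freqs = [50.0 * 2 ** (k / 4) for k in range(28)]
target = toy_source_spectrum(freqs, 200.0, -12.0)
grid = [(fg, t) for fg in (100.0, 200.0, 400.0) for t in (-6.0, -12.0, -18.0)]
print(best_match(freqs, target, grid))
```

The exhaustive search over a small candidate grid is chosen here only for clarity; the paper may use a different search strategy and different feature definitions.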

Author information

Authors and Affiliations

iFlytek Speech Laboratory, University of Science and Technology of China, Hefei
Zhen-Hua Ling, Yu Hu & Ren-Hua Wang

Editor information

Editors and Affiliations

National Laboratory of Pattern Recognition (NLPR), Institute of Automation, Chinese Academy of Sciences,
Jianhua Tao
National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing, China
Tieniu Tan
MIT Media Laboratory, 20 Ames Street, 02139, Cambridge, MA, USA
Rosalind W. Picard

About this paper

Cite this paper

Ling, ZH., Hu, Y., Wang, RH. (2005). A Novel Source Analysis Method by Matching Spectral Characters of LF Model with STRAIGHT Spectrum. In: Tao, J., Tan, T., Picard, R.W. (eds) Affective Computing and Intelligent Interaction. ACII 2005. Lecture Notes in Computer Science, vol 3784. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11573548_57

DOI: https://doi.org/10.1007/11573548_57
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29621-8
Online ISBN: 978-3-540-32273-3
eBook Packages: Computer Science, Computer Science (R0)
