Abstract
In performing speech conversion from a source speaker to a target speaker, it is important that the pitch contour of the source speakers utterance be converted into that of the target speaker, because pitch contour of a speech utterance plays an important role in expressing speaker’s individuality and meaning of the of the utterance. This paper describes statistical algorithms of pitch contour conversion for Korean language. Pitch contour conversions are investigated at two levels of prosodic phrases: intonational phrase and accentual phrase. The basic algorithm is a Gaussian normalization in intonational phrase. The first presented algorithm is combined with a declination-line of pitch contour in an intonational phrase. The second one is Gaussian normalization within accentual phrases to compensate for local pitch variations. Experimental results show that the algorithm of Gaussian normalization within accentual phrases is significantly more accurate than the other two algorithms in intonational phrase.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Akagi, M., Ienaga, T.: Speaker Individualities in Fundamental Frequency Contours and Its Control. In: Proc. EuroSpeech 1995, pp. 439–442 (September 1995)
Kuwabara, H., Sagisaka, Y.: Acoustic Characteristics of Speaker Individuality: Control and Conversion. Speech Communication 16, 165–173 (1995)
Kain, A., Macon, M.W.: Spectral Voice Conversion for Text-To-Speech Synthesis. In: Proc. ICASSP 1998, vol. 1, pp. 285–288 (1998)
van Santen, J.P.H.: Prosodic Modeling in Text-to- Speech Synthesis. Proc. EuroSpeech 1997, KN 19-KN 28 (1997)
Kim, Y.J., Byeon, H.J., Oh, Y.H.: Prosodic Phrasing in Korean; Determine Governor, and then Split or Not. In: Proc. EuroSpeech 1999, pp. 539–542 (1999)
Arslan, L.M., Talkin, D.: Speaker Transformation using Sentence HMM based Alignments and Detailed Prosody Modification. In: Proc. ICASSP 1998, vol. 1, pp. 289–292 (1998)
Chappel, D.T., Hansen, J.H.L.: Speaker-Specific Pitch Contour Modeling and Modification. In: Proc. ICASSP 1998, vol. 1, pp. 885–888 (1998)
Nespor, M., Vogel, I.: Prosodic Phonology. Foris Publication, Dordrecht
Jun, S.-A.: The Phonetics and Phonology of Korean Prosody, Ph. D. Dissertation, The Ohio State University (1993)
Lee, K.Y., Song, M.S.: Automatic Detection of Korean Accentual Phrase Boundaries. The Journal of Acoustic Society of Korea 18(1E), 27–31 (1999)
Moulines, E., Charpentier, F.: Pitch-Synchronous Waveform Processing Techniques for Text-to-Speech Synthesis Using Diphones. Speech Communication 9(5,6), 453–467 (1990)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Lee, K.Y., Kim, J.K., Bae, M.J. (2004). Statistical Pitch Conversion Approaches Based on Korean Accentual Phrases. In: Zhang, C., W. Guesgen, H., Yeap, WK. (eds) PRICAI 2004: Trends in Artificial Intelligence. PRICAI 2004. Lecture Notes in Computer Science(), vol 3157. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-28633-2_97
Download citation
DOI: https://doi.org/10.1007/978-3-540-28633-2_97
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22817-2
Online ISBN: 978-3-540-28633-2
eBook Packages: Springer Book Archive