Skip to main content

Statistical Pitch Conversion Approaches Based on Korean Accentual Phrases

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3157))

Abstract

In performing speech conversion from a source speaker to a target speaker, it is important that the pitch contour of the source speakers utterance be converted into that of the target speaker, because pitch contour of a speech utterance plays an important role in expressing speaker’s individuality and meaning of the of the utterance. This paper describes statistical algorithms of pitch contour conversion for Korean language. Pitch contour conversions are investigated at two levels of prosodic phrases: intonational phrase and accentual phrase. The basic algorithm is a Gaussian normalization in intonational phrase. The first presented algorithm is combined with a declination-line of pitch contour in an intonational phrase. The second one is Gaussian normalization within accentual phrases to compensate for local pitch variations. Experimental results show that the algorithm of Gaussian normalization within accentual phrases is significantly more accurate than the other two algorithms in intonational phrase.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Akagi, M., Ienaga, T.: Speaker Individualities in Fundamental Frequency Contours and Its Control. In: Proc. EuroSpeech 1995, pp. 439–442 (September 1995)

    Google Scholar 

  2. Kuwabara, H., Sagisaka, Y.: Acoustic Characteristics of Speaker Individuality: Control and Conversion. Speech Communication 16, 165–173 (1995)

    Article  Google Scholar 

  3. Kain, A., Macon, M.W.: Spectral Voice Conversion for Text-To-Speech Synthesis. In: Proc. ICASSP 1998, vol. 1, pp. 285–288 (1998)

    Google Scholar 

  4. van Santen, J.P.H.: Prosodic Modeling in Text-to- Speech Synthesis. Proc. EuroSpeech 1997, KN 19-KN 28 (1997)

    Google Scholar 

  5. Kim, Y.J., Byeon, H.J., Oh, Y.H.: Prosodic Phrasing in Korean; Determine Governor, and then Split or Not. In: Proc. EuroSpeech 1999, pp. 539–542 (1999)

    Google Scholar 

  6. Arslan, L.M., Talkin, D.: Speaker Transformation using Sentence HMM based Alignments and Detailed Prosody Modification. In: Proc. ICASSP 1998, vol. 1, pp. 289–292 (1998)

    Google Scholar 

  7. Chappel, D.T., Hansen, J.H.L.: Speaker-Specific Pitch Contour Modeling and Modification. In: Proc. ICASSP 1998, vol. 1, pp. 885–888 (1998)

    Google Scholar 

  8. Nespor, M., Vogel, I.: Prosodic Phonology. Foris Publication, Dordrecht

    Google Scholar 

  9. Jun, S.-A.: The Phonetics and Phonology of Korean Prosody, Ph. D. Dissertation, The Ohio State University (1993)

    Google Scholar 

  10. Lee, K.Y., Song, M.S.: Automatic Detection of Korean Accentual Phrase Boundaries. The Journal of Acoustic Society of Korea 18(1E), 27–31 (1999)

    Google Scholar 

  11. Moulines, E., Charpentier, F.: Pitch-Synchronous Waveform Processing Techniques for Text-to-Speech Synthesis Using Diphones. Speech Communication 9(5,6), 453–467 (1990)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2004 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Lee, K.Y., Kim, J.K., Bae, M.J. (2004). Statistical Pitch Conversion Approaches Based on Korean Accentual Phrases. In: Zhang, C., W. Guesgen, H., Yeap, WK. (eds) PRICAI 2004: Trends in Artificial Intelligence. PRICAI 2004. Lecture Notes in Computer Science(), vol 3157. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-28633-2_97

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-28633-2_97

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-22817-2

  • Online ISBN: 978-3-540-28633-2

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics