On a Speech Multiple System Implementation for Speech Synthesis

Kim, Jong Kuk; Hahn, Hern Soo; Bae, Myung Jin

doi:10.1007/s11277-008-9575-1

On a Speech Multiple System Implementation for Speech Synthesis

Published: 23 August 2008

Volume 49, pages 533–543, (2009)
Cite this article

Wireless Personal Communications Aims and scope Submit manuscript

Jong Kuk Kim¹,
Hern Soo Hahn¹ &
Myung Jin Bae¹

66 Accesses
2 Citations
Explore all metrics

Abstract

This paper proposes a voice synthesizer to convert a single speech to multiple speeches. Pitch is an important voice characteristic of speech parameter and it is produced by the periodic vibration of the vocal-cords; the parameter most sensitive for human’s auditory sense. So if you change the pitch of the voice with several different scaling rates, you can produce several different voices at the same time with a voice. The Multiple-Speech Synthesizer will be used in diverse practical applications. The cheer synthesizer that makes group voice from a single voice would be such an example. You could also use the system for a troll toy, birthday song tracks, effect sounds in movies and plays, protection systems of houses, etc. Furthermore, the synthesizer could be used to imitate the voice of famous entertainers or cartoon characters, such as the Mask-man.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

Bastien P. (2001). Pitch shifting and voice transformation techniques. TC-Helicon Vocal Technologies, http://www.tc-helicon.com.
Gold E., Morgan N. (2000) Speech and audio signal processing. Wiley, New York
Google Scholar
Iida, A., et al. (2000). A speech synthesis system with emotion for assisting communication. In Proceedings of ICSA Workshop on Speech and Emotion (pp. 167–177).
Keller E. et al. (2002). Improvement in speech synthesis. Wiley, New York
Google Scholar
Kim J.K., Jo W.R., Bae M.J. (2003) A study on real time prosody control of speech. CCCT 2003(5): 195–198
Google Scholar
Kim, J. K., Na, D. S., & Bae, M. J. (2007). On a pitch alteration technique in transformation domain of speech signals. In International Conference on Life System Modelling and Simulation 2007 (Vol. 14, pp. 522–526). Watam Press.
Mitra S. (2001) Digital signal processing, a computer-based approach. McGraw-Hill, New York
Google Scholar
Park H.B., Bae M.J. (2000) On a detection of pitch point for voice color conversion. Journal of the Acoustical Society of Korea & World Affairs 19(1): 49–152
Google Scholar
Seo J.H., Bae M.J. (2004) On a pitch alteration technique by sub-band scaling in speech signal. GESTS, GESTS International Transactions on Speech Science and Engineering 1(2): 112–120
Google Scholar
Tse, M. (2003). Voice transformation. EE6820 Speech and Audio Processing Project Report, Columbia University.
Turk, O. (2003). New methods for voice conversion. M.S. Thesis, in Electrical and Electronics Engineering, Bogazici University.
Turk, O., & Arslan, L. M. (2002). Subband based voice conversion. In Proceedings of the ICSLP 2002 (Vol. 1, pp. 289–292).
Verhelst, W. (2000). Overlap-add methods for time-scaling of speech. Speech Communication, 30, 207–221, Elsevier Science.
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Information and Telecommunication Engineering, Soongsil University, Seoul, 156-743, South Korea
Jong Kuk Kim, Hern Soo Hahn & Myung Jin Bae

Authors

Jong Kuk Kim
View author publications
You can also search for this author in PubMed Google Scholar
Hern Soo Hahn
View author publications
You can also search for this author in PubMed Google Scholar
Myung Jin Bae
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Jong Kuk Kim.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Kim, J.K., Hahn, H.S. & Bae, M.J. On a Speech Multiple System Implementation for Speech Synthesis. Wireless Pers Commun 49, 533–543 (2009). https://doi.org/10.1007/s11277-008-9575-1

Download citation

Received: 29 October 2007
Accepted: 04 August 2008
Published: 23 August 2008
Issue Date: June 2009
DOI: https://doi.org/10.1007/s11277-008-9575-1

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

On a Speech Multiple System Implementation for Speech Synthesis

Abstract

Access this article

Similar content being viewed by others

A comprehensive survey on automatic speech recognition using neural networks

Conventional and contemporary approaches used in text to speech synthesis: a review

Milestones in speaker recognition

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

On a Speech Multiple System Implementation for Speech Synthesis

Abstract

Access this article

Similar content being viewed by others

A comprehensive survey on automatic speech recognition using neural networks

Conventional and contemporary approaches used in text to speech synthesis: a review

Milestones in speaker recognition

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation