Skip to main content
Log in

On a Speech Multiple System Implementation for Speech Synthesis

  • Published:
Wireless Personal Communications Aims and scope Submit manuscript

Abstract

This paper proposes a voice synthesizer to convert a single speech to multiple speeches. Pitch is an important voice characteristic of speech parameter and it is produced by the periodic vibration of the vocal-cords; the parameter most sensitive for human’s auditory sense. So if you change the pitch of the voice with several different scaling rates, you can produce several different voices at the same time with a voice. The Multiple-Speech Synthesizer will be used in diverse practical applications. The cheer synthesizer that makes group voice from a single voice would be such an example. You could also use the system for a troll toy, birthday song tracks, effect sounds in movies and plays, protection systems of houses, etc. Furthermore, the synthesizer could be used to imitate the voice of famous entertainers or cartoon characters, such as the Mask-man.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Bastien P. (2001). Pitch shifting and voice transformation techniques. TC-Helicon Vocal Technologies, http://www.tc-helicon.com.

  2. Gold E., Morgan N. (2000) Speech and audio signal processing. Wiley, New York

    Google Scholar 

  3. Iida, A., et al. (2000). A speech synthesis system with emotion for assisting communication. In Proceedings of ICSA Workshop on Speech and Emotion (pp. 167–177).

  4. Keller E. et al. (2002). Improvement in speech synthesis. Wiley, New York

    Google Scholar 

  5. Kim J.K., Jo W.R., Bae M.J. (2003) A study on real time prosody control of speech. CCCT 2003(5): 195–198

    Google Scholar 

  6. Kim, J. K., Na, D. S., & Bae, M. J. (2007). On a pitch alteration technique in transformation domain of speech signals. In International Conference on Life System Modelling and Simulation 2007 (Vol. 14, pp. 522–526). Watam Press.

  7. Mitra S. (2001) Digital signal processing, a computer-based approach. McGraw-Hill, New York

    Google Scholar 

  8. Park H.B., Bae M.J. (2000) On a detection of pitch point for voice color conversion. Journal of the Acoustical Society of Korea & World Affairs 19(1): 49–152

    Google Scholar 

  9. Seo J.H., Bae M.J. (2004) On a pitch alteration technique by sub-band scaling in speech signal. GESTS, GESTS International Transactions on Speech Science and Engineering 1(2): 112–120

    Google Scholar 

  10. Tse, M. (2003). Voice transformation. EE6820 Speech and Audio Processing Project Report, Columbia University.

  11. Turk, O. (2003). New methods for voice conversion. M.S. Thesis, in Electrical and Electronics Engineering, Bogazici University.

  12. Turk, O., & Arslan, L. M. (2002). Subband based voice conversion. In Proceedings of the ICSLP 2002 (Vol. 1, pp. 289–292).

  13. Verhelst, W. (2000). Overlap-add methods for time-scaling of speech. Speech Communication, 30, 207–221, Elsevier Science.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jong Kuk Kim.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Kim, J.K., Hahn, H.S. & Bae, M.J. On a Speech Multiple System Implementation for Speech Synthesis. Wireless Pers Commun 49, 533–543 (2009). https://doi.org/10.1007/s11277-008-9575-1

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11277-008-9575-1

Keywords

Navigation