
Parameters evaluation of SOLA algorithm for time scale modification

International Journal of Speech Technology

Abstract

Successful operation of the Synchronous Overlap and Add (SOLA) algorithm for Time Scale Modification (TSM) of speech depends closely on the proper choice of parameters. This paper investigates the quality of time-scale-modified speech under different values of the primary parameters. Based on Mean Opinion Score (MOS) tests and the Bark Spectral Distortion (BSD) measure, proper choices of the synthesis shift (Ss) and the duration of the shift search interval (K_max) are determined experimentally. The conclusions can help in operating the SOLA algorithm for time scale modification of speech.
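To make the roles of the two parameters concrete, the following is a minimal sketch of a basic SOLA loop in pure Python. It is not the authors' implementation. The function name `sola`, the frame length `frame_len`, the analysis shift `sa`, the one-sided search range 0..K_max, the normalized cross-correlation criterion, and the linear cross-fade are all assumptions chosen for illustration; only the synthesis shift (`ss`) and the search interval (`k_max`) correspond to parameters named in the abstract.

```python
import math


def sola(x, frame_len, sa, ss, k_max):
    """Illustrative SOLA: stretch x by roughly ss / sa.

    x         -- list of samples
    frame_len -- analysis frame length in samples (assumed parameter)
    sa        -- analysis shift in samples (assumed parameter)
    ss        -- synthesis shift Ss in samples
    k_max     -- length of the shift search interval K_max in samples
    """
    if sa <= 0 or ss <= 0 or k_max < 0 or ss + k_max >= frame_len:
        raise ValueError("need sa > 0, ss > 0, k_max >= 0, ss + k_max < frame_len")

    y = list(x[:frame_len])  # the first frame is copied unchanged
    m = 1
    while m * sa + frame_len <= len(x):
        frame = x[m * sa : m * sa + frame_len]

        # Search lags 0..k_max for the best alignment with the output so far,
        # scored by normalized cross-correlation over the overlapping samples.
        best_k, best_score = 0, -math.inf
        for k in range(k_max + 1):
            start = m * ss + k
            overlap = min(len(y) - start, frame_len)
            num = e_y = e_f = 0.0
            for i in range(overlap):
                a, b = y[start + i], frame[i]
                num += a * b
                e_y += a * a
                e_f += b * b
            den = math.sqrt(e_y * e_f)
            score = num / den if den > 0.0 else 0.0
            if score > best_score:
                best_k, best_score = k, score

        # Cross-fade linearly over the overlap, then append the rest of the frame.
        start = m * ss + best_k
        overlap = min(len(y) - start, frame_len)
        for i in range(overlap):
            w = i / overlap
            y[start + i] = (1.0 - w) * y[start + i] + w * frame[i]
        y.extend(frame[overlap:])
        m += 1
    return y
```

In this sketch a larger `k_max` gives the alignment search more freedom at the cost of more correlation evaluations per frame, while `ss` relative to `frame_len` sets how much of each frame overlaps the existing output. For example, `sola(x, frame_len=300, sa=100, ss=150, k_max=50)` lengthens a signal by a factor of about 1.5.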



Author information


Correspondence to Zhou Jun.

Additional information

Supported by Shaanxi Province Natural Science Researching Project (No. 2003F26).


About this article

Cite this article

Jun, Z., Wei, T., Yanpu, C. et al. Parameters evaluation of SOLA algorithm for time scale modification. Int J Speech Technol 10, 89–94 (2007). https://doi.org/10.1007/s10772-009-9019-7


  • DOI: https://doi.org/10.1007/s10772-009-9019-7
