Skip to main content

Analysis/Synthesis Speech Model Based on the Pitch-Tracking Periodic-Aperiodic Decomposition

  • Conference paper
Book cover Information Processing and Security Systems

Abstract

This paper presents a speech analysis/synthesis model based on periodic-aperiodic decomposition. In presented approach, decomposition is performed in whole speech band without making identification of voiced/unvoiced regions. Other important feature is pitch-tracking ability of decomposition algorithm. For this purpose a new pitch-tracking transformation called Time-Varying Discrete Fourier Transform (TVDFT) is employed. Periodic component is modelled as a sum of pitch harmonics with amplitudes and phases estimated with TVDFT. Aperiodic component is defined as a difference between original speech signal and synthesised periodic component. TVDFT needs accurate fundamental pitch estimation. This paper also presents a robust pitch estimation.. Experimental results showing advantages of suggested model are also given.

This work was supported by Bialystok Technical University under the grant W/WI/2/04

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

7 References

  1. Kondoz A.M., “Digital speech: coding for low bit rate communication systems”, John Wiley & Sons, Inc., New York, 1996.

    Google Scholar 

  2. Spanias A.S., „Speech coding: a tutorial review“, Proc. IEEE, Vol. 82, No. 10, pp. 1541–1582, 1994.

    Article  Google Scholar 

  3. Almeida L.B., Tribolet J.M., “Harmonic Coding: A Low Bit-Rate, Good Quality, Speech Coding Technique”, Proc. IEEE Int. Conf. on Accoust., Speech and Signal Processing, pp. 1664–1667, 1982.

    Google Scholar 

  4. McAulay R.J., Quatieri T.F., „Sinusoidal Coding“ in “Speech Coding and Synthesis” (W. Klein and K. Palival, eds.), Elsevier Science Publishers, Amsterdam, 1995.

    Google Scholar 

  5. George E.B., Smith M.J.T., “Speech Analysis/Synthesis and Modification Using an Analysis-by-Synthesis/Overlap-Add Sinusoidal Model”, IEEE Trans, on Speech and Audio Processing, Vol 5, No. 5, pp. 389–406, 1997.

    Article  Google Scholar 

  6. Stylianou Y., „Applying the Harmonic Plus Noise Mode in Concatenative Speech Synthesis“ IEEE Trans, on Speech and Audio Processing, Vol. 9, No 1., pp. 21–29, 2001.

    Article  Google Scholar 

  7. Griffin D.W., Lim J.S., „Multiband Excitation Vocoder“, IEEE Trans, on Acoust., Speech and Signal Processing, Vol. ASSP-36, pp. 1223–1235, 1988.

    Article  Google Scholar 

  8. B. Yegnanarayana, C. d'Alessandro, V. Darsions, “An Iterative Algorithm for Decomposiiton of Speech Signals into Periodic and Aperiodic Components”, IEEE Trans. On Speech and Audio Coding, Vol. 6, No. 1, pp. 1–11, 1998.

    Article  Google Scholar 

  9. Jackson P.J.B., Shadle C.H., “Pitch-Scaled Estimation of Simultaneous Voiced and Turbulence-Noise Components in Speech”, IEEE Trans. On Speech and Audio Processing, Vol. 9, No. 7, pp. 713–726, 2001

    Article  Google Scholar 

  10. Sercov V., Petrovsky A., „An Improved Speech Model with Allowance for Time-Varying Pitch Harmonic Amplitudes and Frequencies in Low Bit-Rate MBE Coders”, Proc. of the 6ht European Conf. on Speech Communication and Technology EUROSPEECH'99, pp. 1479–1482 Budapest, Hungary, 1999.

    Google Scholar 

  11. Petrovsky A., Sercov V., “Low Bit-Rate AbS Spectral Coding Based on the Harmonic Analysis of Speech Agreed Upon with Time-Varying Pitch Frequency and Psychoacoustical Optimization”, Proc. of Nordic Signal Processing Symposium NORSIG2000, pp. 45–48, 2000.

    Google Scholar 

  12. Petrovsky A., Zubrycki P., Sawicki A., Tonal and noise components separation based on a pitch synchronous DFT analyzer as a speech coding method // Proc. of European Conference on Circuit Theory and Devices ECCTD2003, Vol. III, pp. 169–172, 2003.

    Google Scholar 

  13. Eric W. M. Yu, Cheung-Fat Chan, A harmonic+noise coder with improved transient speech performance // Proc. of European Signal Processing Conference EUSIPCO'99, Special Session “Speech Coding”, 1999.

    Google Scholar 

  14. Sondhi M.M., New Methods of Pitch Extraction, IEEE Trans, on Audio and Electroacoustics, Vol. AU-16, No. 2, pp. 262–266, 1968.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer Science+Business Media, Inc.

About this paper

Cite this paper

Zubrycki, P., Petrovsky, A.A. (2005). Analysis/Synthesis Speech Model Based on the Pitch-Tracking Periodic-Aperiodic Decomposition. In: Saeed, K., Pejaś, J. (eds) Information Processing and Security Systems. Springer, Boston, MA. https://doi.org/10.1007/0-387-26325-X_4

Download citation

  • DOI: https://doi.org/10.1007/0-387-26325-X_4

  • Publisher Name: Springer, Boston, MA

  • Print ISBN: 978-0-387-25091-5

  • Online ISBN: 978-0-387-26325-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics