Skip to main content

Analysis/Synthesis of Speech Signals Based on AbS/OLA Sinusoidal Modeling Using Elliptic Filter

  • Conference paper
Intelligent Data Engineering and Automated Learning - IDEAL 2005 (IDEAL 2005)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 3578))

  • 1342 Accesses

Abstract

The analysis-by-synthesis/overlap-add (AbS/OLA) sinusoidal model has been applied to a broad range of speech and audio signal processing, such as coding, analysis and synthesis, fundamental frequency modification, time and frequency scale modification. This model uses an iterative analysis-by-synthesis procedure to estimate the sinusoidal parameters {amplitudes, frequencies, and phases}. However, one drawback of this model is that the analysis frame length is generally fixed in analyzing the signal. As a result, since each sinusoidal parameter has different frequencies, an analysis frame with fixed length cannot an optimal spectral resolution to each sinusoidal parameter. In this paper, in order to overcome this drawback and to estimate sinusoidal parameter more accurately, an AbS/OLA sinusoidal model using an elliptic filter is presented and evaluated against the performance of conventional AbS/OLA sinusoidal model. Our proposed AbS/OLA sinusoidal model is found to achieve better performance, in terms of spectral characteristics, phase characteristics, and the synthetic speech quality, than conventional model.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. George, E.B., Smith, M.J.T.: Speech Analysis/Synthesis and Modification Using an Analysis-by-Synthesis/Overlap-Add Sinusoidal Model. IEEE Trans. on SAP 5, 389–406 (1997)

    Google Scholar 

  2. Furui, S., Sondhi, M.M.: Advances in Speech Signal Processing. Dekker Inc., NY (1992)

    Google Scholar 

  3. Kleijn, W.B., Paliwal, K.K.: Speech Coding and Synthesis. Elsevier, Amsterdam (1995)

    Google Scholar 

  4. George, E.B.: Practical High-Quality Speech and Voice Synthesis Using Fixed Frame Rate AbS/OLA sinusoidal modeling. In: IEEE ICASSP, pp. 301–304 (1998)

    Google Scholar 

  5. Anderson, D.V.: Speech Analysis and Coding Using a Multi-Resolution Sinusoidal Transform. In: IEEE ICASSP, pp. 1037–1040 (1996)

    Google Scholar 

  6. Rodriguez-Hernandez, M., Casajus-Quiros, F.: Improving Time-Scale Modification of Audio Signals Using Wavelets. In: IEEE ICASSP, pp. 1573–1577 (1994)

    Google Scholar 

  7. Goodwin, M.: Multiresolution sinusoidal Modeling Using Adaptive Segmentation. In: IEEE ICASSP, pp. 1525–1528 (1998)

    Google Scholar 

  8. Kim, K.H., Hwang, I.H.: A Multi-Resolution Sinusoidal Model Using Adaptive Analysis Frame. In: EURASIP EUSIPCO, pp. 2267–2270 (2004)

    Google Scholar 

  9. Goodwin, M., Vetterli, M.: Time-Frequency Models for Music Analysis, Transformation, and Synthesis. In: Time-Frequency Time-Scale Symposium (1996)

    Google Scholar 

  10. Parks, T.W., Burrus, C.S.: Digital Filter Design. John Wiley & Sons, NY (1987)

    MATH  Google Scholar 

  11. Corral, C.A.: Designing Elliptic Filters with Maximum Selectivity, http://www.web-ee.com/primers/files/Elliptical_Filters.pdf

  12. McAulay, R.J., Quatieri, T.F.: Speech Analysis/Synthesis Based on Sinusoidal Representation. IEEE Trans. on SAP 34, 744–754 (1986)

    Article  Google Scholar 

  13. ITU-T Rec. P.862, Perceptual Evaluation of Speech Quality (PESQ) an Objective Assessment of Narrowband Telephone Networks and Speech Code (2002)

    Google Scholar 

  14. Rix, W., et al.: Perceptual Evaluation of Speech Quality(PESQ) – a New Method for Speech Quality Assessment of Telephone Networks and Code. In: ASSP, pp. 749–752 (2001)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Kim, K., Hong, J., Lim, J. (2005). Analysis/Synthesis of Speech Signals Based on AbS/OLA Sinusoidal Modeling Using Elliptic Filter. In: Gallagher, M., Hogan, J.P., Maire, F. (eds) Intelligent Data Engineering and Automated Learning - IDEAL 2005. IDEAL 2005. Lecture Notes in Computer Science, vol 3578. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11508069_19

Download citation

  • DOI: https://doi.org/10.1007/11508069_19

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-26972-4

  • Online ISBN: 978-3-540-31693-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics