Analysis/Synthesis of Speech Signals Based on AbS/OLA Sinusoidal Modeling Using Elliptic Filter

Kim, Kihong; Hong, Jinkeun; Lim, Jongin

doi:10.1007/11508069_19

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 3578)

Included in the following conference series:

International Conference on Intelligent Data Engineering and Automated Learning

Abstract

The analysis-by-synthesis/overlap-add (AbS/OLA) sinusoidal model has been applied to a broad range of speech and audio signal processing tasks, such as coding, analysis and synthesis, fundamental frequency modification, and time- and frequency-scale modification. This model uses an iterative analysis-by-synthesis procedure to estimate the sinusoidal parameters (amplitudes, frequencies, and phases). However, one drawback of this model is that the analysis frame length is generally fixed. As a result, since the sinusoidal components have different frequencies, an analysis frame of fixed length cannot provide an optimal spectral resolution for each component. In this paper, to overcome this drawback and to estimate the sinusoidal parameters more accurately, an AbS/OLA sinusoidal model using an elliptic filter is presented and evaluated against the conventional AbS/OLA sinusoidal model. The proposed model is found to achieve better performance than the conventional model in terms of spectral characteristics, phase characteristics, and synthetic speech quality.
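
As an illustration of the iterative analysis-by-synthesis idea mentioned in the abstract, the following is a minimal Python sketch, not the authors' implementation. It assumes a single fixed-length frame, an unweighted squared-error criterion, and a coarse frequency grid search; the function names (fit_sinusoid, abs_analyze) and the grid_size parameter are hypothetical. Each iteration picks the sinusoid that removes the most energy from the residual and then subtracts it. The elliptic filter stage of the proposed model is not sketched here, because the abstract does not describe how the filter is applied.

```python
import math


def fit_sinusoid(residual, omega):
    """Least-squares fit of A*cos(omega*n + phi) to the residual.

    Returns (amplitude, phase, energy_reduction).
    """
    cc = cs = ss = rc = rs = 0.0
    for n, r in enumerate(residual):
        c = math.cos(omega * n)
        s = math.sin(omega * n)
        cc += c * c
        cs += c * s
        ss += s * s
        rc += r * c
        rs += r * s
    det = cc * ss - cs * cs
    if abs(det) < 1e-12:
        return 0.0, 0.0, 0.0
    # Solve the 2x2 normal equations for a*cos(omega*n) + b*sin(omega*n).
    a = (rc * ss - rs * cs) / det
    b = (rs * cc - rc * cs) / det
    # a*cos + b*sin == A*cos(omega*n + phi) with a = A*cos(phi), b = -A*sin(phi).
    amplitude = math.hypot(a, b)
    phase = math.atan2(-b, a)
    # Energy removed from the residual by subtracting the fitted component.
    reduction = a * rc + b * rs
    return amplitude, phase, reduction


def abs_analyze(frame, n_components, grid_size=256):
    """Greedy analysis-by-synthesis over a uniform frequency grid (assumption).

    Returns a list of (amplitude, omega, phase) and the final residual.
    """
    residual = list(frame)
    params = []
    for _ in range(n_components):
        best = None
        for k in range(1, grid_size):
            omega = math.pi * k / grid_size
            amplitude, phase, reduction = fit_sinusoid(residual, omega)
            if best is None or reduction > best[3]:
                best = (amplitude, omega, phase, reduction)
        amplitude, omega, phase, _ = best
        params.append((amplitude, omega, phase))
        # Synthesize the chosen component and remove it from the residual.
        residual = [
            r - amplitude * math.cos(omega * n + phase)
            for n, r in enumerate(residual)
        ]
    return params, residual


# Synthetic test frame: two sinusoids placed on the search grid.
FRAME_LEN = 128
W1 = math.pi * 20 / 256
W2 = math.pi * 90 / 256
frame = [
    1.0 * math.cos(W1 * n + 0.3) + 0.5 * math.cos(W2 * n - 1.0)
    for n in range(FRAME_LEN)
]
params, residual = abs_analyze(frame, n_components=2)
```

In this sketch the frame length is fixed at 128 samples for every component, which is the limitation the abstract points to: a single frame length sets one spectral resolution for all components regardless of their frequency.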

Author information

Authors and Affiliations

Graduate School of Information Security, Korea University, 1, 5-Ka, Anam-dong, Sungbuk-ku, Seoul, 136-701, Korea
Kihong Kim & Jongin Lim
Division of Information & Communication, Cheonan University, 115 Anseo-dong, Cheonan-si, Chungnam, 330-704, Korea
Jinkeun Hong

Editor information

Editors and Affiliations

School of Information Technology and Electrical Engineering, University of Queensland, 4072, Australia
Marcus Gallagher
POB 30031, Pensacola, FL 32503-1031
James P. Hogan
Faculty of Information Technology, Queensland University of Technology, Box 2434, Q 4001, Brisbane, Australia
Frederic Maire

About this paper

Cite this paper

Kim, K., Hong, J., Lim, J. (2005). Analysis/Synthesis of Speech Signals Based on AbS/OLA Sinusoidal Modeling Using Elliptic Filter. In: Gallagher, M., Hogan, J.P., Maire, F. (eds) Intelligent Data Engineering and Automated Learning - IDEAL 2005. IDEAL 2005. Lecture Notes in Computer Science, vol 3578. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11508069_19

DOI: https://doi.org/10.1007/11508069_19
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-26972-4
Online ISBN: 978-3-540-31693-0
eBook Packages: Computer Science (R0)
