Abstract
The analysis-by-synthesis/overlap-add (AbS/OLA) sinusoidal model has been applied to a broad range of speech and audio signal processing, such as coding, analysis and synthesis, fundamental frequency modification, time and frequency scale modification. This model uses an iterative analysis-by-synthesis procedure to estimate the sinusoidal parameters {amplitudes, frequencies, and phases}. However, one drawback of this model is that the analysis frame length is generally fixed in analyzing the signal. As a result, since each sinusoidal parameter has different frequencies, an analysis frame with fixed length cannot an optimal spectral resolution to each sinusoidal parameter. In this paper, in order to overcome this drawback and to estimate sinusoidal parameter more accurately, an AbS/OLA sinusoidal model using an elliptic filter is presented and evaluated against the performance of conventional AbS/OLA sinusoidal model. Our proposed AbS/OLA sinusoidal model is found to achieve better performance, in terms of spectral characteristics, phase characteristics, and the synthetic speech quality, than conventional model.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
George, E.B., Smith, M.J.T.: Speech Analysis/Synthesis and Modification Using an Analysis-by-Synthesis/Overlap-Add Sinusoidal Model. IEEE Trans. on SAP 5, 389–406 (1997)
Furui, S., Sondhi, M.M.: Advances in Speech Signal Processing. Dekker Inc., NY (1992)
Kleijn, W.B., Paliwal, K.K.: Speech Coding and Synthesis. Elsevier, Amsterdam (1995)
George, E.B.: Practical High-Quality Speech and Voice Synthesis Using Fixed Frame Rate AbS/OLA sinusoidal modeling. In: IEEE ICASSP, pp. 301–304 (1998)
Anderson, D.V.: Speech Analysis and Coding Using a Multi-Resolution Sinusoidal Transform. In: IEEE ICASSP, pp. 1037–1040 (1996)
Rodriguez-Hernandez, M., Casajus-Quiros, F.: Improving Time-Scale Modification of Audio Signals Using Wavelets. In: IEEE ICASSP, pp. 1573–1577 (1994)
Goodwin, M.: Multiresolution sinusoidal Modeling Using Adaptive Segmentation. In: IEEE ICASSP, pp. 1525–1528 (1998)
Kim, K.H., Hwang, I.H.: A Multi-Resolution Sinusoidal Model Using Adaptive Analysis Frame. In: EURASIP EUSIPCO, pp. 2267–2270 (2004)
Goodwin, M., Vetterli, M.: Time-Frequency Models for Music Analysis, Transformation, and Synthesis. In: Time-Frequency Time-Scale Symposium (1996)
Parks, T.W., Burrus, C.S.: Digital Filter Design. John Wiley & Sons, NY (1987)
Corral, C.A.: Designing Elliptic Filters with Maximum Selectivity, http://www.web-ee.com/primers/files/Elliptical_Filters.pdf
McAulay, R.J., Quatieri, T.F.: Speech Analysis/Synthesis Based on Sinusoidal Representation. IEEE Trans. on SAP 34, 744–754 (1986)
ITU-T Rec. P.862, Perceptual Evaluation of Speech Quality (PESQ) an Objective Assessment of Narrowband Telephone Networks and Speech Code (2002)
Rix, W., et al.: Perceptual Evaluation of Speech Quality(PESQ) – a New Method for Speech Quality Assessment of Telephone Networks and Code. In: ASSP, pp. 749–752 (2001)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kim, K., Hong, J., Lim, J. (2005). Analysis/Synthesis of Speech Signals Based on AbS/OLA Sinusoidal Modeling Using Elliptic Filter. In: Gallagher, M., Hogan, J.P., Maire, F. (eds) Intelligent Data Engineering and Automated Learning - IDEAL 2005. IDEAL 2005. Lecture Notes in Computer Science, vol 3578. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11508069_19
Download citation
DOI: https://doi.org/10.1007/11508069_19
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-26972-4
Online ISBN: 978-3-540-31693-0
eBook Packages: Computer ScienceComputer Science (R0)