Analysis and synthesis of speech using least Pth norm filter design

Gokhale, M. Y.; Khanduja, Daljeet Kaur

doi:10.1007/s10772-009-9035-7

Analysis and synthesis of speech using least Pth norm filter design

Published: 26 June 2009

Volume 11, pages 51–61, (2008)
Cite this article

International Journal of Speech Technology Aims and scope Submit manuscript

M. Y. Gokhale¹ &
Daljeet Kaur Khanduja²

86 Accesses
3 Citations
Explore all metrics

Abstract

In this paper we analyze the combination of speech and FIR filter design aspect to achieve good results in speech quality. A new approach in the time domain based on the least Pth norm is presented to extract maximum information that represents speech. The aim of this paper is to improve the perceived quality of speech through the introduction of least Pth norm algorithm that attenuates speech contaminated with noise. This approach relates to a filter bank structure and a method for filtering and separating an information signal into different bands, particularly for filtering and separation of speech signals. Then the desired signal is reconstructed from the independent components representing every band. This approach differs from the traditional approaches since no priori knowledge of the noise statistics is required, instead the noise signals are only assumed to have finite energy. Since the estimation criterion for the filter design is to minimize the worst possible amplification of the estimation error signal in terms of modeling errors and additive noise, this approach is highly robust and appropriate in practical speech analysis and synthesis. This paper presents a least Pth approach to the optimal design of FIR digital filter banks in the minimax sense for speech analysis and synthesis. The signal to noise ratio (SNR) of around 50–60 dB is achieved with various speech samples.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

Allen, J. (1982). Application of the short-time Fourier transform to speech processing and spectral analysis. In Proc. int. conf. on acoust. speech and sig. proc. (pp. 1012–1015).
Deller, J., Proakis, J., & Hansen, J. (1993). Discrete-time processing of speech signals. New York: Macmillan.
Google Scholar
Ephraim, Y. (1990). A minimum mean square error approach for speech enhancement. In Proc. IEEE ICASSP (pp. 829–832).
Flanagan, J. L. (1965). Speech analysis synthesis and perception. New York: Academic Press. p. 119.
Google Scholar
Gibson, J. D., Koo, B., & Gray, S. D. (1991). Filtering of colored noise for speech enhancement and coding. IEEE Transactions on Signal Processing, 39, 1732–1742.
Article Google Scholar
Griffin, D., & Lim, J. (1984). Signal estimation from modified short-time Fourier transform. IEEE Transactions on Acoustics, Speech and Signal Processing, 32(2), 236–243.
Article Google Scholar
Karam, J. (2007). Various speech processing techniques for speech compression and recognition. In Proceedings of World Academy of Science, Engineering and Technology, 26. ISSN 1307-6884.
Lim, J. S., & Oppenheim, A. V. (1978). All-pole modeling of degraded speech. IEEE Transactions on Acoustics, Speech and Signal Processing, 26, 197–210.
Article MATH Google Scholar
Nayebi, K., Barnwell, T. P., & Smith, M. J. T. (1992). Time domain filter bank analysis. A new design theory, 40(6).
Paliwal, K. K., & Alsteris, L. (2003). Usefulness of phase spectrum in human speech perception. In Euro speech 2003, Geneva.
Paliwal, K. K., & Basu, A. (1987). A speech enhancement method based on Kalman filtering. In Proc. IEEE ICASSP (pp. 177–180).
Pitsikalis, V., & Maragos, P. (2002). Speech analysis and feature extraction using chaotic models. In Proc. int’l conf. acoustics speech and signal processing (ICASSP-2002), Orlando, USA, May 2002 (pp. 533–536).
Rabiner, L. R., & Juang, B. H. (1993). Fundamentals of speech recognition. Prentice-Hall: Englewood Cliffs.
Google Scholar
W’Ojcicki, K. K., & Paliwal, K. K. (2007). Importance of the dynamic range of an analysis window function for phase-only and magnitude-only reconstruction of speech. In ICASSP 2007.

Download references

Author information

Authors and Affiliations

Department of Mathematics, Maharashtra Institute of Technology, Kothrud, Pune, 38, India
M. Y. Gokhale
Department of Mathematics, Sinhgad Academy of Engineering, Kondhwa, Pune, 48, India
Daljeet Kaur Khanduja

Authors

M. Y. Gokhale
View author publications
You can also search for this author in PubMed Google Scholar
Daljeet Kaur Khanduja
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Daljeet Kaur Khanduja.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Gokhale, M.Y., Khanduja, D.K. Analysis and synthesis of speech using least Pth norm filter design. Int J Speech Technol 11, 51–61 (2008). https://doi.org/10.1007/s10772-009-9035-7

Download citation

Received: 24 April 2009
Accepted: 08 June 2009
Published: 26 June 2009
Issue Date: March 2008
DOI: https://doi.org/10.1007/s10772-009-9035-7

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Analysis and synthesis of speech using least Pth norm filter design

Abstract

Access this article

Similar content being viewed by others

Comparative analysis of audio classification with MFCC and STFT features using machine learning techniques

Introduction to Acoustic Terminology and Signal Processing

Noise robust automatic speech recognition: review and analysis

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Analysis and synthesis of speech using least Pth norm filter design

Abstract

Access this article

Similar content being viewed by others

Comparative analysis of audio classification with MFCC and STFT features using machine learning techniques

Introduction to Acoustic Terminology and Signal Processing

Noise robust automatic speech recognition: review and analysis

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation