Least Squares Filtering of Speech Signals for Robust ASR

Tyagi, Vivek; Wellekens, Christian

doi:10.1007/11677482_23

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 3869)

Included in the following conference series:

International Workshop on Machine Learning for Multimodal Interaction

Abstract

The behavior of the least squares filter (LeSF) is analyzed for a class of non-stationary signals composed of multiple sinusoids embedded in white noise, whose frequencies, phases, and amplitudes may vary from block to block. Analytic expressions for the weights and the output of the LeSF are derived as functions of the block length and the signal SNR computed over the corresponding block. Recognizing that such a sinusoidal model is a valid approximation to speech signals, we use an LeSF estimated on each block to enhance speech signals embedded in white noise. Automatic speech recognition (ASR) experiments on a connected numbers task, OGI Numbers95 [20], show that the proposed LeSF-based features yield an increase in speech recognition performance in various non-stationary noise conditions when compared directly to the un-enhanced speech and to noise-robust JRASTA-PLP features.
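
The block-wise filtering described in the abstract can be illustrated with a minimal sketch. The derivation itself is not part of this preview, so the structure below is an assumption: a linear predictor that estimates each sample from delayed past samples, with weights fitted by ordinary least squares on a single block. The names `lesf_block`, `order`, and `delay`, and the synthetic two-sinusoid test signal, are hypothetical and chosen only for illustration.

```python
import numpy as np


def lesf_block(x, order=8, delay=1):
    """Fit a least squares linear predictor on one block (illustrative only).

    Predicts x[n] from x[n-delay], ..., x[n-delay-order+1] and returns the
    fitted weights together with the filter output for the block.
    """
    x = np.asarray(x, dtype=float)
    start = delay + order - 1  # first sample index with a full history
    # Column k holds x[n - delay - k] for n = start .. len(x) - 1
    X = np.column_stack(
        [x[start - delay - k: len(x) - delay - k] for k in range(order)]
    )
    d = x[start:]  # desired signal: the current (noisy) samples
    w, *_ = np.linalg.lstsq(X, d, rcond=None)
    y = np.zeros_like(x)
    y[start:] = X @ w  # samples before `start` are left at zero
    return w, y


# Synthetic block: two sinusoids in white noise (assumed test signal)
rng = np.random.default_rng(0)
n = np.arange(400)
clean = np.sin(2 * np.pi * 0.05 * n) + 0.5 * np.sin(2 * np.pi * 0.12 * n + 0.3)
noisy = clean + 0.5 * rng.standard_normal(n.size)

order = 32
w, enhanced = lesf_block(noisy, order=order, delay=1)

# Mean squared error against the clean signal, before and after filtering
start = order  # delay + order - 1 with delay = 1
err_in = np.mean((noisy[start:] - clean[start:]) ** 2)
err_out = np.mean((enhanced[start:] - clean[start:]) ** 2)
print(f"MSE noisy: {err_in:.3f}, MSE filtered: {err_out:.3f}")
```

Because white noise is uncorrelated across samples while sinusoids are predictable from their past, a predictor of this kind tends to pass the sinusoidal components and attenuate the noise. For a non-stationary signal such as speech, the same fit would be repeated on each block.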

References

Satorius, E., Zeidler, J., Alexander, S.: Linear predictive digital filtering of narrowband processes in additive broad-band noise. Naval Ocean Systems Center, San Diego, CA, Tech. Rep. 331 (November 1978)
Anderson, C.M., Satorius, E.H., Zeidler, J.R.: Adaptive Enhancement of Finite Bandwidth Signals in White Gaussian Noise. IEEE Trans. on ASSP ASSP-31(1) (February 1983)
Zeidler, J.R., Satorius, E.H., Chabries, D.M., Wexler, H.T.: Adaptive Enhancement of Multiple Sinusoids in Uncorrelated Noise. IEEE Trans. on ASSP ASSP-26(3) (June 1978)
Hermansky, H., Morgan, N.: Rasta Processing of Speech. IEEE Trans. on SAP 2(4) (October 1994)
Sambur, M.R.: Adaptive noise canceling for Speech signals. IEEE Trans. on ASSP ASSP-26(5) (October 1978)
Haykin, S.: Adaptive Filter Theory. Prentice-Hall Publishers, NJ (1993)
McAulay, R.J., Quatieri, T.F.: Speech Analysis/Synthesis Based on a Sinusoidal Representation. IEEE Trans. on ASSP ASSP-34(4) (August 1986)
Widrow, B., et al.: Adaptive noise cancelling: Principles and applications. Proc. IEEE 63, 1692–1716 (December 1975)
Sondhi, M., Berkley, D.: Silencing echoes on the telephone network. Proc. of IEEE 68, 948–963 (August 1980)
Gersho, A.: Adaptive equalization of highly dispersive channels for data transmission. Bell Syst. Tech. J. 48, 55–70 (January 1969)
Satorius, E., Alexander, S.T.: Channel equalization using adaptive lattice algorithms. IEEE Trans. Commun. 27, 899–905 (June 1979)
Satorius, E., Pack, J.: Application of least squares lattice algorithms for adaptive equalization. IEEE Trans. on Commun. COM-29, 136–142 (February 1981)
Bershad, N., Feintuch, P., Reed, F., Fisher, B.: Tracking characteristics of the LMS adaptive line-enhancer - Response to a linear chirp signal in noise. IEEE Trans. on ASSP ASSP-28, 504–517 (October 1980)
Griffiths, L.J.: A simple adaptive algorithm for real time processing in antenna arrays. Proc. of IEEE 57, 1696–1704 (October 1969)
Frost, O.L.: An algorithm for linearly constrained adaptive array processing. Proc. of IEEE 60, 926–935 (August 1972)
Marple, L.: Efficient least squares FIR system identification. IEEE Trans. on ASSP ASSP-29, 62–73 (February 1981)
Davis, S.B., Mermelstein, P.: Comparison of Parametric Representations for Monosyllabic Word Recognition in Continuously Spoken Sentences. IEEE Trans. on ASSP ASSP-28(4) (August 1980)
Young, S., Odell, J., Ollason, D., Valtchev, V., Woodland, P.: The HTK Book. Cambridge University, Cambridge (1995)
Cole, R.A., Fanty, M., Lander, T.: Telephone speech corpus at CSLU. In: Proc. of ICSLP, Yokohama, Japan (1994)
Varga, A., Steeneken, H., Tomlinson, M., Jones, D.: The NOISEX-92 study on the effect of additive noise on automatic speech recognition. Technical report, DRA Speech Research Unit, Malvern, England (1992)

Author information

Authors and Affiliations

Institute Eurecom, Sophia-Antipolis, France
Vivek Tyagi & Christian Wellekens
Swiss Federal Institute of Technology (EPFL), Lausanne, Switzerland
Vivek Tyagi & Christian Wellekens

Editor information

Editors and Affiliations

University of Edinburgh, Edinburgh, Scotland
Steve Renals
IDIAP Research Institute, Martigny, Switzerland
Samy Bengio

Cite this paper

Tyagi, V., Wellekens, C. (2006). Least Squares Filtering of Speech Signals for Robust ASR. In: Renals, S., Bengio, S. (eds) Machine Learning for Multimodal Interaction. MLMI 2005. Lecture Notes in Computer Science, vol 3869. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11677482_23

DOI: https://doi.org/10.1007/11677482_23
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-32549-9
Online ISBN: 978-3-540-32550-5
eBook Packages: Computer Science, Computer Science (R0)
