Real-Time Bayesian Inference: A Soft Computing Approach to Environmental Learning for On-Line Robust Automatic Speech Recognition

Chowdhury, Md Foezur Rahman; Selouani, Sid-Ahmed; O’Shaughnessy, Douglas

doi:10.1007/978-3-642-19644-7_47

Md Foezur Rahman Chowdhury⁸,
Sid-Ahmed Selouani⁹ &
Douglas O’Shaughnessy⁸

Part of the book series: Advances in Intelligent and Soft Computing ((AINSC,volume 87))

1315 Accesses
1 Citations

Abstract

In this paper, we developed soft computing models for on-line automatic speech recognition (ASR) based on Bayesian on-line inference techniques.Bayesian on-line inference for change point detection (BOCPD) is tested for on-line environmental learning using highly non-stationary noisy speech samples from the Aurora2 speech database. Significant improvement in predicting and adapting to new acoustic conditions is obtained for highly non-stationary noises. The simulation results show that the Bayesian on-line inference-based soft computing approach would be one of the possible solutions to on-line ASR for real-time applications.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 259.00; Price excludes VAT (USA)

Softcover Book: USD 329.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Acero, A.: Acoustical and environmental robustness in automatic speech recognition. Kluwer Academic Publisher, Massachussets (1993)
Google Scholar
Adams, R.P., MacKay, D.J.C.: Bayesian on-line changepoint detection. University of Cambridge Technical Report arXiv:0710.3742v1[stat.ML] (2007)
Google Scholar
Chowdhury, M.F.R., Selouani, S.-A., O’Shaughnessy, D.: Bayesian On-Line Change Point Detection Approach For Rapid Adaptation of Highly Non-Stationary Noise Tracking Algorithm. Submitted for: IEEE Int. Conf. Acoustics,Speech, Signal Proc. 2011 (2010)
Google Scholar
Cohen, I.: Noise Estimation by Minima Controlled Recursive Averaging for Robust Speech Enhancement. IEEE Signal Processing Letters 9(1), 12–15 (2002)
Article Google Scholar
Cohen, I.: Noise spectrum estimation in adverse environments: improved minima controlled recursive averaging. IEEE Trans. Audio, Speech, Signal Proc. 2(5), 466–475 (2003)
Article Google Scholar
Fan, N.: Speech noise estimation using enhanced minima controlled recursive averaging. In: Proc. IEEE Int. Conf. Acoustics, Speech, Signal Proc., vol. 4, pp. 581–584 (2007)
Google Scholar
Hirsch, H.-G., Pearce, D.: The Aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions. In: Proceedings of ISCA ITRW ASR2000 Automatic Speech Recognition: Challenges for the Next Millennium, pp. 181–188 (2000)
Google Scholar
Li, J., Deng, L., Yu, D., et al.: A unified framework of HMM adaptation with joint compensation of additive and convolutive distortions. Computer Speech and Language 23, 389–405 (2009)
Article Google Scholar
O’Shaughnessy, D.: Speech Communications: Human and Machine. Wiley-IEEE Press (2000)
Google Scholar
Rangachari, S., Loizou, P.: A noise estimation algorithm for highly nonstationary environments. Speech Communication 28, 220–231 (2006)
Article Google Scholar
Turner, R.: Bayesian Change Point Detection for Satellite Fault Prediction. In: Proceedings of Interdisciplinary Graduate Conference (IGC), Cambridge, UK, pp. 213–221 (2010)
Google Scholar
Young S.: ATK An Application Toolkit for HTK. Machine Intelligence Laboratory, Cambridge University Engineering Dept, Cambridge, UK (2007), http://www.mi.eng.cam.ac.uk/research/dialogue/ATK_Manual.pdf (cited June 2007)

Download references

Author information

Authors and Affiliations

INRS-EMT, Université du Québec, Montréal, QC, Canada
Md Foezur Rahman Chowdhury & Douglas O’Shaughnessy
Université de Moncton, Campus de Shippagon, NB, Canada
Sid-Ahmed Selouani

Authors

Md Foezur Rahman Chowdhury
View author publications
You can also search for this author in PubMed Google Scholar
Sid-Ahmed Selouani
View author publications
You can also search for this author in PubMed Google Scholar
Douglas O’Shaughnessy
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Universidad de Salamanca, Plaza de la Merced S/N, 37008, Salamanca, Spain
Emilio Corchado
VŠB-TU Ostrava, 17. listopadu 15, 70833, Ostrava, Czech Republic
Václav Snášel
University of Burgos, Avenida Cantaria S/N, 09006, Burgos, Spain
Javier Sedano
Cairo University, 5 Ahmed Zewal St., Orman, Cairo, Egypt
Aboul Ella Hassanien
University of La Coruña, Avda. 19 de Febrero, S/N, A Coruña,, 15403, Ferrol, Spain
José Luis Calvo
Infobright, 47 Colborne Street, Suite 403, M5E1P8, Toronto, Ontario, Canada
Dominik Ślȩzak

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Chowdhury, M.F.R., Selouani, SA., O’Shaughnessy, D. (2011). Real-Time Bayesian Inference: A Soft Computing Approach to Environmental Learning for On-Line Robust Automatic Speech Recognition. In: Corchado, E., Snášel, V., Sedano, J., Hassanien, A.E., Calvo, J.L., Ślȩzak, D. (eds) Soft Computing Models in Industrial and Environmental Applications, 6th International Conference SOCO 2011. Advances in Intelligent and Soft Computing, vol 87. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-19644-7_47

Download citation

DOI: https://doi.org/10.1007/978-3-642-19644-7_47
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-19643-0
Online ISBN: 978-3-642-19644-7
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics