Abstract
While current speech recognisers give acceptable performance in carefully controlled environments, their performance degrades rapidly when they are applied in more realistic situations. Generally, the environmental noise may be classified into two classes: the wide-band noise and narrow band noise. While the multi-band model has been shown to be capable of dealing with speech corrupted by narrow-band noise, it is ineffective for wide-band noise. In this paper, we suggest a combination of the frequency-filtering technique with the probabilistic union model in the multi-band approach. The new system has been tested on the TIDIGITS database, corrupted by white noise, noise collected from a railway station, and narrow-band noise, respectively. The results have shown that this approach is capable of dealing with noise of narrow-band or wide-band characteristics, assuming no knowledge about the noisy environment.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Tibrewala, S., Hermansky, H., Sub-band Based Recognition of Noisy Speech, Proc. ICASSP’ 97, Munich, Germany, 1997, pp. 1255–1258.
Bourlard, H., Non-Stationary Multi-Channel (Multi-Stream) Processing Towards Robust and Adaptive ASR, Proceedings of Robust Methods for Speech Recognition in Adverse Conditions, Tampere, Finland, 1999, pp. 1–10.
Ming, J., Smith, J., Union: A New Approach for Combining Sub-band Observations for Noisy Speech Recognition, Proceedings of Robust Methods for Speech Recognition in Adverse Conditions, Tampere, Finland, 1999, pp. 175–178.
Nadeu, C., Hernando, J., Gorricho, M., On the Decorrelation of the Filter-Bank Energies in Speech Recognition, Proc. Eurospeech’ 95, pp. 1381–1384.
Macho, D., Nadeu, C.: On the Interaction between Time and Frequency Filtering of Speech Parameters for Robust Speech Recognition, Proc. ICSLP’ 98, pp. 1487–1490.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2000 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Jančovič, P., Ming, J., Hanna, P., Stewart, D., Smith, J. (2000). Combining Multi-band and Frequency-Filtering Techniques for Speech Recognition in Noisy Environments. In: Sojka, P., Kopeček, I., Pala, K. (eds) Text, Speech and Dialogue. TSD 2000. Lecture Notes in Computer Science(), vol 1902. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45323-7_45
Download citation
DOI: https://doi.org/10.1007/3-540-45323-7_45
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-41042-3
Online ISBN: 978-3-540-45323-9
eBook Packages: Springer Book Archive