Signal Enhancement for Continuous Speech Recognition

Athanaselis, Theologos; Fotinea, Stavroula-Evita; Bakamidis, Stelios; Dologlou, Ioannis; Giannopoulos, Georgios

doi:10.1007/3-540-44989-2_133

Theologos Athanaselis⁷,
Stavroula-Evita Fotinea⁷,
Stelios Bakamidis⁷,
Ioannis Dologlou⁷ &
…
Georgios Giannopoulos⁷

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2714))

Included in the following conference series:

1587 Accesses
3 Citations

Abstract

This paper presents a comparison between two parametric methods for Signal Enhancement in order to address the problem of robust Automatic Speech Recognition (ASR). An SVD–based technique (ISE) and a non-linear spectral subtraction method (NSS), have been evaluated by means of the Continuous Speech Recognition system that is used in the ERMIS project. The input signal is corrupted with coloured noise with variable signal-to-noise ratio. It was found that fine-tuning of the various parameters of the enhancement techniques is crucial for efficient optimisation of their performance. Both methods provide significant improvement of the speech recogniser performance in the presence of coloured noise, with the NSS method being slightly better.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Boersma, P.: Accurate short-term analysis of the fundamental frequency and the harmonicsto-noise ratio of a sampled sound. Proceedings of the Institute of Phonetic Sciences 17 (1993) 97–110
Google Scholar
Dendrinos, M., Bakamidis, S., Carayannis, G.: Speech enhancement from noise: A regenerative approach. Speech Communication, Vol. 10,no.2, February (1991) 45–57
Article Google Scholar
Doclo, S., Dologlou, I., Moonen, M.: A novel iterative signal enhancement algorithm for noise reduction in speech, Proceedings of ICSLP-98, Sydney, Australia, (1998) 1435–1439
Google Scholar
Kyriakou, C., Bakamidis, S., Dologlou, I,, Carayannis, G.: Robust Continuous Speech Recognition in the Presence of Coloured Noise., Proceedings of 4th European Conference on Noise Control EURONOISE2001, Vol. 2, Patra, January 14-17 (2001) 702–705
Google Scholar
Pellom, B. L., Hansen, J.H.L.: Voice Analysis in Adverse Conditions: The Centennial Olympic Park Bombing 911 Call, Proceedings of IEEE Midwest Symposium on Circuits & Systems, August (1997) 125–128
Google Scholar
Uhl, C., and Leib, M.: Experiments with an Extended Adaptive SVD Enhancement Scheme for Speech Enhancement, Proceedings of IEEE ICASSP, Vol. 1, Salt Lake City, Utah, USA, May (2001) 281–284
Google Scholar

Download references

Author information

Authors and Affiliations

Institute for Language and Speech processing (ILSP), 6, Artemidos str. & Epidavrou, Paradissos Amaroussiou, 151 25, Greece
Theologos Athanaselis, Stavroula-Evita Fotinea, Stelios Bakamidis, Ioannis Dologlou & Georgios Giannopoulos

Authors

Theologos Athanaselis
View author publications
You can also search for this author in PubMed Google Scholar
Stavroula-Evita Fotinea
View author publications
You can also search for this author in PubMed Google Scholar
Stelios Bakamidis
View author publications
You can also search for this author in PubMed Google Scholar
Ioannis Dologlou
View author publications
You can also search for this author in PubMed Google Scholar
Georgios Giannopoulos
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Bogazici University, Bebek, 34342, Istanbul, Turkey
Okyay Kaynak & Ethem Alpaydin &
Laboratory of Computer and Information Science, Helsinki University of Technology, P.O.B. 5400, 02015, Finland
Erkki Oja
Department of Computer Science and Engineering, The Chinese University of Hong Kong, Shatin, Hong Kong
Lei Xu

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Athanaselis, T., Fotinea, SE., Bakamidis, S., Dologlou, I., Giannopoulos, G. (2003). Signal Enhancement for Continuous Speech Recognition. In: Kaynak, O., Alpaydin, E., Oja, E., Xu, L. (eds) Artificial Neural Networks and Neural Information Processing — ICANN/ICONIP 2003. ICANN ICONIP 2003 2003. Lecture Notes in Computer Science, vol 2714. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44989-2_133

Download citation

DOI: https://doi.org/10.1007/3-540-44989-2_133
Published: 18 June 2003
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40408-8
Online ISBN: 978-3-540-44989-8
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics