Abstract
In automatic speech and speech emotion recognition, a good quality of input speech signal is often required. The hit rate of recognizers is lowered by degradation of speech quality due to noise. Blind source separation can be used to enhance the speech signal as a part of preprocessing techniques. This paper presents a multi channel linear blind source separation method that can be applied even in underdetermined case i.e. when the number of source signals is higher than the number of sensors. Experiments have shown that our system outperforms conventional time-frequency binary masking in both determined and underdetermined cases and significantly increases the hit rate of speech recognizers.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Hyvarinen, A., Oja, E.: Independent Component Analysis: Algorithms and Applications. Neural Networks 13(4-5), 411–430 (2000)
Johnson, D., Dungeon, D.: Array Signal Processing. Prentice Hall, Englewood Cliffs (1993)
Yilmaz, O., Rickard, S.: Blind Separation of Speech Mixtures via Time-Frequency Masking. IEEE Transactions on Signal Processing 52(7) (2004)
Cermak, J., Araki, S., Sawada, H., Makino, S.: Blind Source Separation Based on a Beamformer Array and Time-Frequency Binary Masking. In: ICASSP 2007, vol. 1, pp. 145–148 (2007) ISBN 1–4244–0728–1
Perceptual Evaluation of Speech Quality (PESQ). ITU-T Recommendation P.862, http://www.itu.int/rec/T-REC-p
Nouza, J., Zdansky, J., Cerva, P., Kolorenc, J.: A System for Information Retrieval from Large Records of Broadcast Programs. In: Sojka, P., Kopeček, I., Pala, K. (eds.) TSD 2006. LNCS, vol. 4188, pp. 485–492. Springer, Heidelberg (2006)
Methods for Subjective Determination of Transmission Quality. ITU-T Recommendation P.800, http://www.itu.int/rec/T-REC-p
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Cermak, J., Smekal, Z. (2009). Underdetermined Blind Source Separation Using Linear Separation System. In: Esposito, A., Hussain, A., Marinaro, M., Martone, R. (eds) Multimodal Signals: Cognitive and Algorithmic Issues. Lecture Notes in Computer Science(), vol 5398. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-00525-1_30
Download citation
DOI: https://doi.org/10.1007/978-3-642-00525-1_30
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-00524-4
Online ISBN: 978-3-642-00525-1
eBook Packages: Computer ScienceComputer Science (R0)