Abstract
Blind source separation and speech dereverberation are two important and common issues in the field of audio processing especially in the context of real meetings. In this paper a real time framework implementing a sequential source separation and speech dereverberation algorithm based on blind channel identification is taken as starting point. The major drawback of this approach consists in the inability of the BCI stage of estimating the room impulse responses when two or more sources are concurrently active. To overcome the aforementioned disadvantage a speaker diarization system have been successfully inserted in the reference framework to pilot the BCI stage. In such a way the identification task can be accomplished by using directly the microphone mixture making the overall structure well suited for real-time applications. The proposed solution works in frequency domain and the NU-Tech software platform has been used on purpose for real-time simulations.
Keywords
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Huang, Y., Benesty, J., Chen, J.: A blind channel identification-based two-stage approach to separation and dereverberation of speech signals in a reverberant environment. IEEE Transactions on Speech and Audio Processing 13(5), 882–895 (2005)
Rotili, R., De Simone, C., Perelli, A., Cifani, S., Squartini, S.: Joint Multichannel Blind Speech Separation and Dereverberation: A Real-Time Algorithmic Implementation. In: Huang, D.-S., McGinnity, M., Heutte, L., Zhang, X.-P. (eds.) ICIC 2010. Communications in Computer and Information Science, vol. 93, pp. 85–93. Springer, Heidelberg (2010)
Squartini, S., Ciavattini, E., Lattanzi, A., Zallocco, D., Bettarelli, F., Piazza, F.: Nu-tech:implementing dsp algorithms in a plug-in based software platform for real time audio applications. In: Proceedings of 118th Convention of the Audio Engineering Society (2005)
Araki, S., Fujimoto, M., Ishizuka, K., Sawada, H., Makino, S.: A doa based speaker diarization system for real meetings. In: Hands-Free Speech Communication and Microphone Arrays, HSCMA 2008, pp. 29–32 (May 2008)
Moulines, E., Duhamel, P., Cardoso, J., Mayrargue, S.: Subspace methods for the blind identification of multichannel FIR filters. Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 1994, pp. IV/573–IV/576 (1994)
Huang, Y., Benesty, J.: A class of frequency-domain adaptive approaches to blind multichannel identification. IEEE Transactions on Signal Processing 51(1), 11–24 (2003)
Rotili, R., Cifani, S., Principi, E., Squartini, S., Piazza, F.: A robust iterative inverse filtering approach for speech dereverberation in presence of disturbances. In: APCCAS 2008 - 2008 IEEE Asia Pacific Conference on Circuits and Systems, pp. 434–437 (2008)
Habets, E.A.P.: Room impulse response (RIR) generator (May 2008), http://home.tiscali.nl/ehabets/rirgenerator.html
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Rotili, R., Principi, E., Squartini, S., Piazza, F. (2011). Real-Time Joint Blind Speech Separation and Dereverberation in Presence of Overlapping Speakers. In: Liu, D., Zhang, H., Polycarpou, M., Alippi, C., He, H. (eds) Advances in Neural Networks – ISNN 2011. ISNN 2011. Lecture Notes in Computer Science, vol 6676. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-21090-7_52
Download citation
DOI: https://doi.org/10.1007/978-3-642-21090-7_52
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-21089-1
Online ISBN: 978-3-642-21090-7
eBook Packages: Computer ScienceComputer Science (R0)