Abstract
This paper is devoted to the problem of speech signal separation from a set of observables, when the mixing system is underdetermined and static with unknown delays. The approaches appeared in the literature so far have shown that algorithms based on the property of sparsity of the original signals (effectively satisfied by speech sources) can be successfully applied to such a problem, specially if implemented in the time-frequency domain. Here, a survey on the usage of different time-frequency transforms within the already available three-step procedure for the addressed separation problem is carried out. The novelty of the contribution can be seen from this perspective: Wavelet, Complex Wavelet and Stockwell Transforms are the new transforms used in our problem, in substitution of the usual Short Time Fourier Transform (STFT). Their performances are analyzed and compared to those attainable through the STFT, evaluating how much different is the influence that their sparseness and spectral disjointness properties on the algorithm behavior.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Jutten, C., Hérault, J., Comon, P., Sorouchiary, E.: Blind Separation of Sources, Parts I, II and III. Signal Processing 24(1), 1–29 (1991)
Haykin, S.: Unsupervised Adaptive Filtering, vol. 1: Blind Source Separation. Wiley Series on Adaptive and Learning Systems for Signal Processing, Communications and Control, Simon Haykin Series Editor (2000)
Haykin, S.: Unsupervised Adaptive Filtering, vol. 2: Blind Deconvolution. Wiley Series on Adaptive and Learning Systems for Signal Processing, Communications and Control, Simon Haykin Series Editor (2000)
Cichocki, A., Amari, S.: Adaptive Blind Signal and Image Processing. In: Wiley (ed.) Learning Algorithms and Application (2002)
Lee, T.W., Lewicki, M.S., Girolami, M., Bell, A.J., Sejnowski, T.J.: Blind Source Separation of More Sources Using Overcomplete Representations. IEEE Signal Processing Letters 6(4), 87–90 (1999)
Hyvärinen, A., Karhunen, J., Oja, E.: Independent Component Analysis. Wiley & Sons, New York (2001)
Lee, T.W., Girolami, M., Bell, A.J., Sejnowski, T.J.: A Unifying Information Theoretic Framework for Independent Component Analysis. Computers & Mathematics with Applications 31(11), 1–21 (2000)
Theis, F.J., Lang, E.W.: Geometric Overcomplete ICA. In: Proc. of ESANN 2002, pp. 217–223 (2002)
Puntonet, G.C., Prieto, A., Jutten, C., Rodrìguez-Alvarez, M., Ortega, J.: Separation of Sources: a Geometry Based Procedure for Recostruction of n-Valued Signal. Elsevier Signal Processing 46(3), 267–284 (1995)
Bofill, P., Zibulevsky, M.: Blind Separation of More Sources than Mixtures Using the Sparsity of the Short-Time Fourier Transform. In: International Workshop on Independent Component Analysis and Blind Signal Separation, (Helsinki, Finland), pp. 87–92 (June 2000)
Bell, A.J., Sejnovsky, T.J.: An Information-Maximization Approach to Blind Separation and Blind Deconvolution. Neural Comput. 7, 1129–1159 (1995)
Cardoso, J.F.: Informax and Maximum Likelihood for Blind Source Separation. IEEE Sign. Process: Letters 4, 109–111 (1997)
Amari, S.: Natural Gradient Learning for Over- and Under-Complete Bases in ICA. Neural Computation 11(8), 1875–1883 (1999)
Theis, F.J., Lang, E.W., Lautenschlager, M.A., Puntonet, C.G.: A Theoretical Framework for Overcomplete Geometric BMMR. In: Proc. of SIP 2002, pp. 201–206 (2002)
Zibulevsky, M., Kisilev, P., Zeevi, Y.Y., Pearlmutter, B.A.: Blind Source Separation via Multinode Sparse Representation. In: Leen, T.K., Dietterich, T.G., Tresp, V. (eds.) Advances in Neural Information Processing Systems, vol. 13, MIT Press, Cambridge (2001)
Theis, F.J., Lang, E.W.: Formalization of the Two-Step Approach to Overcomplete BSS. SIP (2002)
Bofill, P.: Underdetermined Blind Separation of Delayed Sound Sources in the Frequency Domain. Neurocomputing, Special Issue ICA and BSS (March 2, 2001)
Lobo, M.S., Vandenberghe, L., Boyd, S., Lebret, H.: Applications of Second Order Cone Programming. Linear Algebra and Its Applications 284, 193–228 (1998)
Yilmaz, O., Rickard, S.: Blind Separation of Speech Mixtures via Time-Frequency Masking. IEEE Transaction on Signal Processing 52(7) (July 2004)
Daubechies, I.: Ten Lectures on Wavelets. Society for Industrial and Applied Mathematics, Philadelphia (1992)
Vetterli, M., Kovačević, J.: Wavelets and Subband Coding. Prentice Hall, Englewood Cliffs (1995)
Mallat, S.G.: A Wavelet Tour of Signal Processing. Academic Press, London (1998)
Mallat, S.G.: A Theory for Multiresolution Signal Decomposition: The Wavelet Representation. IEEE Transactions on Pattern Analysis and Machine Intelligence 11(7) (1989)
Coifman, R.R., Wickerhauser, M.V.: Entropy-Based Algorithms for Best Basis Selection. IEEE Trans. Inform. Theory 38(2)
Stockwell, R.G., Mansinha, L., Lowe, R.P.: Localization of the Complex Spectrum: The S Transform. IEEE Trans. Signal Process 44, 998–1001 (1996)
Daubechies, I.: The Wavelet Transform, Time Frequency Localization and Signal Analysis. IEEE Transactions on Information Theory 36(5) (September 1990)
Kingsbury, N.G.: The Dual Tree Complex Wavelet Transform: a New Technique for Shift Invariance and Directional Filters. In: Proc. IEEE DSP Workshop Bryce Canyon (August 1998)
Sawada, H., Mukai, R., Araki, S., Makino, S.: Convolutive Blind Source Separation for More Than Two Sources in the Frequency Domain. In: ICASSP 2004
Winter, S., Sawada, H., Araki, S., Makino, S.: Hierarchical Clustering Applied to Overcomplete BSS for Convolutive Mixtures. In: Workshop on Statistical and Perceptual Audio Processing SAPA 2004, Jeju Korea, October 3 (2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Bastari, A., Squartini, S., Piazza, F. (2005). Underdetermined Blind Separation of Speech Signals with Delays in Different Time-Frequency Domains. In: Chollet, G., Esposito, A., Faundez-Zanuy, M., Marinaro, M. (eds) Nonlinear Speech Modeling and Applications. NN 2004. Lecture Notes in Computer Science(), vol 3445. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11520153_7
Download citation
DOI: https://doi.org/10.1007/11520153_7
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-27441-4
Online ISBN: 978-3-540-31886-6
eBook Packages: Computer ScienceComputer Science (R0)