Separating Underdetermined Convolutive Speech Mixtures

Pedersen, Michael Syskind; Wang, DeLiang; Larsen, Jan; Kjems, Ulrik

doi:10.1007/11679363_84

Michael Syskind Pedersen^20,21,
DeLiang Wang²²,
Jan Larsen²⁰ &
…
Ulrik Kjems²¹

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 3889))

Included in the following conference series:

International Conference on Independent Component Analysis and Signal Separation

2396 Accesses
6 Citations

Abstract

A limitation in many source separation tasks is that the number of source signals has to be known in advance. Further, in order to achieve good performance, the number of sources cannot exceed the number of sensors. In many real-world applications these limitations are too restrictive. We propose a method for underdetermined blind source separation of convolutive mixtures. The proposed framework is applicable for separation of instantaneous as well as convolutive speech mixtures. It is possible to iteratively extract each speech signal from the mixture by combining blind source separation techniques with binary time-frequency masking. In the proposed method, the number of source signals is not assumed to be known in advance and the number of sources is not limited to the number of microphones. Our approach needs only two microphones and the separated sounds are maintained as stereo signals.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Underdetermined blind source separation technique based on speech features extraction

Article 25 August 2016

A New Sparse Blind Source Separation Method for Determined Linear Convolutive Mixtures in Time-Frequency Domain

Underdetermined blind source separation of speech mixtures unifying dictionary learning and sparse representation

Article 03 August 2021

References

Hyvärinen, A., Karhunen, J., Oja, E.: Independent Component Analysis. Wiley, Chichester (2001)
Book Google Scholar
Roman, N., Wang, D.L., Brown, G.J.: Speech segregation based on sound localization. J. Acoust. Soc. Amer. 114, 2236–2252 (2003)
Article Google Scholar
Yilmaz, O., Rickard, S.: Blind separation of speech mixtures via time-frequency masking. IEEE Trans. Signal Processing 52, 1830–1847 (2004)
Article MathSciNet Google Scholar
Wang, D.L., Brown, G.J.: Separation of speech from interfering sounds based on oscillatory correlation. IEEE Trans. Neural Networks 10, 684–697 (1999)
Article Google Scholar
Bregman, A.S.: Auditory Scene Analysis, 2nd edn. MIT Press, Cambridge (1990)
Google Scholar
Jourjine, A., Rickard, S., Yilmaz, O.: Blind separation of disjoint orthogonal signals: Demixing N sources from 2 mixtures. In: Proc. ICASSP, pp. 2985–2988 (2000)
Google Scholar
Roweis, S.: One microphone source separation. In: NIPS 2000, pp. 793–799 (2000)
Google Scholar
Hu, G., Wang, D.L.: Monaural speech segregation based on pitch tracking and amplitude modulation. IEEE Trans. Neural Networks 15, 1135–1150 (2004)
Article MathSciNet Google Scholar
Wang, D.L.: On ideal binary mask as the computational goal of auditory scene analysis. In: Divenyi, P. (ed.) Speech Separation by Humans and Machines, pp. 181–197. Kluwer, Norwell (2005)
Chapter Google Scholar
Araki, S., Makino, S., Sawada, H., Mukai, R.: Underdetermined blind separation of convolutive mixtures of speech with directivity pattern based mask and ICA. In: Puntonet, C.G., Prieto, A.G. (eds.) ICA 2004. LNCS, vol. 3195, pp. 898–905. Springer, Heidelberg (2004)
Chapter Google Scholar
Kolossa, D., Orglmeister, R.: Nonlinear postprocessing for blind speech separation. In: Puntonet, C.G., Prieto, A.G. (eds.) ICA 2004. LNCS, vol. 3195, pp. 832–839. Springer, Heidelberg (2004)
Chapter Google Scholar
Pedersen, M.S., Wang, D.L., Larsen, J., Kjems, U.: Overcomplete blind source separation by combining ICA and binary time-frequency masking. In: Proceedings of the MLSP workshop, Mystic, CT, USA (2005)
Google Scholar
Parra, L., Spence, C.: Convolutive blind separation of non-stationary sources. IEEE Trans. Speech and Audio Processing 8, 320–327 (2000)
Article Google Scholar
Büchler, M.C.: Algorithms for Sound Classification in Hearing Instruments. PhD thesis, Swiss Federal Institute of Technology, Zurich (2002)
Google Scholar
Allen, J.B., Berkley, D.A.: Image method for efficiently simulating small-room acoustics. J. Acoust. Soc. Amer. 65, 943–950 (1979)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Informatics and Mathematical Modelling, Technical University of Denmark, Richard Petersens Plads, Building 321, DK-2800, Kgs. Lyngby, Denmark
Michael Syskind Pedersen & Jan Larsen
Oticon A/S, Kongebakken 9, DK-2765, Smørum, Denmark
Michael Syskind Pedersen & Ulrik Kjems
Department of Computer Science and Engineering & Center for Cognitive Science, The Ohio State University, Columbus, OH, 43210-1277, USA
DeLiang Wang

Authors

Michael Syskind Pedersen
View author publications
You can also search for this author in PubMed Google Scholar
DeLiang Wang
View author publications
You can also search for this author in PubMed Google Scholar
Jan Larsen
View author publications
You can also search for this author in PubMed Google Scholar
Ulrik Kjems
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Siemens Corporate Research, 755 College Road East, 08540, Princeton, NJ, USA
Justinian Rosca
Department of CSEE, Oregon Health and Science University, Portland, Oregon, USA
Deniz Erdogmus
Dep. of Electrical and Computer Engineering, University of Florida, Gainesville, Florida, USA
José C. Príncipe
McMaster University, 1280 Main Street West, L8S 4K1, Hamilton, Ontario, Canada
Simon Haykin

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Pedersen, M.S., Wang, D., Larsen, J., Kjems, U. (2006). Separating Underdetermined Convolutive Speech Mixtures. In: Rosca, J., Erdogmus, D., Príncipe, J.C., Haykin, S. (eds) Independent Component Analysis and Blind Signal Separation. ICA 2006. Lecture Notes in Computer Science, vol 3889. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11679363_84

Download citation

DOI: https://doi.org/10.1007/11679363_84
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-32630-4
Online ISBN: 978-3-540-32631-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Separating Underdetermined Convolutive Speech Mixtures

Abstract

Access this chapter

Preview

Similar content being viewed by others

Underdetermined blind source separation technique based on speech features extraction

A New Sparse Blind Source Separation Method for Determined Linear Convolutive Mixtures in Time-Frequency Domain

Underdetermined blind source separation of speech mixtures unifying dictionary learning and sparse representation

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Separating Underdetermined Convolutive Speech Mixtures

Abstract

Access this chapter

Preview

Similar content being viewed by others

Underdetermined blind source separation technique based on speech features extraction

A New Sparse Blind Source Separation Method for Determined Linear Convolutive Mixtures in Time-Frequency Domain

Underdetermined blind source separation of speech mixtures unifying dictionary learning and sparse representation

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation