Skip to main content

Separation of Mixed Audio Signals by Source Localization and Binary Masking with Hilbert Spectrum

  • Conference paper
Independent Component Analysis and Blind Signal Separation (ICA 2006)

Abstract

The Hilbert transformation together with empirical mode decomposition (EMD) produces Hilbert spectrum (HS) which is a fine-resolution time-frequency (TF) representation of any nonlinear and non-stationary signal. A method of audio signal separation from stereo mixtures based on the spatial location of the sources is presented in this paper. The TF representation of the audio signal is obtained by HS. The sources are localized in the space of time and intensity differences between two microphones’ signals. The separation is performed by masking the target signal in TF domain considering that the sources are disjoint orthogonal. The experimental results of the proposed method show a noticeable improvement of separation efficiency.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. Yilmaz, O., Rickard, S.: Blind Separation of Speech Mixtures via Time-Frequency Masking. IEEE Transactions on Signal Processing 52(7), 1830–1847 (2004)

    Article  MathSciNet  Google Scholar 

  2. Roman, N., Wang, D., Brown, G.J.: Speech segregation based on sound localization. Acost. Soc. of America 114(4), 2236–2252 (2003)

    Article  Google Scholar 

  3. Baeck, M., Zolzer, U.: Real-Time Implementation of Source Separation Algorithm. DAFx- 03, London, UK (2003)

    Google Scholar 

  4. http://sound.media.mit.edu/KEMAR.html

  5. Huang, N.E., et al.: The empirical mode decomposition and Hilbert spectrum for nonlinear and non-stationary time series analysis. Proc. Roy. Soc. London A 454, 903–995 (1998)

    Article  MATH  Google Scholar 

  6. Flandrin, P., Rilling, G., Goncalves, P.: Emperical Mode Decomposition as a filter bank. IEEE Sig. Proc. Letter (2003)

    Google Scholar 

  7. Wu, B.Z., Huang, N.E.: A study of the characteristics of white noise using the empirical mode decomposition method. Proc. R. Soc. Lond. A (460), 1597–1611 (2004)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Molla, M.K.I., Hirose, K., Minematsu, N. (2006). Separation of Mixed Audio Signals by Source Localization and Binary Masking with Hilbert Spectrum. In: Rosca, J., Erdogmus, D., Príncipe, J.C., Haykin, S. (eds) Independent Component Analysis and Blind Signal Separation. ICA 2006. Lecture Notes in Computer Science, vol 3889. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11679363_80

Download citation

  • DOI: https://doi.org/10.1007/11679363_80

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-32630-4

  • Online ISBN: 978-3-540-32631-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics