Skip to main content

Independent Vector Analysis Exploiting Pre-learned Banks of Relative Transfer Functions for Assumed Target’s Positions

  • Conference paper
  • First Online:
Latent Variable Analysis and Signal Separation (LVA/ICA 2018)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 10891))

Abstract

On-line frequency-domain blind separation of audio sources performed through Independent Vector Analysis (IVA) suffers from the problem of determining the order of the separated outputs. In this work, we apply a supervised IVA based on pilot components obtained using a bank of Relative Transfer Functions (RTF). The bank is assumed to be available for potential positions of a target speaker within a confined area. In every frame, the most suitable RTF is selected from the bank based on a criterion. The pilot components are obtained as pre-separated target and interference, respectively, through the Minimum-Power Distortionless Beamforming and Null Beamforming. The supervised IVA is tested in a real-world scenario with various levels of up-to-dateness of the bank. We show that the global permutation problem is resolved even when the bank contains only pure delay filters. The Signal-to-Interference Ratio in separated signals is mostly better than that achieved by the pre-separation, unless the bank contains very precise RTFs.

This paper was supported by The Czech Science Foundation through Project No. 17-00902S and partly supported by the Student Grant Scheme 2018 project of the Technical University in Liberec and by the United States Department of the Navy, Office of Naval Research Global, through Project No. N62909-18-1-2040.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Douglas, S.C., Gupta, M.: Scaled natural gradient algorithms for instantaneous and convolutive blind source separation. In: 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP 2007, vol. 2, pp. II-637–II-640 (2007)

    Google Scholar 

  2. Gannot, S., Burshtein, D., Weinstein, E.: Signal enhancement using beamforming and nonstationarity with applications to speech. IEEE Trans. Signal Process. 49(8), 1614–1626 (2001)

    Article  Google Scholar 

  3. Garofolo, J.S., Lamel, L.F., Fisher, W.M., Fiscus, J.G., Pallett, D.S., Dahlgren, N.L.: Darpa timit acoustic phonetic continuous speech corpus cdrom (1993)

    Google Scholar 

  4. Khan, A.H., Taseska, M., Habets, E.A.P.: A geometrically constrained independent vector analysis algorithm for online source extraction. In: Vincent, E., Yeredor, A., Koldovský, Z., Tichavský, P. (eds.) LVA/ICA 2015. LNCS, vol. 9237, pp. 396–403. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-22482-4_46

    Chapter  Google Scholar 

  5. Kim, T., Attias, H.T., Lee, S.Y., Lee, T.W.: Blind source separation exploiting higher-order frequency dependencies. IEEE Trans. Audio Speech Lang. Process. 15, 70–79 (2007)

    Article  Google Scholar 

  6. Koldovský, Z., Málek, J., Tichavský, P., Nesta, F.: Semi-blind noise extraction using partially known position of the target source. IEEE Trans. Audio Speech Lang. Process. 21(10), 2029–2041 (2013)

    Article  Google Scholar 

  7. Koldovský, Z., Tichavský, P., Botka, D.: Noise reduction in dual-microphone mobile phones using a bank of pre-measured target-cancellation filters. In: Proceedings of IEEE International Conference on Audio, Speech and Signal Processing, pp. 679–683 (2013)

    Google Scholar 

  8. Lee, I., Kim, T., Lee, T.W.: Independent vector analysis for convolutive blind speech separation. In: Makino, S., Sawada, H., Lee, T.W. (eds.) Blind Speech Separation. Signals and Communication Technology, pp. 169–192. Springer, Dordrecht (2007). https://doi.org/10.1007/978-1-4020-6479-1_6

    Chapter  Google Scholar 

  9. Liang, Y., Naqvi, S.M., Chambers, J.A.: Audio video based fast fixed-point independent vector analysis for multisource separation in a room environment. EURASIP J. Adv. Signal Process. 2012(1), 183 (2012)

    Article  Google Scholar 

  10. Málek, J., Koldovský, Z., Gannot, S., Tichavský, P.: Informed generalized sidelobe canceler utilizing sparsity of speech signals. In: 2013 IEEE International Workshop on Machine Learning for Signal Processing (MLSP), pp. 1–6. IEEE (2013)

    Google Scholar 

  11. Matsuoka, K.: Minimal distortion principle for blind source separation. In: Proceedings of the 41st SICE Annual Conference, SICE 2002, vol. 4, pp. 2138–2143 (2002)

    Google Scholar 

  12. Nesta, F., Fakhry, M.: Unsupervised spatial dictionary learning for sparse underdetermined multichannel source separation. In: 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 86–90 (2013)

    Google Scholar 

  13. Nesta, F., Koldovský, Z.: Supervised independent vector analysis through pilot dependent components. In: 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 536–540 (2017)

    Google Scholar 

  14. Nesta, F., Mosayyebpour, S., Koldovský, Z., Paleček, K.: Audio/video supervised independent vector analysis through multimodal pilot dependent components. In: Proceedings of European Signal Processing Conference, pp. 1190–1194 (2017)

    Google Scholar 

  15. Ono, N.: Stable and fast update rules for independent vector analysis based on auxiliary function technique. In: Proceedings of IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, pp. 189–192 (2011)

    Google Scholar 

  16. Ono, T., Ono, N., Sagayama, S.: User-guided independent vector analysis with source activity tuning. In: 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 2417–2420 (2012)

    Google Scholar 

  17. Sawada, H., Mukai, R., Araki, S., Makino, S.: A robust and precise method for solving the permutation problem of frequency-domain blind source separation. In: Proceedings of International Conference on Independent Component Analysis and Signal Separation, pp. 505–510 (2003)

    Google Scholar 

  18. Smaragdis, P.: Blind separation of convolved mixtures in the frequency domain. Neurocomputing 22, 21–34 (1998)

    Article  Google Scholar 

Download references

Acknowledgements

We are due to Dr. Francesco Nesta from Synaptics for his helpful comments and useful suggestions.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jaroslav Čmejla .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer International Publishing AG, part of Springer Nature

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Čmejla, J., Kounovský, T., Málek, J., Koldovský, Z. (2018). Independent Vector Analysis Exploiting Pre-learned Banks of Relative Transfer Functions for Assumed Target’s Positions. In: Deville, Y., Gannot, S., Mason, R., Plumbley, M., Ward, D. (eds) Latent Variable Analysis and Signal Separation. LVA/ICA 2018. Lecture Notes in Computer Science(), vol 10891. Springer, Cham. https://doi.org/10.1007/978-3-319-93764-9_26

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-93764-9_26

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-93763-2

  • Online ISBN: 978-3-319-93764-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics