Independent Vector Analysis Exploiting Pre-learned Banks of Relative Transfer Functions for Assumed Target’s Positions

Čmejla, Jaroslav; Kounovský, Tomáš; Málek, Jiří; Koldovský, Zbyněk

doi:10.1007/978-3-319-93764-9_26

Jaroslav Čmejla¹⁸,
Tomáš Kounovský¹⁸,
Jiří Málek¹⁸ &
…
Zbyněk Koldovský¹⁸

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 10891))

Included in the following conference series:

International Conference on Latent Variable Analysis and Signal Separation

1691 Accesses
2 Citations

Abstract

On-line frequency-domain blind separation of audio sources performed through Independent Vector Analysis (IVA) suffers from the problem of determining the order of the separated outputs. In this work, we apply a supervised IVA based on pilot components obtained using a bank of Relative Transfer Functions (RTF). The bank is assumed to be available for potential positions of a target speaker within a confined area. In every frame, the most suitable RTF is selected from the bank based on a criterion. The pilot components are obtained as pre-separated target and interference, respectively, through the Minimum-Power Distortionless Beamforming and Null Beamforming. The supervised IVA is tested in a real-world scenario with various levels of up-to-dateness of the bank. We show that the global permutation problem is resolved even when the bank contains only pure delay filters. The Signal-to-Interference Ratio in separated signals is mostly better than that achieved by the pre-separation, unless the bank contains very precise RTFs.

This paper was supported by The Czech Science Foundation through Project No. 17-00902S and partly supported by the Student Grant Scheme 2018 project of the Technical University in Liberec and by the United States Department of the Navy, Office of Naval Research Global, through Project No. N62909-18-1-2040.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Douglas, S.C., Gupta, M.: Scaled natural gradient algorithms for instantaneous and convolutive blind source separation. In: 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP 2007, vol. 2, pp. II-637–II-640 (2007)
Google Scholar
Gannot, S., Burshtein, D., Weinstein, E.: Signal enhancement using beamforming and nonstationarity with applications to speech. IEEE Trans. Signal Process. 49(8), 1614–1626 (2001)
Article Google Scholar
Garofolo, J.S., Lamel, L.F., Fisher, W.M., Fiscus, J.G., Pallett, D.S., Dahlgren, N.L.: Darpa timit acoustic phonetic continuous speech corpus cdrom (1993)
Google Scholar
Khan, A.H., Taseska, M., Habets, E.A.P.: A geometrically constrained independent vector analysis algorithm for online source extraction. In: Vincent, E., Yeredor, A., Koldovský, Z., Tichavský, P. (eds.) LVA/ICA 2015. LNCS, vol. 9237, pp. 396–403. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-22482-4_46
Chapter Google Scholar
Kim, T., Attias, H.T., Lee, S.Y., Lee, T.W.: Blind source separation exploiting higher-order frequency dependencies. IEEE Trans. Audio Speech Lang. Process. 15, 70–79 (2007)
Article Google Scholar
Koldovský, Z., Málek, J., Tichavský, P., Nesta, F.: Semi-blind noise extraction using partially known position of the target source. IEEE Trans. Audio Speech Lang. Process. 21(10), 2029–2041 (2013)
Article Google Scholar
Koldovský, Z., Tichavský, P., Botka, D.: Noise reduction in dual-microphone mobile phones using a bank of pre-measured target-cancellation filters. In: Proceedings of IEEE International Conference on Audio, Speech and Signal Processing, pp. 679–683 (2013)
Google Scholar
Lee, I., Kim, T., Lee, T.W.: Independent vector analysis for convolutive blind speech separation. In: Makino, S., Sawada, H., Lee, T.W. (eds.) Blind Speech Separation. Signals and Communication Technology, pp. 169–192. Springer, Dordrecht (2007). https://doi.org/10.1007/978-1-4020-6479-1_6
Chapter Google Scholar
Liang, Y., Naqvi, S.M., Chambers, J.A.: Audio video based fast fixed-point independent vector analysis for multisource separation in a room environment. EURASIP J. Adv. Signal Process. 2012(1), 183 (2012)
Article Google Scholar
Málek, J., Koldovský, Z., Gannot, S., Tichavský, P.: Informed generalized sidelobe canceler utilizing sparsity of speech signals. In: 2013 IEEE International Workshop on Machine Learning for Signal Processing (MLSP), pp. 1–6. IEEE (2013)
Google Scholar
Matsuoka, K.: Minimal distortion principle for blind source separation. In: Proceedings of the 41st SICE Annual Conference, SICE 2002, vol. 4, pp. 2138–2143 (2002)
Google Scholar
Nesta, F., Fakhry, M.: Unsupervised spatial dictionary learning for sparse underdetermined multichannel source separation. In: 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 86–90 (2013)
Google Scholar
Nesta, F., Koldovský, Z.: Supervised independent vector analysis through pilot dependent components. In: 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 536–540 (2017)
Google Scholar
Nesta, F., Mosayyebpour, S., Koldovský, Z., Paleček, K.: Audio/video supervised independent vector analysis through multimodal pilot dependent components. In: Proceedings of European Signal Processing Conference, pp. 1190–1194 (2017)
Google Scholar
Ono, N.: Stable and fast update rules for independent vector analysis based on auxiliary function technique. In: Proceedings of IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, pp. 189–192 (2011)
Google Scholar
Ono, T., Ono, N., Sagayama, S.: User-guided independent vector analysis with source activity tuning. In: 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 2417–2420 (2012)
Google Scholar
Sawada, H., Mukai, R., Araki, S., Makino, S.: A robust and precise method for solving the permutation problem of frequency-domain blind source separation. In: Proceedings of International Conference on Independent Component Analysis and Signal Separation, pp. 505–510 (2003)
Google Scholar
Smaragdis, P.: Blind separation of convolved mixtures in the frequency domain. Neurocomputing 22, 21–34 (1998)
Article Google Scholar

Download references

Acknowledgements

We are due to Dr. Francesco Nesta from Synaptics for his helpful comments and useful suggestions.

Author information

Authors and Affiliations

Acoustic Signal Analysis and Processing Group, Faculty of Mechatronics, Informatics and Interdisciplinary Studies, Technical University of Liberec, Studentská 2, 461 17, Liberec, Czech Republic
Jaroslav Čmejla, Tomáš Kounovský, Jiří Málek & Zbyněk Koldovský

Authors

Jaroslav Čmejla
View author publications
You can also search for this author in PubMed Google Scholar
Tomáš Kounovský
View author publications
You can also search for this author in PubMed Google Scholar
Jiří Málek
View author publications
You can also search for this author in PubMed Google Scholar
Zbyněk Koldovský
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Jaroslav Čmejla .

Editor information

Editors and Affiliations

Paul Sabatier University, Toulouse, France
Yannick Deville
Bar-Ilan University, Ramat Gan, Israel
Sharon Gannot
University of Surrey, Guildford, United Kingdom
Russell Mason
University of Surrey, Guildford, United Kingdom
Mark D. Plumbley
University of Surrey, Guildford, United Kingdom
Dominic Ward

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Čmejla, J., Kounovský, T., Málek, J., Koldovský, Z. (2018). Independent Vector Analysis Exploiting Pre-learned Banks of Relative Transfer Functions for Assumed Target’s Positions. In: Deville, Y., Gannot, S., Mason, R., Plumbley, M., Ward, D. (eds) Latent Variable Analysis and Signal Separation. LVA/ICA 2018. Lecture Notes in Computer Science(), vol 10891. Springer, Cham. https://doi.org/10.1007/978-3-319-93764-9_26

Download citation

DOI: https://doi.org/10.1007/978-3-319-93764-9_26
Published: 06 June 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-93763-2
Online ISBN: 978-3-319-93764-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics