Abstract
On-line frequency-domain blind separation of audio sources performed through Independent Vector Analysis (IVA) suffers from the problem of determining the order of the separated outputs. In this work, we apply a supervised IVA based on pilot components obtained using a bank of Relative Transfer Functions (RTF). The bank is assumed to be available for potential positions of a target speaker within a confined area. In every frame, the most suitable RTF is selected from the bank based on a criterion. The pilot components are obtained as pre-separated target and interference, respectively, through the Minimum-Power Distortionless Beamforming and Null Beamforming. The supervised IVA is tested in a real-world scenario with various levels of up-to-dateness of the bank. We show that the global permutation problem is resolved even when the bank contains only pure delay filters. The Signal-to-Interference Ratio in separated signals is mostly better than that achieved by the pre-separation, unless the bank contains very precise RTFs.
This paper was supported by The Czech Science Foundation through Project No. 17-00902S and partly supported by the Student Grant Scheme 2018 project of the Technical University in Liberec and by the United States Department of the Navy, Office of Naval Research Global, through Project No. N62909-18-1-2040.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Douglas, S.C., Gupta, M.: Scaled natural gradient algorithms for instantaneous and convolutive blind source separation. In: 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP 2007, vol. 2, pp. II-637–II-640 (2007)
Gannot, S., Burshtein, D., Weinstein, E.: Signal enhancement using beamforming and nonstationarity with applications to speech. IEEE Trans. Signal Process. 49(8), 1614–1626 (2001)
Garofolo, J.S., Lamel, L.F., Fisher, W.M., Fiscus, J.G., Pallett, D.S., Dahlgren, N.L.: Darpa timit acoustic phonetic continuous speech corpus cdrom (1993)
Khan, A.H., Taseska, M., Habets, E.A.P.: A geometrically constrained independent vector analysis algorithm for online source extraction. In: Vincent, E., Yeredor, A., Koldovský, Z., Tichavský, P. (eds.) LVA/ICA 2015. LNCS, vol. 9237, pp. 396–403. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-22482-4_46
Kim, T., Attias, H.T., Lee, S.Y., Lee, T.W.: Blind source separation exploiting higher-order frequency dependencies. IEEE Trans. Audio Speech Lang. Process. 15, 70–79 (2007)
Koldovský, Z., Málek, J., Tichavský, P., Nesta, F.: Semi-blind noise extraction using partially known position of the target source. IEEE Trans. Audio Speech Lang. Process. 21(10), 2029–2041 (2013)
Koldovský, Z., Tichavský, P., Botka, D.: Noise reduction in dual-microphone mobile phones using a bank of pre-measured target-cancellation filters. In: Proceedings of IEEE International Conference on Audio, Speech and Signal Processing, pp. 679–683 (2013)
Lee, I., Kim, T., Lee, T.W.: Independent vector analysis for convolutive blind speech separation. In: Makino, S., Sawada, H., Lee, T.W. (eds.) Blind Speech Separation. Signals and Communication Technology, pp. 169–192. Springer, Dordrecht (2007). https://doi.org/10.1007/978-1-4020-6479-1_6
Liang, Y., Naqvi, S.M., Chambers, J.A.: Audio video based fast fixed-point independent vector analysis for multisource separation in a room environment. EURASIP J. Adv. Signal Process. 2012(1), 183 (2012)
Málek, J., Koldovský, Z., Gannot, S., Tichavský, P.: Informed generalized sidelobe canceler utilizing sparsity of speech signals. In: 2013 IEEE International Workshop on Machine Learning for Signal Processing (MLSP), pp. 1–6. IEEE (2013)
Matsuoka, K.: Minimal distortion principle for blind source separation. In: Proceedings of the 41st SICE Annual Conference, SICE 2002, vol. 4, pp. 2138–2143 (2002)
Nesta, F., Fakhry, M.: Unsupervised spatial dictionary learning for sparse underdetermined multichannel source separation. In: 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 86–90 (2013)
Nesta, F., Koldovský, Z.: Supervised independent vector analysis through pilot dependent components. In: 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 536–540 (2017)
Nesta, F., Mosayyebpour, S., Koldovský, Z., Paleček, K.: Audio/video supervised independent vector analysis through multimodal pilot dependent components. In: Proceedings of European Signal Processing Conference, pp. 1190–1194 (2017)
Ono, N.: Stable and fast update rules for independent vector analysis based on auxiliary function technique. In: Proceedings of IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, pp. 189–192 (2011)
Ono, T., Ono, N., Sagayama, S.: User-guided independent vector analysis with source activity tuning. In: 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 2417–2420 (2012)
Sawada, H., Mukai, R., Araki, S., Makino, S.: A robust and precise method for solving the permutation problem of frequency-domain blind source separation. In: Proceedings of International Conference on Independent Component Analysis and Signal Separation, pp. 505–510 (2003)
Smaragdis, P.: Blind separation of convolved mixtures in the frequency domain. Neurocomputing 22, 21–34 (1998)
Acknowledgements
We are due to Dr. Francesco Nesta from Synaptics for his helpful comments and useful suggestions.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG, part of Springer Nature
About this paper
Cite this paper
Čmejla, J., Kounovský, T., Málek, J., Koldovský, Z. (2018). Independent Vector Analysis Exploiting Pre-learned Banks of Relative Transfer Functions for Assumed Target’s Positions. In: Deville, Y., Gannot, S., Mason, R., Plumbley, M., Ward, D. (eds) Latent Variable Analysis and Signal Separation. LVA/ICA 2018. Lecture Notes in Computer Science(), vol 10891. Springer, Cham. https://doi.org/10.1007/978-3-319-93764-9_26
Download citation
DOI: https://doi.org/10.1007/978-3-319-93764-9_26
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-93763-2
Online ISBN: 978-3-319-93764-9
eBook Packages: Computer ScienceComputer Science (R0)