Synchronizing Speech Mixtures in Speech Separation Problems under Reverberant Conditions

Llerena, Cosme; Gil-Pita, Roberto; Álvarez, Lorena; Rosa-Zurera, Manuel

doi:10.1007/978-3-642-38658-9_52

Cosme Llerena²³,
Roberto Gil-Pita²³,
Lorena Álvarez²³ &
…
Manuel Rosa-Zurera²³

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 7894))

Included in the following conference series:

International Conference on Artificial Intelligence and Soft Computing

1772 Accesses

Abstract

Blind Source Separation (BSS) techniques aim at recovering unobserved source signals from observed mixtures (typically, the outputs of an array of sensors). Practically all classical BSS techniques do not work properly under reverberant conditions and therefore, it still remains an open problem. In this sense, we propose in this document the use of synchronization of speech mixtures in order to improve the results of classical BSS techniques. Specifically, we have applied the synchronization of mixtures combined with one of the most well-known and robust BSS algorithms that works under non-reverberant conditions, the Degenerate Unmixing Estimation Technique (DUET). In the aim of synchronizing speech mixtures prior to the speech source separation, the suitability of working with seven Time Delay Estimation (TDE) techniques has been analyzed. Results show the feasibility of using synchronization since the results of DUET are improved and additionally, it has been observed what is the most useful TDE algorithm in this framework.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Cao, X.R., Liu, R.: General approach to blind source separation. IEEE Transactions on Signal Processing, 562–571 (1996)
Google Scholar
Hérault, J., Jutten, C., Ans, B.: Détection de grandeurs primitives dans un message composite par une architecture de calcul neuromimétique en apprentissage non supervisé. In: 10 Colloque sur le Traitement du Signal et Des Images, France (1985)
Google Scholar
Diggavi, S.N., Al-Dhahir, N., Stamoulis, A., Calderbank, A.R.: Great expectations: The value of spatial diversity in wireless networks. Proceedings of the IEEE, 219–270 (2004)
Google Scholar
Cichocki, A., Georgiev, P.: Blind source separation algorithms with matrix constraints. IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences, 522–531 (2003)
Google Scholar
Te-Won, L.: Independent component analysis: theory and applications. Kluwer Academic Publishers, Boston (1998)
MATH Google Scholar
Hurley, N., Rickard, S.: Comparing measures of sparsity. IEEE Transactions on Information Theory, 4723–4741 (2009)
Google Scholar
Yilmaz, O., Rickard, S.: Blind separation of speech mixtures via time-frequency masking. IEEE Transactions on Signal Processing, 1830–1847 (2004)
Google Scholar
Zicheng, L.: Sound source separation with distributed microphone arrays in the presence of clock synchronization errors. In: Proc. Int. Workshop Acoustic Echo and Noise Control, IWAENC (2008)
Google Scholar
Lienhart, R., Kozintsev, I., Wehr, S., Yeung, M.: On the importance of exact synchronization for distributed audio signal processing. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP, vol. 4, pp. IV-840–IV-843. IEEE (2003)
Google Scholar
Brandstein, M.S., Adcock, J.E., Silverman, H.F.: A practical time-delay estimator for localizing speech sources with a microphone array. Computer Speech and Language, 153–170 (1995)
Google Scholar
Yegnanarayana, B., Prasanna, S.R.M., Duraiswami, R., Zotkin, D.: Processing of reverberant speech for time-delay estimation. IEEE Transactions on Speech and Audio Processing, 1110–1118 (2005)
Google Scholar
Carter, G.C.: Coherence and time delay estimation. Proceedings of the IEEE, 236–255 (1987)
Google Scholar
Knapp, C., Carter, G.: The generalized correlation method for estimation of time delay. IEEE Transactions on Acoustics, Speech and Signal Processing, 320–327 (1976)
Google Scholar
Emile, B., Comon, P., Le Roux, J.: Estimation of time delays with fewer sensors than sources. IEEE Transactions on Signal Processing, 2012–2015 (1998)
Google Scholar
Wehr, S., Kozintsev, I., Lienhart, R., Kellermann, W.: Synchronization of acoustic sensors for distributed ad-hoc audio networks and its use for blind source separation. In: Proceedings of the IEEE Sixth International Symposium on Multimedia Software Engineering, pp. 18–25. IEEE (2004)
Google Scholar
Francourt, C., Parra, L.: The coherence function in blind source separation of convolutive mixtures of non-stationary signals. In: IEEE Workshop on Neural Networks for Signal Processing, pp. 303–312 (2001)
Google Scholar
Donohue, K.D., Agrinsoni, A., Hannemann, J.: Audio signal delay estimation using partial whitening. In: Proceedings of the IEEE SoutheastCon, pp. 466–471. IEEE (2007)
Google Scholar
Saarnisaari, H.: ML time delay estimation in a multipath channel. In: Proceedings of the IEEE 4th International Symposium on Spread Spectrum Techniques and Applications, pp. 1007–1011. IEEE (1996)
Google Scholar
Roth, P.R.: Effective measurements using digital signal analysis. IEEE Spectrum 8, 62–70 (1971)
Article Google Scholar
Carter, G.C., Nuttall, A.H., Cable, P.G.: The smoothed coherence transform. Proceedings of the IEEE, 1497–1498 (1973)
Google Scholar
Jacovitti, G., Scarano, G.: Discrete time techniques for time delay estimation. IEEE Transactions on Signal Processing, 525–533 (1993)
Google Scholar
Seneff, S., Zue, V.: Transcription and alignment of the timit database, TIMIT CD-ROM Documentation (1998)
Google Scholar
McGovern, S.G: A model for room acoustics, http://www.2pi.us/rir.html

Download references

Author information

Authors and Affiliations

Polytechnic School, University of Alcalá, Ctra. Madrid-Barcelona Km 33.200, 28850, Alcalá de Henares, Spain
Cosme Llerena, Roberto Gil-Pita, Lorena Álvarez & Manuel Rosa-Zurera

Authors

Cosme Llerena
View author publications
You can also search for this author in PubMed Google Scholar
Roberto Gil-Pita
View author publications
You can also search for this author in PubMed Google Scholar
Lorena Álvarez
View author publications
You can also search for this author in PubMed Google Scholar
Manuel Rosa-Zurera
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Częstochowa University of Technology, Armii Krajowej 36, 42-200, Częstochowa, Poland
Leszek Rutkowski , Marcin Korytkowski & Rafał Scherer , &
AGH University of Science and Technology, Michiewicza 30, 30-059, Kraków, Poland
Ryszard Tadeusiewicz
Department of Electrical Engineering and Computer Sciences, University of California, 94720-1776, Berkeley, CA, USA
Lotfi A. Zadeh
Electrical and Computer Engineering, University of Louisville, 405 Lutz Hall, 40292, Louisville, KY, USA
Jacek M. Zurada

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Llerena, C., Gil-Pita, R., Álvarez, L., Rosa-Zurera, M. (2013). Synchronizing Speech Mixtures in Speech Separation Problems under Reverberant Conditions. In: Rutkowski, L., Korytkowski, M., Scherer, R., Tadeusiewicz, R., Zadeh, L.A., Zurada, J.M. (eds) Artificial Intelligence and Soft Computing. ICAISC 2013. Lecture Notes in Computer Science(), vol 7894. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-38658-9_52

Download citation

DOI: https://doi.org/10.1007/978-3-642-38658-9_52
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-38657-2
Online ISBN: 978-3-642-38658-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics