Skip to main content

Synchronizing Speech Mixtures in Speech Separation Problems under Reverberant Conditions

  • Conference paper
Artificial Intelligence and Soft Computing (ICAISC 2013)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 7894))

Included in the following conference series:

  • 1772 Accesses

Abstract

Blind Source Separation (BSS) techniques aim at recovering unobserved source signals from observed mixtures (typically, the outputs of an array of sensors). Practically all classical BSS techniques do not work properly under reverberant conditions and therefore, it still remains an open problem. In this sense, we propose in this document the use of synchronization of speech mixtures in order to improve the results of classical BSS techniques. Specifically, we have applied the synchronization of mixtures combined with one of the most well-known and robust BSS algorithms that works under non-reverberant conditions, the Degenerate Unmixing Estimation Technique (DUET). In the aim of synchronizing speech mixtures prior to the speech source separation, the suitability of working with seven Time Delay Estimation (TDE) techniques has been analyzed. Results show the feasibility of using synchronization since the results of DUET are improved and additionally, it has been observed what is the most useful TDE algorithm in this framework.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Cao, X.R., Liu, R.: General approach to blind source separation. IEEE Transactions on Signal Processing, 562–571 (1996)

    Google Scholar 

  2. Hérault, J., Jutten, C., Ans, B.: Détection de grandeurs primitives dans un message composite par une architecture de calcul neuromimétique en apprentissage non supervisé. In: 10 Colloque sur le Traitement du Signal et Des Images, France (1985)

    Google Scholar 

  3. Diggavi, S.N., Al-Dhahir, N., Stamoulis, A., Calderbank, A.R.: Great expectations: The value of spatial diversity in wireless networks. Proceedings of the IEEE, 219–270 (2004)

    Google Scholar 

  4. Cichocki, A., Georgiev, P.: Blind source separation algorithms with matrix constraints. IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences, 522–531 (2003)

    Google Scholar 

  5. Te-Won, L.: Independent component analysis: theory and applications. Kluwer Academic Publishers, Boston (1998)

    MATH  Google Scholar 

  6. Hurley, N., Rickard, S.: Comparing measures of sparsity. IEEE Transactions on Information Theory, 4723–4741 (2009)

    Google Scholar 

  7. Yilmaz, O., Rickard, S.: Blind separation of speech mixtures via time-frequency masking. IEEE Transactions on Signal Processing, 1830–1847 (2004)

    Google Scholar 

  8. Zicheng, L.: Sound source separation with distributed microphone arrays in the presence of clock synchronization errors. In: Proc. Int. Workshop Acoustic Echo and Noise Control, IWAENC (2008)

    Google Scholar 

  9. Lienhart, R., Kozintsev, I., Wehr, S., Yeung, M.: On the importance of exact synchronization for distributed audio signal processing. In: IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP, vol. 4, pp. IV-840–IV-843. IEEE (2003)

    Google Scholar 

  10. Brandstein, M.S., Adcock, J.E., Silverman, H.F.: A practical time-delay estimator for localizing speech sources with a microphone array. Computer Speech and Language, 153–170 (1995)

    Google Scholar 

  11. Yegnanarayana, B., Prasanna, S.R.M., Duraiswami, R., Zotkin, D.: Processing of reverberant speech for time-delay estimation. IEEE Transactions on Speech and Audio Processing, 1110–1118 (2005)

    Google Scholar 

  12. Carter, G.C.: Coherence and time delay estimation. Proceedings of the IEEE, 236–255 (1987)

    Google Scholar 

  13. Knapp, C., Carter, G.: The generalized correlation method for estimation of time delay. IEEE Transactions on Acoustics, Speech and Signal Processing, 320–327 (1976)

    Google Scholar 

  14. Emile, B., Comon, P., Le Roux, J.: Estimation of time delays with fewer sensors than sources. IEEE Transactions on Signal Processing, 2012–2015 (1998)

    Google Scholar 

  15. Wehr, S., Kozintsev, I., Lienhart, R., Kellermann, W.: Synchronization of acoustic sensors for distributed ad-hoc audio networks and its use for blind source separation. In: Proceedings of the IEEE Sixth International Symposium on Multimedia Software Engineering, pp. 18–25. IEEE (2004)

    Google Scholar 

  16. Francourt, C., Parra, L.: The coherence function in blind source separation of convolutive mixtures of non-stationary signals. In: IEEE Workshop on Neural Networks for Signal Processing, pp. 303–312 (2001)

    Google Scholar 

  17. Donohue, K.D., Agrinsoni, A., Hannemann, J.: Audio signal delay estimation using partial whitening. In: Proceedings of the IEEE SoutheastCon, pp. 466–471. IEEE (2007)

    Google Scholar 

  18. Saarnisaari, H.: ML time delay estimation in a multipath channel. In: Proceedings of the IEEE 4th International Symposium on Spread Spectrum Techniques and Applications, pp. 1007–1011. IEEE (1996)

    Google Scholar 

  19. Roth, P.R.: Effective measurements using digital signal analysis. IEEE Spectrum 8, 62–70 (1971)

    Article  Google Scholar 

  20. Carter, G.C., Nuttall, A.H., Cable, P.G.: The smoothed coherence transform. Proceedings of the IEEE, 1497–1498 (1973)

    Google Scholar 

  21. Jacovitti, G., Scarano, G.: Discrete time techniques for time delay estimation. IEEE Transactions on Signal Processing, 525–533 (1993)

    Google Scholar 

  22. Seneff, S., Zue, V.: Transcription and alignment of the timit database, TIMIT CD-ROM Documentation (1998)

    Google Scholar 

  23. McGovern, S.G: A model for room acoustics, http://www.2pi.us/rir.html

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Llerena, C., Gil-Pita, R., Álvarez, L., Rosa-Zurera, M. (2013). Synchronizing Speech Mixtures in Speech Separation Problems under Reverberant Conditions. In: Rutkowski, L., Korytkowski, M., Scherer, R., Tadeusiewicz, R., Zadeh, L.A., Zurada, J.M. (eds) Artificial Intelligence and Soft Computing. ICAISC 2013. Lecture Notes in Computer Science(), vol 7894. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-38658-9_52

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-38658-9_52

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-38657-2

  • Online ISBN: 978-3-642-38658-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics