A Multi-channel Speech Enhancement Method Based on Subband Affine Projection Algorithm in Combination with Proposed Circular Nested Microphone Array

Firoozabadi, Ali Dehghan; Irarrazaval, Pablo; Adasme, Pablo; Durney, Hugo; Olave, Miguel Sanhueza; Zabala-Blanco, David; Azurdia-Meza, Cesar

doi:10.1007/978-3-030-58669-0_41

Ali Dehghan Firoozabadi¹⁹,
Pablo Irarrazaval²⁰,
Pablo Adasme²¹,
Hugo Durney¹⁹,
Miguel Sanhueza Olave¹⁹,
David Zabala-Blanco²² &
…
Cesar Azurdia-Meza²³

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 1261))

Included in the following conference series:

International Conference on Advanced Intelligent Systems and Informatics

3008 Accesses
1 Citations

Abstract

In this paper, a novel multi-channel speech enhancement system is introduced based on a proposed circular nested microphone array (C-NMA) in combination with subband affine projection algorithm (SB-APA). The multi-channel speech enhancement methods have better accuracy because of information redundancy in comparison with single-channel methods. Firstly, a novel C-NMA is proposed with low computational complexity in comparison with other speech recording microphones. The C-NMA eliminates the spatial aliasing in microphone signals. Then, a subband step is implemented based on the speech components to increase the frequency resolution. The affine projection algorithm is implemented adaptively on the subband signals by C-NMA. Finally, the subband signals are combined by the synthesize filters and the enhanced signal is produced. The accuracy of the proposed method is compared with least mean square (LMS), traditional APA, recursive least square (RLS), and real-time generalized cross-correlation non-negative matrix factorization (RT-GCC-NMF). The results show the superiority of the proposed method in comparison with other previous works in noisy and reverberant environmental conditions.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Prasad, P.B.M., Ganesh, M.S., Gangashetty, S.V.: Two microphone technique to improve the speech intelligibility under noisy environment. In: 14th International Colloquium on Signal Processing & Its Applications, Batu Feringghi, pp. 13–18 (2018)
Google Scholar
Fukui, M., Shimauchi, S., Hioka, Y., Nakagawa, A., Haneda, Y.: Acoustic echo and noise canceller for personal hands-free video IP phone. IEEE Trans. Consum. Electron. 62(4), 454–462 (2016)
Article Google Scholar
Kwon, K., Shin, J.W., Kim, N.S.: NMF-based speech enhancement using bases update. IEEE Signal Process. Lett. 22(4), 450–454 (2015)
Article Google Scholar
Chung, H., Badeau, R., Plourde, E.: Training and compensation of class-conditioned NMF bases for speech enhancement. Neurocomputing 284, 107–118 (2018)
Article Google Scholar
Compernolle, D.V.: Switching adaptive filters for enhancing noisy and reverberant speech from microphone array recordings. In: IEEE International Conference on Acoustics, Speech and Signal Processing, Albuquerque, USA, pp. 833–836 (1990)
Google Scholar
Ephaim, Y., Van Trees, H.L.: A signal subspace approach for speech enhancement. IEEE Trans. Speech Audio Process. 3, 251–266 (1995)
Article Google Scholar
Markovich-Golan, S., Bertrand, A., Moonen, M.: Optimal distributed minimum-variance beamforming approaches for speech enhancement in wireless acoustic sensor networks. IEEE Trans. Sig. Process. 107, 4–20 (2015)
Google Scholar
Cheng, N., Liu, W.: Perceptual properties based signal subspace microphone array speech enhancement algorithm. Acta Autom. Sin. 35(12), 1481–1487 (2009)
Article Google Scholar
Haykin, S.: Adaptive Filter Theory, 3rd edn. Prentice Hall Inc., Pearson (1996)
MATH Google Scholar
Rakesh, P., Kishore Kumar, T.: A novel RLS based adaptive filtering method for speech enhancement. In: 17th International Conference on Communications, Control and Signal Processing, London, pp. 176–181 (2015)
Google Scholar
Gonzalez, A.; Ferrer, M.; Albu, F.; Diego, M.: Affine projection algorithms: Evolution to smart and fast algorithms and applications. In Proceedings of the 20th European Signal Processing Conference (EUSIPCO), Bucharest, Romania, pp. 1965–1969(2012)
Google Scholar
Wood, S.U.N., Rouat, J.: Unsupervised low latency speech enhancement with RT-GCC-NMF. IEEE J. Sel. Top. Sig. Process. 13, 332–346 (2019)
Article Google Scholar
Firoozabadi, A.D., Abutalebi, H.R.: Combination of nested microphone array and subband processing for multiple simultaneous speaker localization. In: 6th International Symposium on Telecommunications (IST), Tehran, pp. 907–912 (2012)
Google Scholar
Yilmaz, O., Rickard, S.: Blind separation of speech mixtures via time-frequency masking. IEEE Trans. Signal Process. 52, 1830–1847 (2004)
Article MathSciNet Google Scholar
Gonzalez, A., Ferrer, M., Albu, F., Diego, M.D.: Affine projection algorithms: Evolution to smart and fast algorithms and applications. In: Proceedings of the 20th European Signal Processing Conference (EUSIPCO), Bucharest, pp. 1965–1969 (2012)
Google Scholar
Ozeki, K., Umeda, T.: An adaptive filtering algorithm using an orthogonal projection to an affine subspace and its properties. Electron. Commun. Jpn. 67(5), 19–27 (1984)
Article MathSciNet Google Scholar
Garofolo, J.S., et al.: TIMIT Acoustic-Phonetic Continuous Speech Corpus LDC93S1. Web Download. Philadelphia: Linguistic Data Consortium (1993). https://catalog.ldc.upenn.edu/LDC93S1. Accessed March 2019
Allen, J., Berkley, D.: Image method for efficiently simulating small-room acoustics. J. Acoust. Soc. Am. 65(4), 943–950 (1979)
Article Google Scholar

Download references

Acknowledgment

The authors acknowledge financial support from: FONDECYT No. 3190147 and FONDECYT No. 11180107.

Author information

Authors and Affiliations

Department of Electricity, Universidad Tecnológica Metropolitana, Av. Jose Pedro Alessandri 1242, 7800002, Santiago, Chile
Ali Dehghan Firoozabadi, Hugo Durney & Miguel Sanhueza Olave
Electrical Engineering Department, Pontificia Universidad Católica de Chile, Santiago, Chile
Pablo Irarrazaval
Electrical Engineering Department, Universidad de Santiago de Chile, Santiago, Chile
Pablo Adasme
Department of Computing and Industries, Universidad Católica Del Maule, 3466706, Talca, Chile
David Zabala-Blanco
Department of Electrical Engineering, Universidad de Chile, Santiago, Chile
Cesar Azurdia-Meza

Authors

Ali Dehghan Firoozabadi
View author publications
You can also search for this author in PubMed Google Scholar
Pablo Irarrazaval
View author publications
You can also search for this author in PubMed Google Scholar
Pablo Adasme
View author publications
You can also search for this author in PubMed Google Scholar
Hugo Durney
View author publications
You can also search for this author in PubMed Google Scholar
Miguel Sanhueza Olave
View author publications
You can also search for this author in PubMed Google Scholar
David Zabala-Blanco
View author publications
You can also search for this author in PubMed Google Scholar
Cesar Azurdia-Meza
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Ali Dehghan Firoozabadi .

Editor information

Editors and Affiliations

Faculty of Computers and Artificial Intelligence, Information Technology Department, and Chair of the Scientific Research Group in Egypt, Cairo University, Cairo, Egypt
Aboul Ella Hassanien
Department of Electronics and Computer Science, Koszalin University of Technology, Koszalin, Poland
Adam Slowik
Faculty of Electrical Engineering and Computer Science, VŠB-Technical University of Ostrava, Ostrava-Poruba, Moravskoslezsky, Czech Republic
Václav Snášel
Rector of the Electronic Research Institute, Cairo, Egypt
Hisham El-Deeb
Faculty of computers and information, Ain Shams University, Cairo, Egypt
Fahmy M. Tolba

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Firoozabadi, A.D. et al. (2021). A Multi-channel Speech Enhancement Method Based on Subband Affine Projection Algorithm in Combination with Proposed Circular Nested Microphone Array. In: Hassanien, A.E., Slowik, A., Snášel, V., El-Deeb, H., Tolba, F.M. (eds) Proceedings of the International Conference on Advanced Intelligent Systems and Informatics 2020. AISI 2020. Advances in Intelligent Systems and Computing, vol 1261. Springer, Cham. https://doi.org/10.1007/978-3-030-58669-0_41

Download citation

DOI: https://doi.org/10.1007/978-3-030-58669-0_41
Published: 20 September 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-58668-3
Online ISBN: 978-3-030-58669-0
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics