Sound Source Localization Algorithm of Microphone Array Based on Incremental Broad Learning System

Tang, Rongjiang; Zhang, Yue; Zuo, Yingxiang; Lin, Bo; Liang, Meng

doi:10.1007/s00034-023-02521-0

Published: 15 October 2023

Volume 43, pages 1549–1571, (2024)
Circuits, Systems, and Signal Processing

Rongjiang Tang¹, Yue Zhang¹ (ORCID: orcid.org/0000-0001-5582-7388), Yingxiang Zuo¹, Bo Lin¹ & Meng Liang¹

Abstract

Sound source localization is a technique that uses microphone arrays to determine the position of sound sources. It has a wide range of applications in areas such as smart homes, robot navigation, and conference recording. However, because of the complexity of the acoustic environment and the impact of noise interference, the accuracy of localization algorithms has always been a core concern in this field. Traditional time difference of arrival (TDOA) techniques struggle to achieve high precision and efficiency in localization. To address this issue, this paper proposes a microphone array sound source localization method based on an incremental broad learning (Enhance) algorithm. The method extracts shallow and deep features from audio signals and maps them to feature nodes and enhancement nodes in the broad learning system (BLS), constructing a neural network model whose structure and parameters can be adjusted quickly. The model employs ridge regression to calculate the connection weights and uses the enhancement nodes to modify and optimize the feature nodes, thus achieving accurate prediction of sound source locations. The proposed method is experimentally validated on the NOIZEUS dataset and compared with the single-structure broad learning (One-shot) algorithm, the back-propagation neural network (BP) algorithm, and the recurrent neural network (RNN) algorithm. The experiments use a microphone array consisting of four microphones in a room of size 5 m × 4 m × 3 m. Different reverberation times (T₆₀) and signal-to-noise ratios (SNRs) are employed to simulate various acoustic environments, and the performance of the four algorithms is evaluated in terms of outlier percentage and mean-squared error (MSE). The results demonstrate that under high reverberation (T₆₀ = 700 ms) and low SNR (SNR = 0 dB) conditions, the proposed method achieves an outlier percentage of only 0.308% and an MSE of 0.92°. Compared to the other three algorithms, the Enhance algorithm exhibits superior localization accuracy, noise robustness, and stability, and therefore has both research and practical value.
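As a rough illustration of the structure the abstract describes (feature nodes, enhancement nodes, and ridge-regression output weights), the following is a minimal sketch of a broad learning system in NumPy. It is not the authors' implementation: the class name `TinyBLS`, the random linear feature mapping, the `tanh` enhancement activation, the node counts, and the regularization value `lam` are all assumptions made for illustration, and the `grow` step simply re-solves the ridge regression after adding enhancement nodes rather than using an incremental pseudoinverse update. The input features and azimuth targets are synthetic stand-ins, not NOIZEUS data.

```python
import numpy as np


def ridge_solve(A, Y, lam=1e-3):
    """Closed-form ridge regression: W = (A^T A + lam * I)^-1 A^T Y."""
    return np.linalg.solve(A.T @ A + lam * np.eye(A.shape[1]), A.T @ Y)


class TinyBLS:
    """Illustrative broad learning system (hypothetical, simplified)."""

    def __init__(self, n_feature=20, n_enhance=40, lam=1e-3, seed=0):
        self.rng = np.random.default_rng(seed)
        self.n_feature = n_feature
        self.n_enhance = n_enhance
        self.lam = lam
        self.enh = []  # one (weights, bias) pair per group of enhancement nodes

    def _add_enhancement(self, n):
        # Random, fixed weights for a new group of enhancement nodes.
        We = self.rng.standard_normal((self.n_feature, n))
        be = self.rng.standard_normal(n)
        self.enh.append((We, be))

    def _nodes(self, X):
        # Feature nodes: random linear mapping of the input features.
        Z = X @ self.Wf + self.bf
        # Enhancement nodes: nonlinear transforms of the feature nodes.
        H = [np.tanh(Z @ We + be) for We, be in self.enh]
        return np.hstack([Z] + H)

    def fit(self, X, Y):
        self.Wf = self.rng.standard_normal((X.shape[1], self.n_feature))
        self.bf = self.rng.standard_normal(self.n_feature)
        self.enh = []
        self._add_enhancement(self.n_enhance)
        # Only the output weights are trained, via ridge regression.
        self.W = ridge_solve(self._nodes(X), Y, self.lam)
        return self

    def grow(self, X, Y, n_new):
        # Widen the network with extra enhancement nodes, then re-solve.
        # (Assumption: a full re-solve stands in for an incremental update.)
        self._add_enhancement(n_new)
        self.W = ridge_solve(self._nodes(X), Y, self.lam)
        return self

    def predict(self, X):
        return self._nodes(X) @ self.W


if __name__ == "__main__":
    # Synthetic stand-in data: 6 made-up array features, 1 toy azimuth target.
    rng = np.random.default_rng(1)
    X = rng.standard_normal((200, 6))
    Y = X @ rng.standard_normal((6, 1)) + 0.1 * np.sin(X[:, :1])
    model = TinyBLS().fit(X, Y)
    model.grow(X, Y, n_new=10)
    print("training MSE:", float(np.mean((model.predict(X) - Y) ** 2)))
```

The design point this sketch tries to convey is that the hidden-layer weights stay fixed and random, so widening the network only changes the linear system solved for the output weights; this is what makes structural adjustment cheap compared with retraining a deep model by back-propagation.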

Data Availability

The datasets and code used for the simulations in this paper are available from the corresponding author upon request.

Acknowledgements

The authors would like to thank the anonymous reviewers for their constructive comments and recommendations.

Funding

This research was funded by the National Natural Science Foundation of China (Grant Nos. 52165010 and 52065013) and by the Liuzhou Science and Technology Project "Development of key technologies for improving the fuel economy of commercial vehicles" (2021AAA0108).

Author information

Authors and Affiliations

School of Mechanical and Electrical Engineering, Guilin University of Electronic Technology, Guilin, 541004, China
Rongjiang Tang, Yue Zhang, Yingxiang Zuo, Bo Lin & Meng Liang

Contributions

Rongjiang Tang contributed to conceptualization, funding acquisition, and writing (review and editing); Yue Zhang contributed to methodology and writing (original draft preparation); Yingxiang Zuo contributed to software; Bo Lin and Meng Liang performed formal analysis. All authors have read and agreed to the published version of the manuscript.

Corresponding author

Correspondence to Rongjiang Tang.

Ethics declarations

Conflict of interest

The authors declare no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

About this article

Cite this article

Tang, R., Zhang, Y., Zuo, Y. et al. Sound Source Localization Algorithm of Microphone Array Based on Incremental Broad Learning System. Circuits Syst Signal Process 43, 1549–1571 (2024). https://doi.org/10.1007/s00034-023-02521-0

Received: 20 April 2023
Revised: 13 September 2023
Accepted: 13 September 2023
Published: 15 October 2023
Issue Date: March 2024
DOI: https://doi.org/10.1007/s00034-023-02521-0
