Skip to main content
Log in

Sound Source Localization Algorithm of Microphone Array Based on Incremental Broad Learning System

  • Published:
Circuits, Systems, and Signal Processing Aims and scope Submit manuscript

Abstract

Sound source localization is a technique that utilizes microphone arrays to detect the position of sound sources. It has a wide range of applications in areas such as smart homes, robot navigation, and conference recording. However, due to the complexity of the acoustic environment and the impact of noise interference, the accuracy of localization algorithms has always been a core concern in this field. Traditional time difference of arrival (TDOA) techniques struggle to achieve high precision and efficiency in localization results. To address this issue, this paper proposes a microphone array sound source localization method based on incremental broad learning (Enhance) algorithm. This method extracts shallow and deep features from audio signals and maps them to feature nodes and enhancement nodes in the broad learning system (BLS); a neural network model is constructed, which allows for fast adjustment of network structure and parameters. The model employs ridge regression to calculate connection weights and utilizes enhancement nodes to modify and optimize the feature nodes, thus achieving accurate prediction of sound source locations. The proposed method is experimentally validated using the NOIZEUS dataset and compared with the single-structure broad learning (One-shot) algorithm, back-propagation neural network (BP) algorithm, and recurrent neural network (RNN) algorithm. In the experiments, a microphone array consisting of four microphones is used, with a room size of 5 m × 4 m × 3 m. Different reverberation times (T60) and signal-to-noise ratios (SNRs) are employed to simulate various acoustic environments, and the performance of the four algorithms is evaluated in terms of outlier percentage and mean-squared error (MSE). The results demonstrate that under high reverberation (T60 = 700 ms) and low SNR (SNR = 0 dB) conditions, the proposed method achieves outlier percentage of only 0.308% and MSE of 0.92°. Compared to the other three algorithms, Enhance algorithm exhibits superior localization accuracy, noise robustness, and stability, thus holding significant research and practical value.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14

Similar content being viewed by others

Data Availability

The datasets and codes of this paper for the simulation are available from the corresponding author if requested.

References

  1. F. Bin, X. Lei, The combination of spectrum subtraction and cross-power spectrum phase method for time delay estimation. Arch. Acoust. 45, 453–458 (2020)

    Google Scholar 

  2. C.P. Chen, Z. Liu, Broad learning system: an effective and efficient incremental learning system without the need for deep architecture. IEEE Trans. Neural Netw. Learn. Syst. 29(1), 10–24 (2017)

    Article  MathSciNet  PubMed  Google Scholar 

  3. M.-A. Chung, H.-C. Chou, C.-W. Lin, Sound localization based on acoustic source using multiple microphone array in an indoor environment. Electronics 11(6), 890 (2022)

    Article  Google Scholar 

  4. Y. Dai, P. Teng, J. Lv et al., Noise reduction in infrasound signals based on mask coefficient binary weighting – generalized cross correlation – non-negative matrix factorization algorithm. Appl. Acoust. 186, 108452 (2022)

    Article  Google Scholar 

  5. V. Ershov, V. Palchikovskiy, Comparison of beamforming algorithms based on localization of calibrating sound sources and air jet noise. Int. J. Eng. Technol. 7(2.23), 119–223 (2018)

    Article  Google Scholar 

  6. S. Feng, C.P. Chen, Fuzzy broad learning system: a novel neuro-fuzzy model for regression and classification. IEEE Trans. Cybern. 50(2), 414–424 (2018)

    Article  PubMed  Google Scholar 

  7. Y. Geng, J. Jung, D.H. Seol, Sound-source localization system based on neural network for mobile robots, in 2008 IEEE International Joint Conference on Neural Networks (IEEE World Congress on Computational Intelligence) (2008), pp. 3126–30.

  8. E.A. Habets, Room impulse response generator. Technische Universiteit Eindhoven. Tech. Rep. 2(24), 1 (2006)

    Google Scholar 

  9. H.-G. Hirsch, D. Pearce, The Aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions, in Proceedings of the ASR2000-Automatic Speech Recognition: Challenges for the New Millennium ISCA Tutorial and Research Workshop (ITRW), F (2000).

  10. Y. Hu, P.C. Loizou, Subjective comparison and evaluation of speech enhancement algorithms. Speech Commun. 49(7–8), 588–601 (2007)

    Article  PubMed  PubMed Central  Google Scholar 

  11. W. Ke, X. Zhang, Y. Yuan et al., Compressing sensing based source localization for controlled acoustic signals using distributed microphone arrays. Math. Probl. Eng. 2017, 1–11 (2017)

    MathSciNet  Google Scholar 

  12. Y. Li, H. Chen, Reverberation robust feature extraction for sound source localization using a small-sized microphone array. IEEE Sens. J. 17(19), 6331–6339 (2017)

    Article  ADS  Google Scholar 

  13. T. Nagai, K. Kondo, M. Kaneko, et al., Estimation of source location based on 2-D MUSIC and its application to speech recognition in cars, in Proceedings of the 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing Proceedings (Cat No 01CH37221), F, (2001). IEEE.

  14. F. Nesta, M. Omologo, Generalized state coherence transform for multidimensional TDOA estimation of multiple sources. IEEE Trans. Audio Speech Lang. Process. 20(1), 246–260 (2011)

    Article  Google Scholar 

  15. M. Omologo, P. Svaizer, Acoustic event localization using a crosspower-spectrum phase based technique, in Proceedings of ICASSP'94 IEEE International Conference on Acoustics, Speech and Signal Processing, F (1994). IEEE.

  16. M. Qin, D. Hu, Z. Chen et al., Compressive sensing-based sound source localization for microphone arrays. Circuits Syst. Signal Process. 40, 4696–4719 (2021)

    Article  Google Scholar 

  17. E. Rothauser, IEEE recommended practice for speech quality measurements. IEEE Trans. Audio Electroacoust. 17(3), 225–246 (1969)

    Article  Google Scholar 

  18. S. Sakavičius, A. Serackis, Estimation of azimuth and elevation for multiple acoustic sources using tetrahedral microphone arrays and convolutional neural networks. Electronics 10(21), 2585 (2021)

    Article  Google Scholar 

  19. A.P. Singh, N. Tiwari, An improved method to localize simultaneously close and coherent sources based on symmetric-Toeplitz covariance matrix. Appl. Acoust. 182, 108176 (2021)

    Article  Google Scholar 

  20. K.-T. Song, J.-S. Hu, C.-Y. Tsai, et al., Speaker attention system for mobile robots using microphone array and face tracking, in Proceedings 2006 IEEE International Conference on Robotics and Automation, 2006 ICRA 2006, F (2006). IEEE.

  21. V. Stojanovic, V. Filipovic, Adaptive input design for identification of output error model with constrained output. Circuits Syst. Signal Process. 33(1), 97–113 (2014)

    Article  MathSciNet  Google Scholar 

  22. V. Stojanovic, N. Nedic, Robust Kalman filtering for nonlinear multivariable stochastic systems in the presence of non-Gaussian noise. Int. J. Robust Nonlinear Control 26(3), 445–460 (2016)

    Article  MathSciNet  Google Scholar 

  23. R. Tang, Y. Zuo, W. Liu et al., Efficient energy-based orthogonal matching pursuit algorithm for multiple sound source localization with unknown source count. Meas. Sci. Technol. 33(4), 045018 (2022)

    Article  ADS  Google Scholar 

  24. H. Tao, L. Cheng, J. Qiu et al., Few shot cross equipment fault diagnosis method based on parameter optimization and feature mertic. Meas. Sci. Technol. 33(11), 115005 (2022)

    Article  ADS  CAS  Google Scholar 

  25. J. Vandendriessche, B. Da Silva, L. Lhoest et al., M3-AC: a multi-mode multithread soc FPGA based acoustic camera. Electronics 10(3), 317 (2021)

    Article  Google Scholar 

  26. Y. Wu, R. Ayyalasomayajula, M.J. Bianco, et al., Sslide: Sound source localization for indoors based on deep learning, in Proceedings of the ICASSP 2021–2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), F (2021). IEEE.

  27. M. Zhang, D. Liu, Q. Wang et al., Detection of alertness-related EEG signals based on decision fused BP neural network. Biomed. Signal Process. Control 74, 103479 (2022)

    Article  Google Scholar 

  28. T. Zhang, X. Wang, X. Xu et al., GCB-Net: graph convolutional broad network and its application in emotion recognition. IEEE Trans. Affect. Comput. 13(1), 379–388 (2019)

    Article  Google Scholar 

Download references

Acknowledgements

The authors would like to thank the anonymous reviewers for their constructive comments and recommendations.

Funding

This research was funded by the National Natural Science Foundation of China (Grant No. 52165010), National Natural Science Foundation of China (Grant No. 52065013), Liuzhou Science and Technology Project: Development of key technologies for improving the fuel economy of commercial vehicles (2021AAA0108).

Author information

Authors and Affiliations

Authors

Contributions

Rongjiang Tang did conceptualization and funding acquisition and writing—review and editing; Yue Zhang contributed to methodology, writing—original draft preparation; Yingxiang Zuo contributed to software; Bo Lin and Meng Liang performed formal analysis. All authors have read and agreed to the published version of the manuscript.

Corresponding author

Correspondence to Rongjiang Tang.

Ethics declarations

Conflict of interest

The authors declare no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Tang, R., Zhang, Y., Zuo, Y. et al. Sound Source Localization Algorithm of Microphone Array Based on Incremental Broad Learning System. Circuits Syst Signal Process 43, 1549–1571 (2024). https://doi.org/10.1007/s00034-023-02521-0

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00034-023-02521-0

Keywords

Navigation