ABSTRACT
In this paper, a multi-sound source localization method based on combining phase consistency and angle deviation is proposed. Firstly, a single source point (SSP) detection method based on phase consistency is used to detect SSP. Secondly, the outliers were removed based on the method of angle deviation to obtain the detected SSP points. Finally, the detected SSP points were post-processed to obtain the Direction of Arrival (DOA) estimates. Experimental result shows that the proposed method has a higher localization accuracy compared with reference methods.
- Konrad Kowalczyk Parametric Spatial Sound Processing: A flexible and efficient solution to sound scene acquisition, modification, and reproduction. [J]. IEEE Signal Process. Mag., 2015, 32(2): 31-42.Google Scholar
- JIA M., YANG Z., BAO C., Encoding Multiple Audio Objects Using Intra Object Sparsity[J]. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2015, 23(6):1082–1095.Google Scholar
- Zheng, X., Ritz, C., & Xi, J. (2016). Encoding and communicating navigable speech soundfields. Multimedia Tools and Applications, 75(9), 5183-5204.Google ScholarDigital Library
- Chen, J., Benesty, J., & Huang, Y. (2006). Time delay estimation in room acoustic environments. EURASIP Journal on Advances in Signal Processing.Google Scholar
- Nesta, F., Omologo, M. (2012). Generalized State Coherence Transform for Multidimensional TDOA Estimation of Multiple Sources. , 20(1), 246–260. doi:10.1109/tasl.2011.2160168Google ScholarDigital Library
- Whitmire, Eric, Latif, Tahmid, Novak, & Tristan, (2016). Sound localization sensors for search and rescue biobots. IEEE Sensors Journal.Google Scholar
- Rui, L. I., & Huang, X. (2010). A modified fastica algorithm for blind signal separation. Communication Countermeasures.Google Scholar
- Hafezi, S., Moore, A. H., & Naylor, P. A. (2017). Multiple DOA estimation based on estimation consistency and spherical harmonic multiple signal classification. 2017 25th European Signal Processing Conference (EUSIPCO).Google ScholarCross Ref
- Valin, J. M., Michaud, François, Hadjou, B., & Rouat, J. (2004). Localization of Simultaneous Moving Sound Sources for Mobile Robot Using a Frequency-Domain Steered Beamformer Approach. IEEE International Conference on Robotics & Automation. IEEE.Google ScholarCross Ref
- Zibulevsky, M., B.A. Pearlmutter, Bofill, P., & Kisilev, P. (2001). Blind source separation by sparse decomposition in a signal dictionary.Google ScholarCross Ref
- Zheng, X., Ritz, C., & Xi, J. (2013). Collaborative blind source separation using location informed spatial microphones. IEEE.Google ScholarCross Ref
- Yilmaz, O., & Rickard, S. (2004). Blind separation of speech mixtures via time-frequency masking. IEEE Transactions on Signal Processing: A publication of the IEEE Signal Processing Society (7), 52.Google Scholar
- Pavlidi, D, Griffin, A, Puigt, & M, (2013). Real-time multiple sound source localization and counting using a circular microphone array. IEEE transactions on audio, speech and language processing, 21(10), 2193-2206.Google ScholarDigital Library
- Jia, M., Sun, J., & Bao, C. (2017). Real-time multiple sound source localization and counting using a soundfield microphone. Journal of Ambient Intelligence & Humanized Computing, 8(6), 829-844.Google ScholarCross Ref
- Cobos, M., Lopez, J. J., & Martinez, D. (2011). Two-microphone multi-speaker localization based on a laplacian mixture model. Digital Signal Processing, 21(1), 66-76.Google ScholarDigital Library
- Araki, S., Sawada, H., Mukai, R., & Makino, S. (2011). Doa estimation for multiple sparse sources with arbitrarily arranged multiple sensors. Journal of Signal Processing Systems, 63(3), 265-275.Google ScholarDigital Library
- Swartling, M., B Sällberg, & N Grbić. (2011). Source localization for multiple speech sources using low complexity non-parametric source separation and clustering. Signal Processing, 91 (8), 1781-1788.Google ScholarDigital Library
- Rafaely, B., & Alhaiany, K (2018). Speaker localization using direct path dominance test based on sound field directivity. Signal Processing, 143, 42-47.Google ScholarCross Ref
- Lior, Madmoni, Boaz, & Rafaely. (2018). Direction of arrival estimation for reverberant speech based on enhanced decomposition of the direct sound. Selected Topics in Signal Processing IEEE Journal of.Google Scholar
- Jia, M., Jia, Y., Gao, S., Wang, J., & Wang, S (2020). Multi-source doa estimation in reverberant environments using potential single-source points enhancement. Applied Acoustics, 174(4), 107782.Google Scholar
- Jia, M., Sun, J., & Bao, C. (2018). Separation of multiple speech sources by recovering sparse and non-sparse components from b-format microphone recordings. Speech Communication: An International Journal, 96, 184-196.Google ScholarCross Ref
- Kai, W., Gopalan, R. V., & Khong, A (2018). Multi-source doa estimation in a reverberant environment using a single acoustic vector sensor. IEEE/ACM Transactions on Audio, Speech, and Language Processing, PP, 1-1.Google Scholar
- Yan Q., Sun S., Zhang H, Efficient Estimation of Mixing Matrix Using a Two-sensor Array[C]// ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2020.Google Scholar
Index Terms
- Sound Source Localization by Combining Phase Consistency and Angle Deviation
Recommendations
Sound Source Omnidirectional Positioning Calibration Method Based on Microphone Observation Angle
To get source azimuth from microphone observation angle of view in a complex real environment, this article, on the basis of the analysis of geometric positioning method, established a seven-element microphone array model and proposed a sound source ...
RF angle of arrival-based node localisation
Several localisation algorithms exist for wireless sensor networks that use angle of arrival measurements to derive node position. However, there are limited options for actually obtaining the angle of arrival using resource-constrained devices. In this ...
Virtual world consistency: A condition for STM systems (with a versatile protocol with invisible read operations)
The aim of a Software Transactional Memory (STM) is to discharge the programmers from the management of synchronization in multiprocess programs that access concurrent objects. To that end, an STM system provides the programmer with the concept of a ...
Comments