skip to main content
10.1145/3594315.3594381acmotherconferencesArticle/Chapter ViewAbstractPublication PagesiccaiConference Proceedingsconference-collections
research-article

Sound Source Localization by Combining Phase Consistency and Angle Deviation

Authors Info & Claims
Published:02 August 2023Publication History

ABSTRACT

In this paper, a multi-sound source localization method based on combining phase consistency and angle deviation is proposed. Firstly, a single source point (SSP) detection method based on phase consistency is used to detect SSP. Secondly, the outliers were removed based on the method of angle deviation to obtain the detected SSP points. Finally, the detected SSP points were post-processed to obtain the Direction of Arrival (DOA) estimates. Experimental result shows that the proposed method has a higher localization accuracy compared with reference methods.

References

  1. Konrad Kowalczyk Parametric Spatial Sound Processing: A flexible and efficient solution to sound scene acquisition, modification, and reproduction. [J]. IEEE Signal Process. Mag., 2015, 32(2): 31-42.Google ScholarGoogle Scholar
  2. JIA M., YANG Z., BAO C., Encoding Multiple Audio Objects Using Intra Object Sparsity[J]. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2015, 23(6):1082–1095.Google ScholarGoogle Scholar
  3. Zheng, X., Ritz, C., & Xi, J. (2016). Encoding and communicating navigable speech soundfields. Multimedia Tools and Applications, 75(9), 5183-5204.Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Chen, J., Benesty, J., & Huang, Y. (2006). Time delay estimation in room acoustic environments. EURASIP Journal on Advances in Signal Processing.Google ScholarGoogle Scholar
  5. Nesta, F., Omologo, M. (2012). Generalized State Coherence Transform for Multidimensional TDOA Estimation of Multiple Sources. , 20(1), 246–260. doi:10.1109/tasl.2011.2160168Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Whitmire, Eric, Latif, Tahmid, Novak, & Tristan, (2016). Sound localization sensors for search and rescue biobots. IEEE Sensors Journal.Google ScholarGoogle Scholar
  7. Rui, L. I., & Huang, X. (2010). A modified fastica algorithm for blind signal separation. Communication Countermeasures.Google ScholarGoogle Scholar
  8. Hafezi, S., Moore, A. H., & Naylor, P. A. (2017). Multiple DOA estimation based on estimation consistency and spherical harmonic multiple signal classification. 2017 25th European Signal Processing Conference (EUSIPCO).Google ScholarGoogle ScholarCross RefCross Ref
  9. Valin, J. M., Michaud, François, Hadjou, B., & Rouat, J. (2004). Localization of Simultaneous Moving Sound Sources for Mobile Robot Using a Frequency-Domain Steered Beamformer Approach. IEEE International Conference on Robotics & Automation. IEEE.Google ScholarGoogle ScholarCross RefCross Ref
  10. Zibulevsky, M., B.A. Pearlmutter, Bofill, P., & Kisilev, P. (2001). Blind source separation by sparse decomposition in a signal dictionary.Google ScholarGoogle ScholarCross RefCross Ref
  11. Zheng, X., Ritz, C., & Xi, J. (2013). Collaborative blind source separation using location informed spatial microphones. IEEE.Google ScholarGoogle ScholarCross RefCross Ref
  12. Yilmaz, O., & Rickard, S. (2004). Blind separation of speech mixtures via time-frequency masking. IEEE Transactions on Signal Processing: A publication of the IEEE Signal Processing Society (7), 52.Google ScholarGoogle Scholar
  13. Pavlidi, D, Griffin, A, Puigt, & M, (2013). Real-time multiple sound source localization and counting using a circular microphone array. IEEE transactions on audio, speech and language processing, 21(10), 2193-2206.Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Jia, M., Sun, J., & Bao, C. (2017). Real-time multiple sound source localization and counting using a soundfield microphone. Journal of Ambient Intelligence & Humanized Computing, 8(6), 829-844.Google ScholarGoogle ScholarCross RefCross Ref
  15. Cobos, M., Lopez, J. J., & Martinez, D. (2011). Two-microphone multi-speaker localization based on a laplacian mixture model. Digital Signal Processing, 21(1), 66-76.Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Araki, S., Sawada, H., Mukai, R., & Makino, S. (2011). Doa estimation for multiple sparse sources with arbitrarily arranged multiple sensors. Journal of Signal Processing Systems, 63(3), 265-275.Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Swartling, M., B Sällberg, & N Grbić. (2011). Source localization for multiple speech sources using low complexity non-parametric source separation and clustering. Signal Processing, 91 (8), 1781-1788.Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Rafaely, B., & Alhaiany, K (2018). Speaker localization using direct path dominance test based on sound field directivity. Signal Processing, 143, 42-47.Google ScholarGoogle ScholarCross RefCross Ref
  19. Lior, Madmoni, Boaz, & Rafaely. (2018). Direction of arrival estimation for reverberant speech based on enhanced decomposition of the direct sound. Selected Topics in Signal Processing IEEE Journal of.Google ScholarGoogle Scholar
  20. Jia, M., Jia, Y., Gao, S., Wang, J., & Wang, S (2020). Multi-source doa estimation in reverberant environments using potential single-source points enhancement. Applied Acoustics, 174(4), 107782.Google ScholarGoogle Scholar
  21. Jia, M., Sun, J., & Bao, C. (2018). Separation of multiple speech sources by recovering sparse and non-sparse components from b-format microphone recordings. Speech Communication: An International Journal, 96, 184-196.Google ScholarGoogle ScholarCross RefCross Ref
  22. Kai, W., Gopalan, R. V., & Khong, A (2018). Multi-source doa estimation in a reverberant environment using a single acoustic vector sensor. IEEE/ACM Transactions on Audio, Speech, and Language Processing, PP, 1-1.Google ScholarGoogle Scholar
  23. Yan Q., Sun S., Zhang H, Efficient Estimation of Mixing Matrix Using a Two-sensor Array[C]// ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2020.Google ScholarGoogle Scholar

Index Terms

  1. Sound Source Localization by Combining Phase Consistency and Angle Deviation

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Other conferences
        ICCAI '23: Proceedings of the 2023 9th International Conference on Computing and Artificial Intelligence
        March 2023
        824 pages
        ISBN:9781450399029
        DOI:10.1145/3594315

        Copyright © 2023 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 2 August 2023

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article
        • Research
        • Refereed limited
      • Article Metrics

        • Downloads (Last 12 months)16
        • Downloads (Last 6 weeks)1

        Other Metrics

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      HTML Format

      View this article in HTML Format .

      View HTML Format