ABSTRACT
The ability to quickly obtain a comprehensive picture of the environment is an important prerequisite for the realization of robust and safe autonomous navigation with mobile robots. Due to their comparatively low computational requirements and cost, as well as their ability to provide omnidirectional perception, audio systems can be an excellent complement to more sophisticated or expensive sensing systems such as cameras or lidars. In this paper, we describe the development of a low-cost binaural audio system capable of interpreting the acoustic information present in an environment to assist a lidar system in making decisions that need to be made by a mobile robot. By taking advantage of the robot’s mobility, the system can resolve the uncertainties inherent in consciously choosing two off-the-shelf microphones and effectively realize 360-degree coverage. This opens up new possibilities, for example, to make room monitoring much more reliable.
- Dmitry Bogdanov, Nicolas Wack, Emilia Gómez, Sankalp Gulati, Perfecto Herrera, Oscar Mayor, Gerard Roma, Justin Salamon, José Zapata, and Xavier Serra. 2013. ESSENTIA: An Open-Source Library for Sound and Music Analysis. In Proceedings of the 21st ACM International Conference on Multimedia (Barcelona, Spain) (MM ’13). Association for Computing Machinery, New York, NY, USA, 855–858. https://doi.org/10.1145/2502081.2502229Google ScholarDigital Library
- A. Deleforge and R. Horaud. 2012. The cocktail party robot: Sound source separation and localisation with an active binaural head. In 2012 7th ACM/IEEE International Conference on Human-Robot Interaction (HRI). 431–438.Google Scholar
- J. Heitmann, F. Steinmetz, and C. Renner. 2018. Self-Localization of Micro AUVs Using a Low-Power, Low-Cost Acoustic Modem. In OCEANS 2018 MTS/IEEE Charleston. 1–9.Google Scholar
- Y. Hioka, K. Niwa, S. Sakauchi, K. Furuya, and Y. Haneda. 2011. Estimating Direct-to-Reverberant Energy Ratio Using D/R Spatial Correlation Matrix Model. IEEE Transactions on Audio, Speech, and Language Processing 19, 8 (2011), 2374–2384.Google ScholarDigital Library
- Fraunhofer IDMT. Last seen on: 09.06.2023. Hearing Car. https://www.idmt.fraunhofer.de/en/Press_and_Media/press_releases/2020/Hearing_Car.html.Google Scholar
- Ui-Hyun Kim and Hiroshi G. Okuno. 2013. Robust localization and tracking of multiple speakers in real environments for binaural robot audition. In 2013 14th International Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS). 1–4. https://doi.org/10.1109/WIAMIS.2013.6616137Google ScholarCross Ref
- Y. Lu and M. Cooke. 2010. Binaural Estimation of Sound Source Distance via the Direct-to-Reverberant Energy Ratio for Static and Moving Sources. IEEE Transactions on Audio, Speech, and Language Processing 18, 7 (2010), 1793–1805.Google ScholarDigital Library
- Nadimpalli Mahadev and Kevin B. Austin. 2015. Sound Localization by Robot Using Inter-Aural Time Differences. J. Comput. Sci. Coll. 30, 4 (April 2015), 50–56.Google Scholar
- M. Moussa, A. Moussa, and N. El-Sheimy. 2018. Multiple ultrasonic aiding system for car navigation in GNSS denied environment. In 2018 IEEE/ION Position, Location and Navigation Symposium (PLANS). 133–140.Google Scholar
- K. Nakamura, K. Nakadai, F. Asano, Y. Hasegawa, and H. Tsujino. 2009. Intelligent sound source localization for dynamic environments. In 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems. 664–669.Google Scholar
- D. Pavlidi, A. Griffin, M. Puigt, and A. Mouchtaris. 2013. Real-Time Multiple Sound Source Localization and Counting Using a Circular Microphone Array. IEEE Transactions on Audio, Speech, and Language Processing 21, 10 (2013), 2193–2206.Google ScholarDigital Library
- C. Renner and A. J. Golkowski. 2016. Acoustic Modem for Micro AUVs: Design and Practical Evaluation. In Proceedings of the 11th ACM International Conference on Underwater Networks and Systems (Shanghai, China) (WUWNet ’16). Association for Computing Machinery, New York, NY, USA, Article 2, 8 pages. https://doi.org/10.1145/2999504.3001076Google ScholarDigital Library
- F. Rund and F. Saturka. 2012. Alternatives to HRTF measurement. In 2012 35th International Conference on Telecommunications and Signal Processing (TSP). 648–652.Google Scholar
- S. Sakamoto, S. Hongo, R. Kadoi, and Y. Suzuki. 2008. SENZI and ASURA: New High-Precision Sound-Space Sensing Systems based on Symmetrically Arranged Numerous Microphones. In 2008 Second International Symposium on Universal Communication. 429–434. https://doi.org/10.1109/ISUC.2008.15Google ScholarDigital Library
- Seung Seob Yeom, Jong Suk Choi, Yoon Seob Lim, and Mignon Park. 2008. DSP implementation of probabilistic sound source localization. In 2008 IEEE Workshop on Signal Processing Systems. 204–209.Google ScholarCross Ref
- Ryuichi Shimoyama and Reo Fukuda. 2014. Room volume estimation based on statistical properties of binaural signals using humanoid robot. In 2014 23rd International Conference on Robotics in Alpe-Adria-Danube Region (RAAD). 1–6. https://doi.org/10.1109/RAAD.2014.7002274Google ScholarCross Ref
- J. . Valin, F. Michaud, J. Rouat, and D. Letourneau. 2003. Robust sound source localization using a microphone array on a mobile robot. In Proceedings 2003 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2003) (Cat. No.03CH37453), Vol. 2. 1228–1233 vol.2.Google Scholar
- W. Wang, Z. Li, W. Yu, and J. Zhang. 2009. An autonomous docking method based on ultrasonic sensors for self-reconfigurable mobile robot. In 2009 IEEE International Conference on Robotics and Biomimetics (ROBIO). 1744–1749.Google Scholar
- K. Watanabe, H. Suzuki, T. Qi, and K. Toshida. 1998. Basic research on underwater docking of flexible structures. In Proceedings. 1998 IEEE International Conference on Robotics and Automation (Cat. No.98CH36146), Vol. 1. 458–463 vol.1.Google Scholar
- M. Yiwere and E.J. Rhee. 2017. Distance estimation and localization of sound sources in reverberant conditions using deep neural networks. International Journal of Applied Engineering Research 12 (01 2017), 12384–12389.Google Scholar
- Pavel Zahorik. 2002. Direct-to-reverberant energy ratio sensitivity. The Journal of the Acoustical Society of America 112, 5 (2002), 2110–2117. https://doi.org/10.1121/1.1506692 arXiv:https://doi.org/10.1121/1.1506692Google ScholarCross Ref
Index Terms
- A low-cost binaural hearing support system for mobile robots
Recommendations
Binaural Noise Reduction Algorithms for Hearing Aids That Preserve Interaural Time Delay Cues
Binaural hearing aids use microphone inputs from both the left and right hearing aid to generate an output for each ear. On the other hand, a monaural hearing aid generates an output by processing only its own microphone inputs. This correspondence ...
Locating virtual sound sources at arbitrary distances in real-time binaural reproduction
A real-time system for sound spatialization via headphones is presented. Conventional headphone spatialization techniques effectively place sources on the surface of a virtual sphere around the listener. In the new system, sources can be spatialized at ...
A Probabilistic Model for Binaural Sound Localization
This paper proposes a biologically inspired and technically implemented sound localization system to robustly estimate the position of a sound source in the frontal azimuthal half-plane. For localization, binaural cues are extracted using cochleagrams ...
Comments