Skip to main content
Log in

Building ears for robots: Sound localization and separation

  • Original Article
  • Published:
Artificial Life and Robotics Aims and scope Submit manuscript

Abstract

This paper describes our research on bio-mimetic robot audition. Among the many binaural and monaural sound localization cues in the human auditory system, the interaural time difference cue is selected as it can easily be obtained by omnidirectional microphones. We have used a three-microphone system to remove the anterior-posterior ambiguity which occurs in two-microphone (or ear) systems. The echo-avoidance model of the precedence effect is used to cope with the echoes and reverberations of real environments. We mimicked the cocktail party effect by perceptual grouping of continuous components according to the spatial information obtained by the sound localization method. A wheel-based mobile robot equipped with an auditory system was developed. The auditory system has two sound processing parts. One is a DSP-based realtime system; the other is an off-line system composed of remote computers. Experiments of localizing and separating multiple sound sources and robot navigation were conducted to demonstrate the system's ability and potential applications.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Heffner RS, Heffner HE (1992) Evolution of sound localization in mammals. In: Webster DB, Fay RR, Popper AN (eds) The evolutionary biology of hearing. Springer, Berlin Heidelberg New York, pp. 691–715.

    Google Scholar 

  2. Nishizawa Y, Yagi Y, Yachida M 1993, Generation of environmental map and estimation of free space for a mobile robot using omnidirectional image sensor COPIS (in Japanese). J Robotics Soc Jpn 11: 868–874

    Google Scholar 

  3. Brooks RA, Stein LA (1993) Building brains for bodies. AI Memo 1439, MIT

  4. Huang J, Ohnishi N, Sugie N (1988) Research on a sound source localization system suggested by living orgarnnisms—Localization of a single sound source in an echoic environment (in Japanese). IEICE Trans (A), J71-A: 1780–1789.

    Google Scholar 

  5. Huang J, Ohnishi N, Sugie N (1995) A biomimetic system for localization and separation of multiple sound sources. IEEE Trans Instrum Meas 44: 733–738

    Article  Google Scholar 

  6. Huang J, Ohnishi N, Sugie N (1995) Sound separation based on perceptual grouping of sound segments. Proceedings International Symposium Nonlinear Theory and its Applications Las Vegas Dec. 1995 pp 67–72

  7. Huang J. Ohnishi N, Sugie N (1997) Sound localization in reverberant environment based on the model of the precedence effect. IEEE Trans Instrum Meas 46: 842–846

    Article  Google Scholar 

  8. Grantham DW (1995) Spatial hearing and related phenomena. In Moore BCJ (ed) Hearing. Academic Press, New York, pp 297–345

    Google Scholar 

  9. Blauert J (1997) Spatial hearing: The psychophysics of human sound localization. MIT Press, Cambridge

    Google Scholar 

  10. Knudsen EI (1981) The hearing of the barn owl. Sci Am 245: 82–91

    Article  Google Scholar 

  11. Konishi M (1986) Centrally synthesized maps of sensory space. Trends Neurosci 9: 163–168

    Article  Google Scholar 

  12. Wallach H, Newman EB, Rosenzweig MR (1949) The precedence effect in sound localization. J. Psychol Am 62: 315–336

    Article  Google Scholar 

  13. Gardner MB (1968) Historical background of the Haas and/or precedence effect. J Acoust Soc Am 43: 1243–1248

    Article  Google Scholar 

  14. Zurek PM (1987) The precedence effect. In Yost WA, Gourevitch G (eds) Directional hearing. Springer, Berlin Heidelberg New York, pp 85–105

    Google Scholar 

  15. Clifton RK, Freyman RL (1996) The precedence effect: Beyond echo suppression. In Gilkey RH, and Anderson TB (eds) Binaural and spatial hearing, Lawrence Erlbaum, Hilldale

    Google Scholar 

  16. Huang J, Ohnishi N, Guo X, Sugie N (1996) A computational model of the precedence effect based on echo-avoidance. Proceedings ASA/ASJ Third Joint Meeting, Honolulu, Dec. 1996 pp 635–640

  17. Cherry EC (1953) Some experiments on the recognition of speech with one and with two ears. J Acoust Soc Am 25: 975–979

    Article  Google Scholar 

  18. Spieth W, Curtis JF, Webster JC (1954) Responding to one of two simultaneous messages. J Acoust Soc Am 26: 391–396

    Article  Google Scholar 

  19. Mitchell OMM, Ross CA, Yates GH (1971) Singal processing for a cocktail party effect. J Acoust Soc Am 50: 656–660

    Article  Google Scholar 

  20. Bodden M (1993) Modeling human sound-source localization and the cocktail-party effect. Acta Acustica 1: 43–55

    Google Scholar 

  21. Bregman AS (1990) Auditory scene analysis: The perceptual organization of sound: MIT Press, Cambridge

    Google Scholar 

  22. Cooke M (1993) Modeling auditory processing and organisation. Cambridge University Press Cambridge

    Google Scholar 

  23. Ellis DPW (1994) A computer model of psychoacoustic grouping rules. Proceedings 12th International Conference on Pattern Recognition, Jerusalem, Oct. 1994

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jie Huang.

About this article

Cite this article

Huang, J., Ohnishi, N. & Sugie, N. Building ears for robots: Sound localization and separation. Artificial Life and Robotics 1, 157–163 (1997). https://doi.org/10.1007/BF02471133

Download citation

  • Received:

  • Accepted:

  • Issue Date:

  • DOI: https://doi.org/10.1007/BF02471133

Key words

Navigation