Abstract
This paper describes our research on bio-mimetic robot audition. Among the many binaural and monaural sound localization cues in the human auditory system, the interaural time difference cue is selected as it can easily be obtained by omnidirectional microphones. We have used a three-microphone system to remove the anterior-posterior ambiguity which occurs in two-microphone (or ear) systems. The echo-avoidance model of the precedence effect is used to cope with the echoes and reverberations of real environments. We mimicked the cocktail party effect by perceptual grouping of continuous components according to the spatial information obtained by the sound localization method. A wheel-based mobile robot equipped with an auditory system was developed. The auditory system has two sound processing parts. One is a DSP-based realtime system; the other is an off-line system composed of remote computers. Experiments of localizing and separating multiple sound sources and robot navigation were conducted to demonstrate the system's ability and potential applications.
Similar content being viewed by others
References
Heffner RS, Heffner HE (1992) Evolution of sound localization in mammals. In: Webster DB, Fay RR, Popper AN (eds) The evolutionary biology of hearing. Springer, Berlin Heidelberg New York, pp. 691–715.
Nishizawa Y, Yagi Y, Yachida M 1993, Generation of environmental map and estimation of free space for a mobile robot using omnidirectional image sensor COPIS (in Japanese). J Robotics Soc Jpn 11: 868–874
Brooks RA, Stein LA (1993) Building brains for bodies. AI Memo 1439, MIT
Huang J, Ohnishi N, Sugie N (1988) Research on a sound source localization system suggested by living orgarnnisms—Localization of a single sound source in an echoic environment (in Japanese). IEICE Trans (A), J71-A: 1780–1789.
Huang J, Ohnishi N, Sugie N (1995) A biomimetic system for localization and separation of multiple sound sources. IEEE Trans Instrum Meas 44: 733–738
Huang J, Ohnishi N, Sugie N (1995) Sound separation based on perceptual grouping of sound segments. Proceedings International Symposium Nonlinear Theory and its Applications Las Vegas Dec. 1995 pp 67–72
Huang J. Ohnishi N, Sugie N (1997) Sound localization in reverberant environment based on the model of the precedence effect. IEEE Trans Instrum Meas 46: 842–846
Grantham DW (1995) Spatial hearing and related phenomena. In Moore BCJ (ed) Hearing. Academic Press, New York, pp 297–345
Blauert J (1997) Spatial hearing: The psychophysics of human sound localization. MIT Press, Cambridge
Knudsen EI (1981) The hearing of the barn owl. Sci Am 245: 82–91
Konishi M (1986) Centrally synthesized maps of sensory space. Trends Neurosci 9: 163–168
Wallach H, Newman EB, Rosenzweig MR (1949) The precedence effect in sound localization. J. Psychol Am 62: 315–336
Gardner MB (1968) Historical background of the Haas and/or precedence effect. J Acoust Soc Am 43: 1243–1248
Zurek PM (1987) The precedence effect. In Yost WA, Gourevitch G (eds) Directional hearing. Springer, Berlin Heidelberg New York, pp 85–105
Clifton RK, Freyman RL (1996) The precedence effect: Beyond echo suppression. In Gilkey RH, and Anderson TB (eds) Binaural and spatial hearing, Lawrence Erlbaum, Hilldale
Huang J, Ohnishi N, Guo X, Sugie N (1996) A computational model of the precedence effect based on echo-avoidance. Proceedings ASA/ASJ Third Joint Meeting, Honolulu, Dec. 1996 pp 635–640
Cherry EC (1953) Some experiments on the recognition of speech with one and with two ears. J Acoust Soc Am 25: 975–979
Spieth W, Curtis JF, Webster JC (1954) Responding to one of two simultaneous messages. J Acoust Soc Am 26: 391–396
Mitchell OMM, Ross CA, Yates GH (1971) Singal processing for a cocktail party effect. J Acoust Soc Am 50: 656–660
Bodden M (1993) Modeling human sound-source localization and the cocktail-party effect. Acta Acustica 1: 43–55
Bregman AS (1990) Auditory scene analysis: The perceptual organization of sound: MIT Press, Cambridge
Cooke M (1993) Modeling auditory processing and organisation. Cambridge University Press Cambridge
Ellis DPW (1994) A computer model of psychoacoustic grouping rules. Proceedings 12th International Conference on Pattern Recognition, Jerusalem, Oct. 1994
Author information
Authors and Affiliations
Corresponding author
About this article
Cite this article
Huang, J., Ohnishi, N. & Sugie, N. Building ears for robots: Sound localization and separation. Artificial Life and Robotics 1, 157–163 (1997). https://doi.org/10.1007/BF02471133
Received:
Accepted:
Issue Date:
DOI: https://doi.org/10.1007/BF02471133