
Detection of Speaker Direction Based on the On-and-Off Microphone Combination for Entertainment Robots

  • Conference paper
Entertainment Computing - ICEC 2005 (ICEC 2005)

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 3711)


Abstract

An important function of entertainment robots is voice communication with humans. To realize it, accurate speech recognition and a speaker-direction detection mechanism are necessary. The direct-noise problem is serious in such speech processing: a microphone attached to the robot body receives not only human voices but also motor and mechanical noises directly. These direct noises are often louder than distant voices and severely degrade the speech recognition rate. Even if a microphone close to the user ("on-mic") is used for speech recognition, the body microphones ("off-mic") are still necessary for detecting the speaker direction under severe direct-noise conditions. This paper describes a new method for detecting the speaker direction based on the on-and-off microphone combination. The system searches for the spectral elements of the "on-mic" voice in the "off-mic" channels. The segregated power ratio or the time delay between the "off-mic" channels is then used to detect the speaker direction. Experiments show that the proposed method effectively improves direction detection accuracy while the robot is moving.
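
The power-ratio idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation, which is not visible in this preview; it only assumes that the frequency bins where the "on-mic" voice is strongest are looked up in two "off-mic" channels, and that the power in those bins is compared. The function names (`dft_bin`, `voice_bins`, `segregated_power_ratio_db`), the single-frame analysis, the `top_k` parameter, and the synthetic signals are all hypothetical choices made for illustration.

```python
import cmath
import math
import random


def dft_bin(frame, k):
    """Return the k-th DFT coefficient of a real-valued frame (naive sum)."""
    n = len(frame)
    return sum(x * cmath.exp(-2j * math.pi * k * t / n) for t, x in enumerate(frame))


def voice_bins(on_mic, top_k=2):
    """Pick the top_k strongest positive-frequency bins of the on-mic frame."""
    n = len(on_mic)
    power = {k: abs(dft_bin(on_mic, k)) ** 2 for k in range(1, n // 2)}
    return sorted(power, key=power.get, reverse=True)[:top_k]


def segregated_power_ratio_db(on_mic, off_left, off_right, top_k=2, eps=1e-12):
    """Left/right off-mic power ratio in dB, restricted to the on-mic voice bins.

    A positive value suggests the speaker is on the left side, a negative
    value the right side (an assumed convention for this sketch).
    """
    bins = voice_bins(on_mic, top_k)
    p_left = sum(abs(dft_bin(off_left, k)) ** 2 for k in bins)
    p_right = sum(abs(dft_bin(off_right, k)) ** 2 for k in bins)
    return 10.0 * math.log10((p_left + eps) / (p_right + eps))


# Synthetic demo: a two-tone "voice" reaches the left body microphone more
# strongly than the right one, and each body microphone also picks up its own
# broadband "direct noise". All gains and noise levels are made-up values.
random.seed(0)
n = 256
on_mic = [math.sin(2 * math.pi * 10 * t / n) + 0.5 * math.sin(2 * math.pi * 23 * t / n)
          for t in range(n)]
off_left = [1.0 * v + random.gauss(0.0, 1.0) for v in on_mic]
off_right = [0.1 * v + random.gauss(0.0, 1.0) for v in on_mic]

ratio_db = segregated_power_ratio_db(on_mic, off_left, off_right)
swapped_db = segregated_power_ratio_db(on_mic, off_right, off_left)
print(f"left/right ratio: {ratio_db:.1f} dB, swapped: {swapped_db:.1f} dB")
```

Restricting the comparison to bins taken from the close microphone is what keeps the broadband noise in the body microphones from dominating the ratio. A time-delay variant would compare the phases of the same bins across the two "off-mic" channels instead of their powers.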





Copyright information

© 2005 IFIP International Federation for Information Processing

About this paper

Cite this paper

Kawabata, T., Fujiwara, M., Shibutani, T. (2005). Detection of Speaker Direction Based on the On-and-Off Microphone Combination for Entertainment Robots. In: Kishino, F., Kitamura, Y., Kato, H., Nagata, N. (eds) Entertainment Computing - ICEC 2005. ICEC 2005. Lecture Notes in Computer Science, vol 3711. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11558651_25


  • DOI: https://doi.org/10.1007/11558651_25

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-29034-6

  • Online ISBN: 978-3-540-32054-8

  • eBook Packages: Computer Science (R0)
