Skip to main content

Development of a Sound Source Localization System for Assisting Group Conversation

  • Conference paper
Intelligent Robotics and Applications (ICIRA 2013)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 8102))

Included in the following conference series:

Abstract

In this study, we developed a sound source localization system, which consists of Jellyfish-02 and HARK robot audition software, in order to reduce the number of wires for evaluating speech duration. Sound source localization performance of Jellyfish-02 is evaluated by precision, recall, and F-measure. Performance of Jellyfish-02 is superior to conventional microphone arrays. During the experiment, we found that F-measure becomes smaller as the number of speakers increases. We investigated the percentage of speech overlapped periods in natural conversation for the purpose of examining the applicability of the system to measure speech duration in group conversation. From the results, Jelyyfish-02 surpasses conventional microphone array in design and usability. It is potentially applicable for assisting group conversion by measuring duration of speech for each participant.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Fay, R.R., Popper, A.N.: Evolution of hearing in vertebrates: The inner ears and processing. Hear. Res. 149, 1–10 (2000)

    Article  Google Scholar 

  2. Zhao, S., Ahmed, S., Liang, Y., Rupnow, K., Chen, D., Jones, D.L.: A Real-Time 3D Sound Localization System with Miniature Microphone Array for Virtual Reality. In: 2012 7th IEEE Conference on Industrial Electronics and Applications (ICIEA), pp. 1853–1857 (2012)

    Google Scholar 

  3. Nakamura, K., Nakadai, K., Ince, G.: Real-time super-resolution Sound Source Localization for robots. In: 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 694–699 (2012)

    Google Scholar 

  4. Nakashima, H., Mukai, T.: 3D Sound Source Localization System Based on Learning of Binaural Hearing. In: 2005 IEEE International Conference on Man and Cybernetics, Systems, pp. 3534–3539 (2005)

    Google Scholar 

  5. Cho, Y., Yook, D., Chang, S., Kim, H.: Sound Source Localization for Robot Auditory Systems. IEEE Transactions on Consumer Electronics 55(3), 1663–16692 (2009)

    Article  Google Scholar 

  6. Nakadai, K., Takahashi, T., Okuno, H.G., Nakajima, H., Hasegawa, Y., Tsujino, H.: Design and Implementation of Robot Audition System ’HARK’- Open Source Software for Listening to Three Simultaneous Speakers. Advanced Robotics 24(5-6), 739–761 (2010)

    Article  Google Scholar 

  7. Yamamoto, S., Nakadai, K., Nakano, M., Tsujino, H., Valin, J.-M., Komatani, K., Ogata, T., Okuno, H.G.: Design and implementation of a robot audition system for automatic Speech recognition of simultaneous speech. In: Proc. of ASRU, pp. 11–116 (2007)

    Google Scholar 

  8. Yamaguchi, T., Ota, J., Otake, M.: A system that assists group conversation of older adults by evaluating speech duration and facial expression of each participant during conversation. In: Proc. of 2012 IEEE International Conference on Robotics and Automation, pp. 4481–4487 (2012)

    Google Scholar 

  9. Otake, M.: Coimagination method: Sharing imagination with images and time limit. In: Proceedings of the International Reminiscence and Life Review Conference, pp. 97–103 (2009)

    Google Scholar 

  10. Otake, M., Kato, M., Takagi, T., Asama, H.: Coimagination method: supporting interactive conversation for activation, of episodic memory, division of attention, planning function, and its evaluation via conversation interactivity measuring method. In: Proceedings of the 2009 International Symposium on Early Detection and Rehabilitation Technology of Dementia, pp. 167–170 (2009)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Otake, M., Nergui, M., Moon, Se., Takagi, K., Kamashima, T., Nakadai, K. (2013). Development of a Sound Source Localization System for Assisting Group Conversation. In: Lee, J., Lee, M.C., Liu, H., Ryu, JH. (eds) Intelligent Robotics and Applications. ICIRA 2013. Lecture Notes in Computer Science(), vol 8102. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-40852-6_54

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-40852-6_54

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-40851-9

  • Online ISBN: 978-3-642-40852-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics