Abstract
In this study, we developed a sound source localization system consisting of Jellyfish-02 and the HARK robot audition software, in order to reduce the number of wires required for evaluating speech duration. The sound source localization performance of Jellyfish-02 was evaluated in terms of precision, recall, and F-measure, and was superior to that of conventional microphone arrays. During the experiment, we found that the F-measure decreases as the number of speakers increases. To examine whether the system can be applied to measuring speech duration in group conversation, we investigated the percentage of periods with overlapping speech in natural conversation. The results show that Jellyfish-02 surpasses conventional microphone arrays in design and usability. It is potentially applicable to assisting group conversation by measuring the duration of speech of each participant.
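The evaluation quantities named in the abstract can be illustrated with a minimal sketch. It is not the authors' evaluation code. The function names, the count-based inputs (true positives, false positives, false negatives), and the choice to express overlap as a share of total speech time are all assumptions made for illustration, since the abstract does not define them.

```python
from typing import List, Tuple

Interval = Tuple[float, float]  # (start, end) of one utterance, in seconds


def precision_recall_f(tp: int, fp: int, fn: int) -> Tuple[float, float, float]:
    """Standard precision, recall, and F-measure (harmonic mean) from counts.

    tp, fp, fn are hypothetical counts of correct, spurious, and missed
    localizations; how the paper counts them is not stated in the abstract.
    """
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    total = precision + recall
    f_measure = 2 * precision * recall / total if total else 0.0
    return precision, recall, f_measure


def overlap_percentage(speakers: List[List[Interval]]) -> float:
    """Percentage of speech time during which two or more speakers talk.

    Assumes the intervals of a single speaker do not overlap each other,
    and that the denominator is the time during which anyone is speaking.
    """
    events = []
    for intervals in speakers:
        for start, end in intervals:
            events.append((start, 1))   # a speaker starts talking
            events.append((end, -1))    # a speaker stops talking
    events.sort()  # at equal times, stops (-1) sort before starts (+1)

    active = 0          # number of speakers currently talking
    prev = None         # time of the previous event
    speech = 0.0        # time with at least one active speaker
    overlap = 0.0       # time with at least two active speakers
    for time, delta in events:
        if prev is not None and active >= 1:
            speech += time - prev
            if active >= 2:
                overlap += time - prev
        active += delta
        prev = time
    return 100.0 * overlap / speech if speech else 0.0
```

For example, with one speaker talking from 0 s to 4 s and another from 3 s to 5 s, someone is speaking for 5 s in total and both speak for 1 s, so `overlap_percentage([[(0.0, 4.0)], [(3.0, 5.0)]])` gives 20 percent under these assumptions.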
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Otake, M., Nergui, M., Moon, Se., Takagi, K., Kamashima, T., Nakadai, K. (2013). Development of a Sound Source Localization System for Assisting Group Conversation. In: Lee, J., Lee, M.C., Liu, H., Ryu, JH. (eds) Intelligent Robotics and Applications. ICIRA 2013. Lecture Notes in Computer Science, vol 8102. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-40852-6_54
DOI: https://doi.org/10.1007/978-3-642-40852-6_54
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-40851-9
Online ISBN: 978-3-642-40852-6
eBook Packages: Computer Science (R0)