Abstract
In this study, we designed and developed an interactive interface for optimizing sound source localization with the multi-channel robot audition software HARK. The interface reduces the burden of parameter tuning and helps users carry out parameter optimization for sound source localization easily. To handle multi-channel sound properly, the parameter should be specified dynamically, from both temporal and spatial perspectives, yet almost all existing software allows only a static threshold. With our interactive interface, the user can create or delete sound sources on the MUSIC spectrum and set parameters appropriate for the environment. We also evaluated the software and found that the proposed interface was superior to the current HARK interface in terms of intuitiveness and visibility.
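The difference between a static threshold and one that varies over time and direction can be illustrated with a minimal Python sketch. It does not use HARK. The function names, the toy spectrum, and the threshold values are all assumptions made for illustration, and the paper's actual interface may work differently.

```python
# Hypothetical sketch: static vs. time- and direction-dependent thresholds
# applied to a MUSIC spatial spectrum. Names and values are illustrative
# assumptions, not part of HARK.

def detect_static(spectrum, threshold):
    """Return (frame, direction) cells whose power exceeds one fixed threshold."""
    return [
        (t, d)
        for t, frame in enumerate(spectrum)
        for d, power in enumerate(frame)
        if power > threshold
    ]


def detect_dynamic(spectrum, threshold_map):
    """Same test, but the threshold is looked up per frame and per direction."""
    return [
        (t, d)
        for t, frame in enumerate(spectrum)
        for d, power in enumerate(frame)
        if power > threshold_map[t][d]
    ]


# Toy MUSIC spectrum: 3 time frames x 4 direction bins (arbitrary power values).
# Bin 1 stands for a talker active in frames 0 and 1; bin 3 stands for a
# steady noise source that is not of interest.
spectrum = [
    [20.0, 31.0, 22.0, 27.0],
    [21.0, 33.0, 23.0, 28.0],
    [20.0, 24.0, 22.0, 29.0],
]

# One static threshold cannot separate the talker from the noise source.
static_hits = detect_static(spectrum, 26.0)

# Raising the threshold only for direction bin 3 suppresses the noise source
# while keeping the talker. A time-varying map would work the same way.
threshold_map = [[26.0, 26.0, 26.0, 30.0] for _ in spectrum]
dynamic_hits = detect_dynamic(spectrum, threshold_map)

print(static_hits)
print(dynamic_hits)
```

In this toy case the static threshold reports the noise direction in every frame, whereas the per-direction map keeps only the talker. An interactive editor for such a map is one way to read the kind of adjustment the abstract describes.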
Acknowledgments
This work was supported by KAKENHI-No. 24220006.
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Sugiyama, O., Kojima, R., Nakadai, K. (2015). Interactive Interface to Optimize Sound Source Localization with HARK. In: Ali, M., Kwon, Y., Lee, CH., Kim, J., Kim, Y. (eds) Current Approaches in Applied Artificial Intelligence. IEA/AIE 2015. Lecture Notes in Computer Science, vol 9101. Springer, Cham. https://doi.org/10.1007/978-3-319-19066-2_26
DOI: https://doi.org/10.1007/978-3-319-19066-2_26
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-19065-5
Online ISBN: 978-3-319-19066-2
eBook Packages: Computer Science, Computer Science (R0)