Abstract
Effective human-robot interaction is essential for service robots to penetrate the market widely. Such robots need vision systems to recognize objects, but it is difficult to build vision systems that work under varied conditions, so more robust object recognition and image segmentation techniques are needed. We have therefore proposed using the human user’s assistance, given through speech, for object recognition. Our previous system assumed that images could be segmented without failure. However, when objects are occluded and/or composed of multicolor parts, segmentation failures cannot be avoided. This paper presents an extended system that recognizes objects in occlusion and/or multicolor cases using geometric and photometric analysis of images. When the robot is not sure about the segmentation results, it asks the user questions, choosing expressions appropriate to its degree of certainty, to remove the ambiguity.
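The certainty-dependent questioning described above can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract gives no thresholds, data structures, or wording, so the names `MergeHypothesis` and `choose_utterance`, the two threshold values, and the question phrasings are all hypothetical and chosen only to show the idea of asking nothing, asking for confirmation, or asking an open question depending on how certain the robot is.

```python
from dataclasses import dataclass
from typing import Optional

# Hypothetical thresholds; the abstract does not give numeric values.
HIGH_CERTAINTY = 0.8
LOW_CERTAINTY = 0.4


@dataclass
class MergeHypothesis:
    """A guess that two segmented regions belong to the same object."""
    region_a: str     # description of the first region, e.g. its color
    region_b: str     # description of the second region
    certainty: float  # assumed to lie in [0, 1]


def choose_utterance(h: MergeHypothesis) -> Optional[str]:
    """Return a question for the user, or None when no question is needed."""
    if not 0.0 <= h.certainty <= 1.0:
        raise ValueError("certainty must be in [0, 1]")
    if h.certainty >= HIGH_CERTAINTY:
        # Confident enough to accept the hypothesis without asking.
        return None
    if h.certainty >= LOW_CERTAINTY:
        # Fairly sure: a yes/no confirmation is enough.
        return (f"Are the {h.region_a} part and the {h.region_b} part "
                "one object?")
    # Unsure: ask an open question instead of suggesting an answer.
    return (f"I see a {h.region_a} part and a {h.region_b} part. "
            "How many objects are there?")
```

For example, under these assumed thresholds `choose_utterance(MergeHypothesis("red", "blue", 0.5))` yields a yes/no confirmation question, while a certainty of 0.9 yields no question at all.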
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Hossain, M.A., Kurnia, R., Kuno, Y. (2005). Geometric and Photometric Analysis for Interactively Recognizing Multicolor or Partially Occluded Objects. In: Bebis, G., Boyle, R., Koracin, D., Parvin, B. (eds) Advances in Visual Computing. ISVC 2005. Lecture Notes in Computer Science, vol 3804. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11595755_17
DOI: https://doi.org/10.1007/11595755_17
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-30750-1
Online ISBN: 978-3-540-32284-9
eBook Packages: Computer Science (R0)