Abstract
We present a neural architecture for gesture-based interaction between a mobile robot and human users. One crucial problem for natural interface techniques is the robustness under highly varying environmental conditions. Therefore, we propose a multiple cue approach for the localisation of a potential user in the operation field, followed by the aquisition and interpretaion of its gestural instructions. The whole approach is motivated in the context of a reliable operation scenario, but can be extended easily for other applications, such as videoconferencing.
Supported by the Thuringian Ministry of Science, Research and Culture (TMWFK, GESTIK-Project)
Preview
Unable to display preview. Download preview PDF.
References
Amari, S. (1977). Dynamics of Pattern Formation in Lateral-Inhibition Type Neural Fields. In: Biological Cybernetics No.27, pp. 77–87
Bobick, A.F. (1996). Computers Seeing Action. In: Proc. of British Machine Vision Conference ’97, MIT Media Lab. Perc. Comp. Sec. TR No. 394
Boehme, H.-J., Brakensiek, A., Braumann, U.-D., Krabbes, M., Gross, H.-M. (1997). Visually-Based Human-Machine-Interaction in a Neural Architecture. In: Proc. SOAVE’97-Workshop, Ilmenau, Germany, VDI Verlag, pp. 166–175
Boehme, H.-J., Braumann, U.-D., Brakensiek, A., Gross, H.-M., Krabbes, M. (1998). Neural Approach to Video-Based User Localisation for Human-Machine-Interaction. submitted to: IJCNN’98
Brakensiek, A., Braumann, U.-D., Boehme, H.-J., Rieck, C., Gross, H.-M. (1997). Farb-und strukturbasierte, neuronale Verfahren zur Lokalisation von Gesichtern in Real-World-Szenen. Mustererkennung 1997, Braunschweig, pp. 113–120
Braumann, U.-D., Brakensiek, A., Boehme, H.-J., Gross, H.-M. (1997). Ein neuronales Verfahren zur Segmentierung hautfarbener Bildregionen. 3. Workshop Farbbildverarbeitung, Erlangen, Germany, IRB Verlag Stuttgart, pp. 67–72
Braumann, U.-D, Brakensiek, A., Boehme, H.-J., Gross, H.-M. (1998). Gaborfilterbasierte visuelle Personenlokalisation mit dreidimensionalen Feldern dynamischer Neuronen. To appear in: Tagungsband des 3. Internationalen Workshops “Neuronale Netze in der Anwendung” (NN’98, Magdeburg)
Darrell, T., Basu, S., Wren, C., Pentland, A. (1997). Perceptually-driven Avatars and Interfaces: active methods for direct control. In: MIT Media Lab Perc. Comp. Sec. TR No. 416
Davis, J.W., Bobick, A.F. (1996). The Representation and Recognition of Action Using Temporal Templates. In: MIT Media Lab. Perc. Comp. Sec. TR No. 402
Fritzke, B. (1995). A Growing Neural Gas Network Learns Topologies. In: Advances in Neural Information Processing Systems 7, pp. 625–632
Goodridge, S.G. (1997.) Multimedia Sensor Fusion for Intelligent Camera Control and Human-Computer-Interaction. PhD Thesis, North Carolina State University
Gross, H.-M., Franke, R., Boehme, H.-J., Beck, C. (1992). A Neural Network Hierarchy for Data and Knowledge Controlled Selective Visual Attention. In: Proc. of ICANN’92, Brighton, pp. 825–828
Gross, H.-M., Boehme, H.-J. (1997). Verhaltensorganisation in interaktiven und lernfaehigen mobilen Systemen —das MILVA-Projekt. Proc. SOAVE’97-Workshop, Ilmenau, Germany, VDI Verlag
Hunke M.H. (1994). Locating and Tracking of Human Faces with Neural Networks. CMU-CS-94-155
Kopecz, K. (1996). Neural field dynamics provide robust control for attentional resources. In: B. Mertsching (ed.), Aktives Sehen in technischen und biologischen Systemen, Infix-Verlag, pp. 137–144
Kortenkamp, D., Huber, E., Bonasso, P.R. (1996). Recognizing and interpreting gestures on a mobile robot. In: Proc. of the Thirteenth Nat. Conf. on Art. Intell. (AAI-96)
Krabbes, M., H.-J. Boehme, V. Stephan, H.-M. Gross (1997). Extension of the ALVINN-Architecture for Robust Visual Guidance of a Miniature Robot. Proc. EUROBOT’97, to appear
Kubisch, R., Ritter, H. (1996). Erkennung menschlicher Kopfhaltungen mittels kuenstlicher neuronaler Netze. In: Mustererkennung 1996, pp. 109–117
Littmann, E., Ritter, H. (1995). Neural and Statistical Methods for Adaptive Color Segmentation —A Comparison. In: Mustererkennung 1995, pp. 84–93
Maggioni, C., Kaemmerer, B. (1996). GestureComputer —History, Design, and Applications. In: Proceedings of the Workshop on Computer Vision in Man-Machine Interfaces, R. Cippola (ed.), Cambridge University Press
Moghaddam, B., Pentland, A. (1995). Maximum Likelihood Detection of Faces and Hands. In: Proc. Int. Worksh. on Aut. Face-and Gesture-Recog., pp. 122–128
Pomerleau, D.A. (1993). Neural Network Perception for Mobile Robot Guidance. Kluwer Academic Publishers
Pomierski, T., Gross, H.-M. (1996). Biological Neural Architecture for Chromatic Adaptation Resulting in Constant Color Sensations. In: Proc. of the Int. Conf. on Neural Networks (ICNN’96), pp. 734–739
Rowley, H. A., Baluja, S. Kanade, T. (1995). Human Face Detection in Visual Scenes. In: Technical Report CMU-CS-95-158R, Pittsburgh
Schiele, B., Waibel, A. (1995). Gaze Tracking Based on Face-Color. In: Proc. Int. Worksh. on Aut. Face-and Gesture-Recog., pp. 344–349
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 1998 Springer-Verlag
About this paper
Cite this paper
Boehme, HJ., Brakensiek, A., Braumann, UD., Krabbes, M., Gross, HM. (1998). Neural architecture for gesture-based human-machine-interaction. In: Wachsmuth, I., Fröhlich, M. (eds) Gesture and Sign Language in Human-Computer Interaction. GW 1997. Lecture Notes in Computer Science, vol 1371. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0053002
Download citation
DOI: https://doi.org/10.1007/BFb0053002
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-64424-8
Online ISBN: 978-3-540-69782-4
eBook Packages: Springer Book Archive