Abstract
This paper describes the development of a real-time perceptive user interface. Two cameras are used to detect a user’s head, eyes, hand, fingers and gestures. These cues are interpreted to control a user interface on a large screen. The result is a fully functional integrated system that processes roughly 7.5 frames per second on a Pentium IV system. The calibration of this setup is carried out through a few simple and intuitive routines, making the system adaptive and accessible to non-expert users. The minimal hardware requirements are two web-cams and a computer. The paper will describe how the user is observed (head, eye, hand and finger detection, gesture recognition), the 3D geometry involved, and the calibration steps necessary to set up the system.
This work was supported by GOA/2004/05, Research Fund K.U.Leuven and the ETH project Blue C II
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Wren, C.R., Azerbayejani, A., Darell, T., Pentland, A.P.: Pfinder: Real-time tracking of the human body. IEEE Transactions on Pattern Analysis and Machine Intelligence 19, 780–785 (1997)
Plänkers, R., Fua, P.: Articulated soft objects for video-based body modeling. In: Proceedings 8th International Conference on Computer Vision (2001)
Jones, M.J., Rehg, J.M.: Statistical color models with application to skin detection. Cambridge Research Laboratory Technical Report Series CRL98/11 (1998)
Jedynak, B., Zheng, H., Daoudi, M., Barret, D.: Maximum entropy models for skin detection. Technical Report publication IRMA 57 (2002)
Mester, R., Aach, T., Dümbgen, L.: Illumination-invariant change detection using a statistical colinearity criterion. In: Pattern Recognition: Proceedings 23rd DAGM Symposium, pp. 170–177 (2001)
Hung, Y.P., Yang, Y.S., Chen, Y.S., Hsieh, I.B., Fuh, C.S.: Free-hand pointer by use of an active stereo vision system. In: Proceedings of Third Asian Conference on Computer Vision 1, pp. 632–639 (1998)
Sánchez-Nielsen, E., Antón-Canaís, L., Hernández-Tejera, M.: Hand gesture recognition for human-machine interaction. Journal of WSCG 12 (2004)
Gool, L.V.: Beeldinterpretatie en Computer Vision II. Visics, KULeuven (2002-2003)
Koninckx, T., Griesser, A., Van Gool, L.: Real-time range scanning of deformable surfaces by adaptively coded structured light. In: Kawada, S. (ed.) Fourth International Conference on 3-D Digital Imaging and Modeling (3DIM 2003), pp. 293–302 (2003)
Koninckx, T., Van Gool, L.: High-speed active 3d acquisition based on a patternspecific mesh. In: SPIE’s 15th annual symposium on electronic imaging - videometrics VII, vol. 5013, pp. 26–37 (2003)
Fischler, M., Bolles, R.: Random sampling consensus: a paradigm for model fitting with application to image analysis and automated cartography. Commun. Assoc. Comp. Mach. 24, 381–395 (1981)
Hartley, R.I., Zisserman, A.: Multiple view geometry in computer vision, 239–240 (2000)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Van den Bergh, M., Servaes, W., Caenen, G., De Roeck, S., Van Gool, L. (2005). Perceptive User Interface, a Generic Approach. In: Sebe, N., Lew, M., Huang, T.S. (eds) Computer Vision in Human-Computer Interaction. HCI 2005. Lecture Notes in Computer Science, vol 3766. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11573425_6
Download citation
DOI: https://doi.org/10.1007/11573425_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29620-1
Online ISBN: 978-3-540-32129-3
eBook Packages: Computer ScienceComputer Science (R0)