Abstract.
We propose a system that simultaneously utilizes the stereo disparity and optical flow information of real-time stereo grayscale multiresolution images for the recognition of objects and gestures in human interactions. For real-time calculation of the disparity and optical flow information of a stereo image, the system first creates pyramid images using a Gaussian filter. The system then determines the disparity and optical flow of a low-density image and extracts attention regions in a high-density image. The three foremost regions are recognized using higher-order local autocorrelation features and linear discriminant analysis. As the recognition method is view based, the system can process the face and hand recognitions simultaneously in real time. The recognition features are independent of parallel translations, so the system can use unstable extractions from stereo depth information. We demonstrate that the system can discriminate the users, monitor the basic movements of the user, smoothly learn an object presented by users, and can communicate with users by hand signs learned in advance.
Similar content being viewed by others
Author information
Authors and Affiliations
Additional information
Received: 31 January 2000 / Accepted: 1 May 2001
Correspondence to: I. Yoda (e-mail: yoda@ieee.org, Tel.: +81-298-615941, Fax: +81-298-613313)
Rights and permissions
About this article
Cite this article
Yoda, I., Sakaue, K. Utilization of stereo disparity and optical flow information for the computer analysis of human interactions. Machine Vision and Applications 13, 185–193 (2003). https://doi.org/10.1007/s00138-002-0062-5
Issue Date:
DOI: https://doi.org/10.1007/s00138-002-0062-5