Abstract
Our goal is automatic recognition of basic human actions, such as stand, sit and wave hands, to aid in natural communication between a human and a computer. Human actions are inferred from human body joint motions, but such data has high dimensionality and large spatial and temporal variations may occur in executing the same action. We present a learning-based approach for the representation and recognition of 3D human action. Each action is represented by a template consisting of a set of channels with weights. Each channel corresponds to the evolution of one 3D joint coordinate and its weight is learned according to the Neyman-Pearson criterion. We use the learned templates to recognize actions based on χ 2 error measurement. Results of recognizing 22 actions on a large set of motion capture sequences as well as several annotated and automatically tracked sequences show the effectiveness of the proposed algorithm.
This research was supported, in part, by the Advanced Research and Development Activity of the U.S. Government under contract No. MDA904-03-C1786
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Agarwal, A., Triggs, B.: 3D Human Pose from Silhouettes by Relevance Vector Regression. In: Proc. of CVPR, pp. 882–888 (2004)
Campbell, L., Bobick, A.: Recognition of human body motion using phase space constraints. In: Proc. of ICCV, pp. 624–630 (1995)
Davis, J., Bobick, A.: The Representation and Recognition of Action Using Temporal Templates. In: Proc. of CVPR, pp. 928–934 (1997)
Derpanis, K., Wildes, R., Tsotsos, J.: Hand Gesture Recognition within a Linguistics-Based Framework. In: Pajdla, T., Matas, J(G.) (eds.) ECCV 2004. LNCS, vol. 3021, pp. 282–296. Springer, Heidelberg (2004)
Gong, S., Walter, M., Psarrou, A.: Recognition of temporal structures: Learning prior and propagating observation augmented densities via hidden Markov states. In: Proc. of ICCV, pp. 157–162 (1999)
Johansson, G.: Visual perception of biological motion and a model for its analysis. Perception and Psychophysics 14(2), 201–211 (1973)
Lee, M.W., Nevatia, R.: Dynamic Human Pose Estimation using Markov chain Monte Carlo Approach. In: Proc. of the IEEE Workshop on Motion and Video Computing, WACV/MOTION 2005 (2005)
Oikonomopoulos, A., Patras, I., Pantic, M.: Spatiotemporal saliency for human action recognition. In Proc. of IEEE Int’l Conf. on Multimedia and Expo (ICME 2005) 2005
Parameswaran, V., Chellappa, R.: View invariants for human action recognition. In: Proc. of CVPR, pp. 613–619 (2003)
Rabiner, L.R.: A tutorial on Hidden Markov Models and selected applications in speech recognition. Proc. of the IEEE 77(2), 257–286 (1989)
Rao, C., Yilmaz, A., Shah, M.: View-Invariant Representation and Recognition of Actions. Int’l Journal of Computer Vision 50(2), 203–226 (2002)
Shechtman, E., Irani, M.: Space-Time Behavior Based Correlation. In: Proc. of CVPR, pp. 405–412 (2005)
Shokoufandeh, A., Dickinson, S.J., Jonsson, C., Bretzner, L., Lindeberg, T.: On the representation and matching of qualitative shape at multiple scales. In: Heyden, A., Sparr, G., Nielsen, M., Johansen, P. (eds.) ECCV 2002. LNCS, vol. 2352, pp. 759–775. Springer, Heidelberg (2002)
Trees, V., Detection, H.L.: Estimation and Modulation Theory, Part I, 6th edn. John Wiley and Sons, New York (1968) ISBN 0-47109517-6
Zhang, Z., Wu, Y., Shan, Y., Shafer, S.: Visual panel: Virtual mouse keyboard and 3d controller with an ordinary piece of paper. In: Workshop on Perceptive User Interfaces, ACM Digital Library, New York (November 2001) ISBN 1-58113448-7
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Lv, F., Nevatia, R., Lee, M.W. (2005). 3D Human Action Recognition Using Spatio-temporal Motion Templates. In: Sebe, N., Lew, M., Huang, T.S. (eds) Computer Vision in Human-Computer Interaction. HCI 2005. Lecture Notes in Computer Science, vol 3766. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11573425_12
Download citation
DOI: https://doi.org/10.1007/11573425_12
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29620-1
Online ISBN: 978-3-540-32129-3
eBook Packages: Computer ScienceComputer Science (R0)