Skip to main content

3D Human Action Recognition Using Spatio-temporal Motion Templates

  • Conference paper
Computer Vision in Human-Computer Interaction (HCI 2005)

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 3766))

Included in the following conference series:

Abstract

Our goal is automatic recognition of basic human actions, such as stand, sit and wave hands, to aid in natural communication between a human and a computer. Human actions are inferred from human body joint motions, but such data has high dimensionality and large spatial and temporal variations may occur in executing the same action. We present a learning-based approach for the representation and recognition of 3D human action. Each action is represented by a template consisting of a set of channels with weights. Each channel corresponds to the evolution of one 3D joint coordinate and its weight is learned according to the Neyman-Pearson criterion. We use the learned templates to recognize actions based on χ 2 error measurement. Results of recognizing 22 actions on a large set of motion capture sequences as well as several annotated and automatically tracked sequences show the effectiveness of the proposed algorithm.

This research was supported, in part, by the Advanced Research and Development Activity of the U.S. Government under contract No. MDA904-03-C1786

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Agarwal, A., Triggs, B.: 3D Human Pose from Silhouettes by Relevance Vector Regression. In: Proc. of CVPR, pp. 882–888 (2004)

    Google Scholar 

  2. Campbell, L., Bobick, A.: Recognition of human body motion using phase space constraints. In: Proc. of ICCV, pp. 624–630 (1995)

    Google Scholar 

  3. Davis, J., Bobick, A.: The Representation and Recognition of Action Using Temporal Templates. In: Proc. of CVPR, pp. 928–934 (1997)

    Google Scholar 

  4. Derpanis, K., Wildes, R., Tsotsos, J.: Hand Gesture Recognition within a Linguistics-Based Framework. In: Pajdla, T., Matas, J(G.) (eds.) ECCV 2004. LNCS, vol. 3021, pp. 282–296. Springer, Heidelberg (2004)

    Chapter  Google Scholar 

  5. Gong, S., Walter, M., Psarrou, A.: Recognition of temporal structures: Learning prior and propagating observation augmented densities via hidden Markov states. In: Proc. of ICCV, pp. 157–162 (1999)

    Google Scholar 

  6. Johansson, G.: Visual perception of biological motion and a model for its analysis. Perception and Psychophysics 14(2), 201–211 (1973)

    Article  Google Scholar 

  7. Lee, M.W., Nevatia, R.: Dynamic Human Pose Estimation using Markov chain Monte Carlo Approach. In: Proc. of the IEEE Workshop on Motion and Video Computing, WACV/MOTION 2005 (2005)

    Google Scholar 

  8. Oikonomopoulos, A., Patras, I., Pantic, M.: Spatiotemporal saliency for human action recognition. In Proc. of IEEE Int’l Conf. on Multimedia and Expo (ICME 2005) 2005

    Google Scholar 

  9. Parameswaran, V., Chellappa, R.: View invariants for human action recognition. In: Proc. of CVPR, pp. 613–619 (2003)

    Google Scholar 

  10. Rabiner, L.R.: A tutorial on Hidden Markov Models and selected applications in speech recognition. Proc. of the IEEE 77(2), 257–286 (1989)

    Article  Google Scholar 

  11. Rao, C., Yilmaz, A., Shah, M.: View-Invariant Representation and Recognition of Actions. Int’l Journal of Computer Vision 50(2), 203–226 (2002)

    Article  MATH  Google Scholar 

  12. Shechtman, E., Irani, M.: Space-Time Behavior Based Correlation. In: Proc. of CVPR, pp. 405–412 (2005)

    Google Scholar 

  13. Shokoufandeh, A., Dickinson, S.J., Jonsson, C., Bretzner, L., Lindeberg, T.: On the representation and matching of qualitative shape at multiple scales. In: Heyden, A., Sparr, G., Nielsen, M., Johansen, P. (eds.) ECCV 2002. LNCS, vol. 2352, pp. 759–775. Springer, Heidelberg (2002)

    Chapter  Google Scholar 

  14. Trees, V., Detection, H.L.: Estimation and Modulation Theory, Part I, 6th edn. John Wiley and Sons, New York (1968) ISBN 0-47109517-6

    MATH  Google Scholar 

  15. Zhang, Z., Wu, Y., Shan, Y., Shafer, S.: Visual panel: Virtual mouse keyboard and 3d controller with an ordinary piece of paper. In: Workshop on Perceptive User Interfaces, ACM Digital Library, New York (November 2001) ISBN 1-58113448-7

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Lv, F., Nevatia, R., Lee, M.W. (2005). 3D Human Action Recognition Using Spatio-temporal Motion Templates. In: Sebe, N., Lew, M., Huang, T.S. (eds) Computer Vision in Human-Computer Interaction. HCI 2005. Lecture Notes in Computer Science, vol 3766. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11573425_12

Download citation

  • DOI: https://doi.org/10.1007/11573425_12

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-29620-1

  • Online ISBN: 978-3-540-32129-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics