Skip to main content
Log in

Recognition of human actions using texture descriptors

  • Special Issue Paper
  • Published:
Machine Vision and Applications Aims and scope Submit manuscript

Abstract

Human motion can be seen as a type of texture pattern. In this paper, we adopt the ideas of spatiotemporal analysis and the use of local features for motion description. Two methods are proposed. The first one uses temporal templates to capture movement dynamics and then uses texture features to characterize the observed movements. We then extend this idea into a spatiotemporal space and describe human movements with dynamic texture features. Following recent trends in computer vision, the method is designed to work with image data rather than silhouettes. The proposed methods are computationally simple and suitable for various applications. We verify the performance of our methods on the popular Weizmann and KTH datasets, achieving high accuracy.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Blank, M., Gorelick, L., Shechtman, E., Irani, M. Basri, R.: Actions as space-time shapes. In: Proceedings of the ICCV, pp. 1395–1402 (2005)

  2. Bobick A., Davis J.: The recognition of human movement using temporal templates. PAMI 23(3), 257–267 (2001)

    Article  Google Scholar 

  3. Boiman, O., Irani, M.: Similarity by composition. In: Proceedings of the Neural Information Processing Systems (NIPS) (2006)

  4. Dollár, P., Rabaud, V., Cottrell, G., Belongie, S.: Behavior recognition via sparse spatio-temporal features. In: VS-PETS Workshop (2005)

  5. Gavrila D.M.: The visual analysis of human movement: a survey. CVIU 73(3), 82–98 (1999)

    MATH  Google Scholar 

  6. Heikkilä M., Pietikäinen M.: A texture-based method for modeling the background and detecting moving objects. PAMI 28(4), 657–662 (2006)

    Article  Google Scholar 

  7. Ikizler, N., Duygulu, P.: Human action recognition using distribution of oriented rectangular patches. In: ICCV Workshop on Human Motion Understanding, Modeling, Capture and Animation (2007)

  8. Ke, Y., Sukthankar, R. Hebert, M.: Efficient visual event detection using volumetric features. In: Proceedings of the ICCV, pp. 165–173 (2005)

  9. Ke, Y., Sukthankar, R., Hebert, M.: Spatio-temporal shape and flow correlation for action recognition. In: Proceedings of the CVPR, 8 pp (2007)

  10. Kellokumpu, V., Zhao, G., Pietikäinen, M.: Human activity recognition using a dynamic texture based method. In: Proceedings of the BMVC, 10 pp (2008)

  11. Kellokumpu, V., Pietikäinen, M., Heikkilä, J.: Human activity recognition using sequences of postures. In: Proceedings of the MVA, pp. 570–573 (2005)

  12. Kellokumpu, V., Zhao, G., Pietikäinen, M.: texture based description of movements for activity analysis. In: Proceedings of the VISAPP (2008)

  13. Kim K., Chalidabhongse T.H., Harwood D., Davis L.: Background modeling and subtraction by codebook construction. Proc. ICIP 5, 3061–3064 (2004)

    Google Scholar 

  14. Kim, T., Wong, S. Cipolla, R.: Tensor canonical correlation analysis for action classification. In: Proceedings of the CVPR, 8 pp (2007)

  15. Kobyashi T., Otsu N.: Action and simultaneous multiple-person identification using cubic higher-order auto-correlation. Proc. ICPR 4, 741–744 (2004)

    Google Scholar 

  16. Laptev I., Lindeberg T.: Space-time interest points. Proc. ICCV 1, 432–439 (2003)

    Google Scholar 

  17. Moeslund T.B., Hilton A., Krüger V.: A survey of advances in vision-based human motion capture and analysis. CVIU 104(2–3), 90–126 (2006)

    Google Scholar 

  18. Niebles, J.C., Fei-Fei, L.: A hierarchical model of shape and appearance for human action classification. In: Proceedings of the CVPR, 8 pp (2007)

  19. Niebles J., Wang H., Fei-Fei L.: Unsupervised learning of human action categories using spatial-temporal words. IJCV 79(3), 299–318 (2008)

    Article  Google Scholar 

  20. Niyogi, S.A., Adelson, E.H.: Analysing and recognizing walking figs in XYT. In: Proceedings of the CVPR, pp. 469–474 (1994)

  21. Ojala T., Pietikäinen M., Mäenpää T.: Multiresolution gray-scale and rotation invariant texture classification with local binary patterns. PAMI 24(7), 971–987 (2002)

    Article  Google Scholar 

  22. Rabiner L.R.: A tutorial on hidden Markov models and selected applications in speech recognition. Proc. IEEE 77(2), 257–285 (1989)

    Article  Google Scholar 

  23. Schindler, K., van Gool, L.: Action snippets: how many frames does human action recognition require? In: Proceedings of the CVPR, 8 pp (2008)

  24. Schuldt, C., Laptev, I., Caputo, B.: Recognizing human actions: a local SVM approach. In: Proceedings of the ICPR, pp. 32–36 (2004)

  25. Scovanner, P., Ali, S., Shah, M.: A 3-dimensional SIFT descriptor and its application to action recognition. In: Proceedings of the ACM Multimedia, pp. 357–360 (2007)

  26. Shechtman E., Irani M.: Space-time behavior based correlation. Proc. CVPR 1, 405–412 (2005)

    Google Scholar 

  27. Stauffer C., Grimson W.E.L.: Adaptive background mixture models for real-time tracking. Proc. CVPR 2, 246–252 (1999)

    Google Scholar 

  28. Wang, L., Suter, D.: Recognizing human activities from silhouettes: motion subspace and factorial discriminative graphical model. In: Proceedings of the CVPR, 8 pp (2007)

  29. Wong, S., Kim, T., Cipolla, R.: Learning motion categories using both semantic and structural information. In: Proceedings of the CVPR, 6 pp (2007)

  30. Weinland D., Ronfard R., Boyer E.: Free viewpoint action recognition using motion history volumes. CVIU 104(2-3), 249–257 (2006)

    Google Scholar 

  31. Yilmaz A., Shah M.: Action sketch: a novel action representation. Proc. CVPR 1, 984–989 (2005)

    Google Scholar 

  32. Zhao G., Pietikäinen M.: Dynamic texture recognition using local binary patterns with an application to facial expressions. PAMI 29(6), 915–928 (2007)

    Article  Google Scholar 

  33. Zhao G., Barnard M., Pietikäinen M.: Lipreading with local spatiotemporal descriptors. IEEE Trans. Multimed. 11(7), 1254–1265 (2009)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Vili Kellokumpu.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Kellokumpu, V., Zhao, G. & Pietikäinen, M. Recognition of human actions using texture descriptors. Machine Vision and Applications 22, 767–780 (2011). https://doi.org/10.1007/s00138-009-0233-8

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00138-009-0233-8

Keywords

Navigation