Recognition of human actions using texture descriptors

Kellokumpu, Vili; Zhao, Guoying; Pietikäinen, Matti

doi:10.1007/s00138-009-0233-8

Recognition of human actions using texture descriptors

Special Issue Paper
Published: 15 December 2009

Volume 22, pages 767–780, (2011)
Cite this article

Machine Vision and Applications Aims and scope Submit manuscript

Vili Kellokumpu¹,
Guoying Zhao¹ &
Matti Pietikäinen¹

493 Accesses
68 Citations
Explore all metrics

Abstract

Human motion can be seen as a type of texture pattern. In this paper, we adopt the ideas of spatiotemporal analysis and the use of local features for motion description. Two methods are proposed. The first one uses temporal templates to capture movement dynamics and then uses texture features to characterize the observed movements. We then extend this idea into a spatiotemporal space and describe human movements with dynamic texture features. Following recent trends in computer vision, the method is designed to work with image data rather than silhouettes. The proposed methods are computationally simple and suitable for various applications. We verify the performance of our methods on the popular Weizmann and KTH datasets, achieving high accuracy.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

Blank, M., Gorelick, L., Shechtman, E., Irani, M. Basri, R.: Actions as space-time shapes. In: Proceedings of the ICCV, pp. 1395–1402 (2005)
Bobick A., Davis J.: The recognition of human movement using temporal templates. PAMI 23(3), 257–267 (2001)
Article Google Scholar
Boiman, O., Irani, M.: Similarity by composition. In: Proceedings of the Neural Information Processing Systems (NIPS) (2006)
Dollár, P., Rabaud, V., Cottrell, G., Belongie, S.: Behavior recognition via sparse spatio-temporal features. In: VS-PETS Workshop (2005)
Gavrila D.M.: The visual analysis of human movement: a survey. CVIU 73(3), 82–98 (1999)
MATH Google Scholar
Heikkilä M., Pietikäinen M.: A texture-based method for modeling the background and detecting moving objects. PAMI 28(4), 657–662 (2006)
Article Google Scholar
Ikizler, N., Duygulu, P.: Human action recognition using distribution of oriented rectangular patches. In: ICCV Workshop on Human Motion Understanding, Modeling, Capture and Animation (2007)
Ke, Y., Sukthankar, R. Hebert, M.: Efficient visual event detection using volumetric features. In: Proceedings of the ICCV, pp. 165–173 (2005)
Ke, Y., Sukthankar, R., Hebert, M.: Spatio-temporal shape and flow correlation for action recognition. In: Proceedings of the CVPR, 8 pp (2007)
Kellokumpu, V., Zhao, G., Pietikäinen, M.: Human activity recognition using a dynamic texture based method. In: Proceedings of the BMVC, 10 pp (2008)
Kellokumpu, V., Pietikäinen, M., Heikkilä, J.: Human activity recognition using sequences of postures. In: Proceedings of the MVA, pp. 570–573 (2005)
Kellokumpu, V., Zhao, G., Pietikäinen, M.: texture based description of movements for activity analysis. In: Proceedings of the VISAPP (2008)
Kim K., Chalidabhongse T.H., Harwood D., Davis L.: Background modeling and subtraction by codebook construction. Proc. ICIP 5, 3061–3064 (2004)
Google Scholar
Kim, T., Wong, S. Cipolla, R.: Tensor canonical correlation analysis for action classification. In: Proceedings of the CVPR, 8 pp (2007)
Kobyashi T., Otsu N.: Action and simultaneous multiple-person identification using cubic higher-order auto-correlation. Proc. ICPR 4, 741–744 (2004)
Google Scholar
Laptev I., Lindeberg T.: Space-time interest points. Proc. ICCV 1, 432–439 (2003)
Google Scholar
Moeslund T.B., Hilton A., Krüger V.: A survey of advances in vision-based human motion capture and analysis. CVIU 104(2–3), 90–126 (2006)
Google Scholar
Niebles, J.C., Fei-Fei, L.: A hierarchical model of shape and appearance for human action classification. In: Proceedings of the CVPR, 8 pp (2007)
Niebles J., Wang H., Fei-Fei L.: Unsupervised learning of human action categories using spatial-temporal words. IJCV 79(3), 299–318 (2008)
Article Google Scholar
Niyogi, S.A., Adelson, E.H.: Analysing and recognizing walking figs in XYT. In: Proceedings of the CVPR, pp. 469–474 (1994)
Ojala T., Pietikäinen M., Mäenpää T.: Multiresolution gray-scale and rotation invariant texture classification with local binary patterns. PAMI 24(7), 971–987 (2002)
Article Google Scholar
Rabiner L.R.: A tutorial on hidden Markov models and selected applications in speech recognition. Proc. IEEE 77(2), 257–285 (1989)
Article Google Scholar
Schindler, K., van Gool, L.: Action snippets: how many frames does human action recognition require? In: Proceedings of the CVPR, 8 pp (2008)
Schuldt, C., Laptev, I., Caputo, B.: Recognizing human actions: a local SVM approach. In: Proceedings of the ICPR, pp. 32–36 (2004)
Scovanner, P., Ali, S., Shah, M.: A 3-dimensional SIFT descriptor and its application to action recognition. In: Proceedings of the ACM Multimedia, pp. 357–360 (2007)
Shechtman E., Irani M.: Space-time behavior based correlation. Proc. CVPR 1, 405–412 (2005)
Google Scholar
Stauffer C., Grimson W.E.L.: Adaptive background mixture models for real-time tracking. Proc. CVPR 2, 246–252 (1999)
Google Scholar
Wang, L., Suter, D.: Recognizing human activities from silhouettes: motion subspace and factorial discriminative graphical model. In: Proceedings of the CVPR, 8 pp (2007)
Wong, S., Kim, T., Cipolla, R.: Learning motion categories using both semantic and structural information. In: Proceedings of the CVPR, 6 pp (2007)
Weinland D., Ronfard R., Boyer E.: Free viewpoint action recognition using motion history volumes. CVIU 104(2-3), 249–257 (2006)
Google Scholar
Yilmaz A., Shah M.: Action sketch: a novel action representation. Proc. CVPR 1, 984–989 (2005)
Google Scholar
Zhao G., Pietikäinen M.: Dynamic texture recognition using local binary patterns with an application to facial expressions. PAMI 29(6), 915–928 (2007)
Article Google Scholar
Zhao G., Barnard M., Pietikäinen M.: Lipreading with local spatiotemporal descriptors. IEEE Trans. Multimed. 11(7), 1254–1265 (2009)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Machine Vision Group, University of Oulu, P.O. Box 4500, Oulu, Finland
Vili Kellokumpu, Guoying Zhao & Matti Pietikäinen

Authors

Vili Kellokumpu
View author publications
You can also search for this author in PubMed Google Scholar
Guoying Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Matti Pietikäinen
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Vili Kellokumpu.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Kellokumpu, V., Zhao, G. & Pietikäinen, M. Recognition of human actions using texture descriptors. Machine Vision and Applications 22, 767–780 (2011). https://doi.org/10.1007/s00138-009-0233-8

Download citation

Received: 01 August 2008
Revised: 09 July 2009
Accepted: 04 November 2009
Published: 15 December 2009
Issue Date: September 2011
DOI: https://doi.org/10.1007/s00138-009-0233-8

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Recognition of human actions using texture descriptors

Abstract

Access this article

Similar content being viewed by others

Directional Beams of Dense Trajectories for Dynamic Texture Recognition

A Comparison of Image Texture Descriptors for Pattern Classification

Sparse binarised statistical dynamic features for spatio-temporal texture analysis

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Recognition of human actions using texture descriptors

Abstract

Access this article

Similar content being viewed by others

Directional Beams of Dense Trajectories for Dynamic Texture Recognition

A Comparison of Image Texture Descriptors for Pattern Classification

Sparse binarised statistical dynamic features for spatio-temporal texture analysis

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation