Abstract
Human motion can be seen as a type of texture pattern. In this paper, we adopt the ideas of spatiotemporal analysis and the use of local features for motion description. Two methods are proposed. The first one uses temporal templates to capture movement dynamics and then uses texture features to characterize the observed movements. We then extend this idea into a spatiotemporal space and describe human movements with dynamic texture features. Following recent trends in computer vision, the method is designed to work with image data rather than silhouettes. The proposed methods are computationally simple and suitable for various applications. We verify the performance of our methods on the popular Weizmann and KTH datasets, achieving high accuracy.
Similar content being viewed by others
References
Blank, M., Gorelick, L., Shechtman, E., Irani, M. Basri, R.: Actions as space-time shapes. In: Proceedings of the ICCV, pp. 1395–1402 (2005)
Bobick A., Davis J.: The recognition of human movement using temporal templates. PAMI 23(3), 257–267 (2001)
Boiman, O., Irani, M.: Similarity by composition. In: Proceedings of the Neural Information Processing Systems (NIPS) (2006)
Dollár, P., Rabaud, V., Cottrell, G., Belongie, S.: Behavior recognition via sparse spatio-temporal features. In: VS-PETS Workshop (2005)
Gavrila D.M.: The visual analysis of human movement: a survey. CVIU 73(3), 82–98 (1999)
Heikkilä M., Pietikäinen M.: A texture-based method for modeling the background and detecting moving objects. PAMI 28(4), 657–662 (2006)
Ikizler, N., Duygulu, P.: Human action recognition using distribution of oriented rectangular patches. In: ICCV Workshop on Human Motion Understanding, Modeling, Capture and Animation (2007)
Ke, Y., Sukthankar, R. Hebert, M.: Efficient visual event detection using volumetric features. In: Proceedings of the ICCV, pp. 165–173 (2005)
Ke, Y., Sukthankar, R., Hebert, M.: Spatio-temporal shape and flow correlation for action recognition. In: Proceedings of the CVPR, 8 pp (2007)
Kellokumpu, V., Zhao, G., Pietikäinen, M.: Human activity recognition using a dynamic texture based method. In: Proceedings of the BMVC, 10 pp (2008)
Kellokumpu, V., Pietikäinen, M., Heikkilä, J.: Human activity recognition using sequences of postures. In: Proceedings of the MVA, pp. 570–573 (2005)
Kellokumpu, V., Zhao, G., Pietikäinen, M.: texture based description of movements for activity analysis. In: Proceedings of the VISAPP (2008)
Kim K., Chalidabhongse T.H., Harwood D., Davis L.: Background modeling and subtraction by codebook construction. Proc. ICIP 5, 3061–3064 (2004)
Kim, T., Wong, S. Cipolla, R.: Tensor canonical correlation analysis for action classification. In: Proceedings of the CVPR, 8 pp (2007)
Kobyashi T., Otsu N.: Action and simultaneous multiple-person identification using cubic higher-order auto-correlation. Proc. ICPR 4, 741–744 (2004)
Laptev I., Lindeberg T.: Space-time interest points. Proc. ICCV 1, 432–439 (2003)
Moeslund T.B., Hilton A., Krüger V.: A survey of advances in vision-based human motion capture and analysis. CVIU 104(2–3), 90–126 (2006)
Niebles, J.C., Fei-Fei, L.: A hierarchical model of shape and appearance for human action classification. In: Proceedings of the CVPR, 8 pp (2007)
Niebles J., Wang H., Fei-Fei L.: Unsupervised learning of human action categories using spatial-temporal words. IJCV 79(3), 299–318 (2008)
Niyogi, S.A., Adelson, E.H.: Analysing and recognizing walking figs in XYT. In: Proceedings of the CVPR, pp. 469–474 (1994)
Ojala T., Pietikäinen M., Mäenpää T.: Multiresolution gray-scale and rotation invariant texture classification with local binary patterns. PAMI 24(7), 971–987 (2002)
Rabiner L.R.: A tutorial on hidden Markov models and selected applications in speech recognition. Proc. IEEE 77(2), 257–285 (1989)
Schindler, K., van Gool, L.: Action snippets: how many frames does human action recognition require? In: Proceedings of the CVPR, 8 pp (2008)
Schuldt, C., Laptev, I., Caputo, B.: Recognizing human actions: a local SVM approach. In: Proceedings of the ICPR, pp. 32–36 (2004)
Scovanner, P., Ali, S., Shah, M.: A 3-dimensional SIFT descriptor and its application to action recognition. In: Proceedings of the ACM Multimedia, pp. 357–360 (2007)
Shechtman E., Irani M.: Space-time behavior based correlation. Proc. CVPR 1, 405–412 (2005)
Stauffer C., Grimson W.E.L.: Adaptive background mixture models for real-time tracking. Proc. CVPR 2, 246–252 (1999)
Wang, L., Suter, D.: Recognizing human activities from silhouettes: motion subspace and factorial discriminative graphical model. In: Proceedings of the CVPR, 8 pp (2007)
Wong, S., Kim, T., Cipolla, R.: Learning motion categories using both semantic and structural information. In: Proceedings of the CVPR, 6 pp (2007)
Weinland D., Ronfard R., Boyer E.: Free viewpoint action recognition using motion history volumes. CVIU 104(2-3), 249–257 (2006)
Yilmaz A., Shah M.: Action sketch: a novel action representation. Proc. CVPR 1, 984–989 (2005)
Zhao G., Pietikäinen M.: Dynamic texture recognition using local binary patterns with an application to facial expressions. PAMI 29(6), 915–928 (2007)
Zhao G., Barnard M., Pietikäinen M.: Lipreading with local spatiotemporal descriptors. IEEE Trans. Multimed. 11(7), 1254–1265 (2009)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Kellokumpu, V., Zhao, G. & Pietikäinen, M. Recognition of human actions using texture descriptors. Machine Vision and Applications 22, 767–780 (2011). https://doi.org/10.1007/s00138-009-0233-8
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00138-009-0233-8