Human Activity Recognition Using Hierarchically-Mined Feature Constellations

Oikonomopoulos, Antonios; Pantic, Maja

doi:10.1007/978-3-642-41914-0_16

Antonios Oikonomopoulos²⁸ &
Maja Pantic^28,29

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 8033))

Included in the following conference series:

International Symposium on Visual Computing

2853 Accesses

Abstract

In this paper we address the problem of human activity modelling and recognition by means of a hierarchical representation of mined dense spatiotemporal features. At each level of the hierarchy, the proposed method selects feature constellations that are increasingly discriminative and characteristic of a specific action category, by taking into account how frequently they occur in that action category versus the rest of the available action categories in the training dataset. Each feature constellation consists of n-tuples of features selected in the previous level of the hierarchy and lying within a small spatiotemporal neighborhood. We use spatiotemporal Local Steering Kernel (LSK) features as a basis for our representation, due to their ability and efficiency in capturing the local structure and dynamics of the underlying activities. The proposed method is able to detect activities in unconstrained videos, by back-projecting the activated features at the locations at which they were activated. We test the proposed method on two publicly available datasets, namely the KTH and YouTube datasets of human bodily actions. The acquired results demonstrate the effectiveness of the proposed method in recognising a wide variety of activities.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Qualitative and Quantitative Spatio-temporal Relations in Daily Living Activity Recognition

Frame-Level Covariance Descriptor for Action Recognition

Trajectory Based Integrated Features for Action Classification from Depth Data

References

Weinland, D., Ronfard, R., Boyer, E.: A survey of vision-based methods for action representation, segmentation and recognition. Comp. Vision, and Image Understanding 115, 224–241 (2011)
Article Google Scholar
Laptev, I., Lindeberg, T.: Space-time Interest Points. In: IEEE Conf. on Computer Vision and Pattern Recognition, pp. 432–439 (2003)
Google Scholar
Lowe, D.: Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision 60, 91–110 (2004)
Article Google Scholar
Dollar, P., Rabaud, V., Cottrell, G., Belongie, S.: Behavior recognition via sparse spatio-temporal features. In: VS-PETS, pp. 65– 72 (2005)
Google Scholar
Bregonzio, M., Gong, S., Xiang, T.: Recognising action as clouds of space-time interest points. In: IEEE Conf. on Computer Vision and Pattern Recognition, pp. 1–8 (2009)
Google Scholar
Bay, H., Tuytelaars, T., Gool, L.V.: Surf: Speeded up robust features. Comp. Vision, and Image Understanding 110, 346–359 (2008)
Article Google Scholar
Jhuang, H., Serre, T., Wolf, L., Poggio, T.: A Biologically Inspired System for Action Recognition. In: Proc. IEEE Int. Conf. Computer Vision, pp. 1–8 (2007)
Google Scholar
Schindler, K., Gool, L.V.: Action snippets: How many frames does human action require? In: IEEE Conf. on Computer Vision and Pattern Recognition, pp. 1–8 (2008)
Google Scholar
Ali, S., Shah, M.: Human action recognition in videos using kinematic features and multiple instance learning. IEEE Trans. Pattern Analysis and Machine Intelligence (2010)
Google Scholar
Gilbert, A., Illingworth, J., Bowden, R.: Fast realistic multi-action recognition using mined dense spatio-temporal features. In: Proc. IEEE Int. Conf. Computer Vision (2009)
Google Scholar
Shechtman, E., Irani, M.: Matching local self-similarities across images and videos. In: IEEE Conf. on Computer Vision and Pattern Recognition, pp. 1–8 (2007)
Google Scholar
Seo, H., Milanfar, P.: Action recognition from one example. IEEE Trans. Pattern Analysis and Machine Intelligence 33, 867–882 (2011)
Article Google Scholar
Amer, M., Todorovic, S.: Sum-product networks for modeling activities with stochastic structure. In: IEEE Conf. on Computer Vision and Pattern Recognition, pp. 1314–1321 (2012)
Google Scholar
Friedman, J., Hastie, T., Tibshirani, R.: Additive logistic regression: a statistical view of boosting. Stanford University Technical Report (1993)
Google Scholar
Quack, T., Ferrari, V., Leibe, B., Gool, L.V.: Efficient mining of frequent and distinctive feature configurations. In: Proc. IEEE Int. Conf. Computer Vision (2007)
Google Scholar
Gilbert, A., Illingworth, J., Bowden, R.: Action recognition using mined hierarchical compound features. IEEE Trans. Pattern Analysis and Machine Intelligence 33, 883–897 (2011)
Article Google Scholar
Wang, L., Wang, Y., Jiang, T., Gao, W.: Instantly telling what happens in a video sequence using simple features. In: IEEE Conf. on Computer Vision and Pattern Recognition, pp. 3257–3264 (2011)
Google Scholar
Dong, G., Li, J.: Efficient mining of emerging patterns: Discovering trends and differences. In: ACM SIGKDD, pp. 43–52 (2004)
Google Scholar
Schuldt, C., Laptev, I., Caputo, B.: Recognizing human actions: a local svm approach. In: IEEE Conf. on Computer Vision and Pattern Recognition, vol. 3, pp. 32–36 (2004)
Google Scholar
Liu, J., Luo, J., Shah, M.: Recognizing realistic actions from videos ”in the wild”. In: IEEE Conf. on Computer Vision and Pattern Recognition (2009)
Google Scholar
Seo, H., Milanfar, P.: Training-free, generic object detection using locally adaptive regression kernels. IEEE Trans. Pattern Analysis and Machine Intelligence 32, 1688–1704 (2010)
Article Google Scholar
Laptev, I., Marszalek, M., Schmid, C., Rozenfeld, B.: Learning realistic human actions from movies. In: IEEE Conf. on Computer Vision and Pattern Recognition, pp. 1–8 (2008)
Google Scholar
Wang, H., Klaeser, A., Schmid, C., Liu, C.: Action recognition by dense trajectories. In: IEEE Conf. on Computer Vision and Pattern Recognition, pp. 3169–3176 (2011)
Google Scholar
Le, Q., Zou, W., Yeung, S., Ng, A.: Learning hierarchical invariant spatio-temporal features for action recognition with independent subspace analysis. In: IEEE Conf. on Computer Vision and Pattern Recognition, pp. 3361–3368 (2011)
Google Scholar
Ikizler-Cinbis, N., Sclaroff, S.: Object, scene and actions: Combining multiple features for human action recognition. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part I. LNCS, vol. 6311, pp. 494–507. Springer, Heidelberg (2010)
Chapter Google Scholar

Download references

Author information

Authors and Affiliations

Comp. Dept., Imperial College London, UK
Antonios Oikonomopoulos & Maja Pantic
EEMCS, University of Twente, The Netherlands
Maja Pantic

Authors

Antonios Oikonomopoulos
View author publications
You can also search for this author in PubMed Google Scholar
Maja Pantic
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

University of Nevada, Reno, NV, USA
George Bebis
NASA Ames Research Center, Moffett Field, CA, USA
Richard Boyle
Lawrence Berkeley National Laboratory, Berkeley, CA, USA
Bahram Parvin
Desert Research Institute, Reno, NV, USA
Darko Koracin
Arizona State University, Tempe, AZ, USA
Baoxin Li
Mitsubishi Electric Research Laboratories, Cambridge, MA, USA
Fatih Porikli
University of California, Riverside, CA, USA
Victor Zordan
AT&T Research Labs, Florham Park, NJ, USA
James Klosowski
ZIRST, Saint-Ismier Cedex, France
Sabine Coquillart
Qualcomm Research, San Diego, CA, USA
Xun Luo
Oxford e-Research Centre, University of Oxford, Oxford, UK
Min Chen
IBM, Hawthorne, NY, USA
David Gotz

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Oikonomopoulos, A., Pantic, M. (2013). Human Activity Recognition Using Hierarchically-Mined Feature Constellations. In: Bebis, G., et al. Advances in Visual Computing. ISVC 2013. Lecture Notes in Computer Science, vol 8033. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-41914-0_16

Download citation

DOI: https://doi.org/10.1007/978-3-642-41914-0_16
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-41913-3
Online ISBN: 978-3-642-41914-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics