Skip to main content

Human Activity Recognition Using Hierarchically-Mined Feature Constellations

  • Conference paper
Advances in Visual Computing (ISVC 2013)

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 8033))

Included in the following conference series:

  • 2853 Accesses

Abstract

In this paper we address the problem of human activity modelling and recognition by means of a hierarchical representation of mined dense spatiotemporal features. At each level of the hierarchy, the proposed method selects feature constellations that are increasingly discriminative and characteristic of a specific action category, by taking into account how frequently they occur in that action category versus the rest of the available action categories in the training dataset. Each feature constellation consists of n-tuples of features selected in the previous level of the hierarchy and lying within a small spatiotemporal neighborhood. We use spatiotemporal Local Steering Kernel (LSK) features as a basis for our representation, due to their ability and efficiency in capturing the local structure and dynamics of the underlying activities. The proposed method is able to detect activities in unconstrained videos, by back-projecting the activated features at the locations at which they were activated. We test the proposed method on two publicly available datasets, namely the KTH and YouTube datasets of human bodily actions. The acquired results demonstrate the effectiveness of the proposed method in recognising a wide variety of activities.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. Weinland, D., Ronfard, R., Boyer, E.: A survey of vision-based methods for action representation, segmentation and recognition. Comp. Vision, and Image Understanding 115, 224–241 (2011)

    Article  Google Scholar 

  2. Laptev, I., Lindeberg, T.: Space-time Interest Points. In: IEEE Conf. on Computer Vision and Pattern Recognition, pp. 432–439 (2003)

    Google Scholar 

  3. Lowe, D.: Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision 60, 91–110 (2004)

    Article  Google Scholar 

  4. Dollar, P., Rabaud, V., Cottrell, G., Belongie, S.: Behavior recognition via sparse spatio-temporal features. In: VS-PETS, pp. 65– 72 (2005)

    Google Scholar 

  5. Bregonzio, M., Gong, S., Xiang, T.: Recognising action as clouds of space-time interest points. In: IEEE Conf. on Computer Vision and Pattern Recognition, pp. 1–8 (2009)

    Google Scholar 

  6. Bay, H., Tuytelaars, T., Gool, L.V.: Surf: Speeded up robust features. Comp. Vision, and Image Understanding 110, 346–359 (2008)

    Article  Google Scholar 

  7. Jhuang, H., Serre, T., Wolf, L., Poggio, T.: A Biologically Inspired System for Action Recognition. In: Proc. IEEE Int. Conf. Computer Vision, pp. 1–8 (2007)

    Google Scholar 

  8. Schindler, K., Gool, L.V.: Action snippets: How many frames does human action require? In: IEEE Conf. on Computer Vision and Pattern Recognition, pp. 1–8 (2008)

    Google Scholar 

  9. Ali, S., Shah, M.: Human action recognition in videos using kinematic features and multiple instance learning. IEEE Trans. Pattern Analysis and Machine Intelligence (2010)

    Google Scholar 

  10. Gilbert, A., Illingworth, J., Bowden, R.: Fast realistic multi-action recognition using mined dense spatio-temporal features. In: Proc. IEEE Int. Conf. Computer Vision (2009)

    Google Scholar 

  11. Shechtman, E., Irani, M.: Matching local self-similarities across images and videos. In: IEEE Conf. on Computer Vision and Pattern Recognition, pp. 1–8 (2007)

    Google Scholar 

  12. Seo, H., Milanfar, P.: Action recognition from one example. IEEE Trans. Pattern Analysis and Machine Intelligence 33, 867–882 (2011)

    Article  Google Scholar 

  13. Amer, M., Todorovic, S.: Sum-product networks for modeling activities with stochastic structure. In: IEEE Conf. on Computer Vision and Pattern Recognition, pp. 1314–1321 (2012)

    Google Scholar 

  14. Friedman, J., Hastie, T., Tibshirani, R.: Additive logistic regression: a statistical view of boosting. Stanford University Technical Report (1993)

    Google Scholar 

  15. Quack, T., Ferrari, V., Leibe, B., Gool, L.V.: Efficient mining of frequent and distinctive feature configurations. In: Proc. IEEE Int. Conf. Computer Vision (2007)

    Google Scholar 

  16. Gilbert, A., Illingworth, J., Bowden, R.: Action recognition using mined hierarchical compound features. IEEE Trans. Pattern Analysis and Machine Intelligence 33, 883–897 (2011)

    Article  Google Scholar 

  17. Wang, L., Wang, Y., Jiang, T., Gao, W.: Instantly telling what happens in a video sequence using simple features. In: IEEE Conf. on Computer Vision and Pattern Recognition, pp. 3257–3264 (2011)

    Google Scholar 

  18. Dong, G., Li, J.: Efficient mining of emerging patterns: Discovering trends and differences. In: ACM SIGKDD, pp. 43–52 (2004)

    Google Scholar 

  19. Schuldt, C., Laptev, I., Caputo, B.: Recognizing human actions: a local svm approach. In: IEEE Conf. on Computer Vision and Pattern Recognition, vol. 3, pp. 32–36 (2004)

    Google Scholar 

  20. Liu, J., Luo, J., Shah, M.: Recognizing realistic actions from videos ”in the wild”. In: IEEE Conf. on Computer Vision and Pattern Recognition (2009)

    Google Scholar 

  21. Seo, H., Milanfar, P.: Training-free, generic object detection using locally adaptive regression kernels. IEEE Trans. Pattern Analysis and Machine Intelligence 32, 1688–1704 (2010)

    Article  Google Scholar 

  22. Laptev, I., Marszalek, M., Schmid, C., Rozenfeld, B.: Learning realistic human actions from movies. In: IEEE Conf. on Computer Vision and Pattern Recognition, pp. 1–8 (2008)

    Google Scholar 

  23. Wang, H., Klaeser, A., Schmid, C., Liu, C.: Action recognition by dense trajectories. In: IEEE Conf. on Computer Vision and Pattern Recognition, pp. 3169–3176 (2011)

    Google Scholar 

  24. Le, Q., Zou, W., Yeung, S., Ng, A.: Learning hierarchical invariant spatio-temporal features for action recognition with independent subspace analysis. In: IEEE Conf. on Computer Vision and Pattern Recognition, pp. 3361–3368 (2011)

    Google Scholar 

  25. Ikizler-Cinbis, N., Sclaroff, S.: Object, scene and actions: Combining multiple features for human action recognition. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part I. LNCS, vol. 6311, pp. 494–507. Springer, Heidelberg (2010)

    Chapter  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Oikonomopoulos, A., Pantic, M. (2013). Human Activity Recognition Using Hierarchically-Mined Feature Constellations. In: Bebis, G., et al. Advances in Visual Computing. ISVC 2013. Lecture Notes in Computer Science, vol 8033. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-41914-0_16

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-41914-0_16

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-41913-3

  • Online ISBN: 978-3-642-41914-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics