Skip to main content

Human Activity Recognition: A Scheme Using Multiple Cues

  • Conference paper
Advances in Visual Computing (ISVC 2010)

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 6454))

Included in the following conference series:

Abstract

In this work, a schematic model for human activity recognition based on multiple cues is introduced. In the beginning, a sequence of temporal silhouettes of the moving human body parts are extracted from a video clip (i.e., an action snippet). Next, each action snippet is temporally split into several time-slices represented by fuzzy intervals. As shape features, a variety of descriptors both boundary-based (Fourier descriptors, Curvature features) and region-based (Moments, Moment-based features) are then extracted from the silhouettes at each time-slice. Finally, an NB (Naïve Bayes) classifier is learned in the feature space for activity classification. The performance of the method was evaluated on the KTH dataset and the obtained results are quite encouraging and show that an accuracy on par with or exceeding that of existing methods is achievable. Further the simplicity and computational efficiency of the features employed allow the method to achieve real-time performance, and thus it can provide latency guarantees to real-time applications.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Chakraborty, B., Bagdanov, A.D., Gonzàlez, J.: Towards real-time human action recognition. In: Araujo, H., Mendonça, A.M., Pinho, A.J., Torres, M.I. (eds.) IbPRIA 2009. LNCS, vol. 5524, pp. 425–432. Springer, Heidelberg (2009)

    Google Scholar 

  2. Jiang, Y.G., Ngo, C.W., Yang, J.: Towards optimal bag-of-features for object categorization and semantic video retrieval. In: Proc. ACM Int. Conf. Image and Video Retrieval, vol. 8, pp. 494–501 (2007)

    Google Scholar 

  3. Little, L., Boyd, J.E.: Recognizing people by their gait: The shape of motion. International Journal of Computer Vision 1, 1–32 (1998)

    Google Scholar 

  4. Cutler, R., Davis, L.S.: Robust real-time periodic motion detection, analysis, and applications. IEEE Transactions on PAMI 22, 781–796 (2000)

    Article  Google Scholar 

  5. Thuran, C., Hlaváč, V.: Pose primitive based human action recognition in videos or still images. In: IEEE Computer Society Conference on CVPR (2008)

    Google Scholar 

  6. Lu, W.L., Okuma, K., Little, J.J.: Tracking and recognizing actions of multiple hockey players using the boosted particle filter. Image and Vision Computing 27, 189–205 (2009)

    Article  Google Scholar 

  7. Sadek, S., Al-Hamadi, A., Michaelis, B., Sayed, U.: Toward robust action retrieval in video. In: Proc. of BMVC 2010 (2010)

    Google Scholar 

  8. Laptev, I., Pérez, P.: Retrieving actions in movies. In: Proc. ICCV, pp. 1–8 (2007)

    Google Scholar 

  9. Dollàr, P., Rabaud, V., Cottrell, G., Belongie, S.: Behavior recognition via sparse spatio-temporal features. In: Proc. IEEE Workshop on VS-PETS, pp. 65–72 (2005)

    Google Scholar 

  10. Liu, J., Shah, M.: Learning human actions via information maximization. In: IEEE Int. Conference on Computer Vision and. Pattern Recognition (2008)

    Google Scholar 

  11. Bobick, A.F., Davis, J.W.: The recognition of human movement using temporal templates. IEEE Transactions on PAMI 23, 257–267 (2001)

    Article  Google Scholar 

  12. Shechtman, E., Irani, M.: Space-time behavior based correlation. Computer Vision and. Pattern Recognition 1, 405–412 (2005)

    Google Scholar 

  13. Rodriguez, M.D., Ahmed, J., Shah, M.: Action MACH: A spatio-temporal maximum average correlation height filter for action recognition. In: Proc. CVPR (2008)

    Google Scholar 

  14. Jhuang, H., Serre, T., Wolf, L., Poggio, T.: A biologically inspired system for action recognition. In: IEEE International Conference on Computer Vision, pp. 257–267 (2007)

    Google Scholar 

  15. Schindler, K., Van Gool, L.: Action snippets: How many frames does action recognition require? In: Proc. CVPR, pp. 1–8 (2008)

    Google Scholar 

  16. Laxton, B., Lim, J., Kriegman, D.: Leveraging temporal, contextual and ordering constraints for recognizing complex activities in video. In: Proc. CVPR, pp. 1–8 (2007)

    Google Scholar 

  17. Blei, D.M., Lafferty, J.D.: Correlated topic models. Advances in Neural Information Processing Systems (NIPS) 18, 147–154 (2006)

    Google Scholar 

  18. Stauffer, C., Grimson, W.E.L.: Adaptive background mixture models for real-time tracking. In: CVPR, pp. 246–2527 (1999)

    Google Scholar 

  19. Hu, M.: Visual pattern recognition by moment invariants. IRE Tr. on. Inf. Theory, 179–187 (1962)

    Google Scholar 

  20. Alajlan, N., Kamel, M.S., Freeman, G.: Multi-object image retrieval based on shape and topology. Signal Processing: Image Communication 21, 904–918 (2006)

    Google Scholar 

  21. Domingos, P., Pazzani, M.: On the optimality of the simple bayesian classifier under zero-one loss. Machine Learning 29, 103–137 (1997)

    Article  MATH  Google Scholar 

  22. Wang, Y., Mori, G.: Max-margin hidden conditional random fields for human action recognition. In: CVPR (2009)

    Google Scholar 

  23. Rapantzikos, K., Avrithis, Y., Kollias, S.: Dense saliency-based spatiotemporal feature points for action recognition. In: CVPR, pp. 1–8 (2009)

    Google Scholar 

  24. Ke, Y., Sukthankar, R., Hebert, M.: Efficient visual event detection using volumetric features. In: Proc. ICCV, vol. 1, pp. 166–173 (2005)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Sadek, S., Al-Hamadi, A., Michaelis, B., Sayed, U. (2010). Human Activity Recognition: A Scheme Using Multiple Cues. In: Bebis, G., et al. Advances in Visual Computing. ISVC 2010. Lecture Notes in Computer Science, vol 6454. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-17274-8_56

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-17274-8_56

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-17273-1

  • Online ISBN: 978-3-642-17274-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics