Human Activity Recognition: A Scheme Using Multiple Cues

Sadek, Samy; Al-Hamadi, Ayoub; Michaelis, Bernd; Sayed, Usama

doi:10.1007/978-3-642-17274-8_56

Samy Sadek²⁸,
Ayoub Al-Hamadi²⁸,
Bernd Michaelis²⁸ &
…
Usama Sayed²⁹

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 6454))

Included in the following conference series:

International Symposium on Visual Computing

2373 Accesses
5 Citations

Abstract

In this work, a schematic model for human activity recognition based on multiple cues is introduced. In the beginning, a sequence of temporal silhouettes of the moving human body parts are extracted from a video clip (i.e., an action snippet). Next, each action snippet is temporally split into several time-slices represented by fuzzy intervals. As shape features, a variety of descriptors both boundary-based (Fourier descriptors, Curvature features) and region-based (Moments, Moment-based features) are then extracted from the silhouettes at each time-slice. Finally, an NB (Naïve Bayes) classifier is learned in the feature space for activity classification. The performance of the method was evaluated on the KTH dataset and the obtained results are quite encouraging and show that an accuracy on par with or exceeding that of existing methods is achievable. Further the simplicity and computational efficiency of the features employed allow the method to achieve real-time performance, and thus it can provide latency guarantees to real-time applications.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Chakraborty, B., Bagdanov, A.D., Gonzàlez, J.: Towards real-time human action recognition. In: Araujo, H., Mendonça, A.M., Pinho, A.J., Torres, M.I. (eds.) IbPRIA 2009. LNCS, vol. 5524, pp. 425–432. Springer, Heidelberg (2009)
Google Scholar
Jiang, Y.G., Ngo, C.W., Yang, J.: Towards optimal bag-of-features for object categorization and semantic video retrieval. In: Proc. ACM Int. Conf. Image and Video Retrieval, vol. 8, pp. 494–501 (2007)
Google Scholar
Little, L., Boyd, J.E.: Recognizing people by their gait: The shape of motion. International Journal of Computer Vision 1, 1–32 (1998)
Google Scholar
Cutler, R., Davis, L.S.: Robust real-time periodic motion detection, analysis, and applications. IEEE Transactions on PAMI 22, 781–796 (2000)
Article Google Scholar
Thuran, C., Hlaváč, V.: Pose primitive based human action recognition in videos or still images. In: IEEE Computer Society Conference on CVPR (2008)
Google Scholar
Lu, W.L., Okuma, K., Little, J.J.: Tracking and recognizing actions of multiple hockey players using the boosted particle filter. Image and Vision Computing 27, 189–205 (2009)
Article Google Scholar
Sadek, S., Al-Hamadi, A., Michaelis, B., Sayed, U.: Toward robust action retrieval in video. In: Proc. of BMVC 2010 (2010)
Google Scholar
Laptev, I., Pérez, P.: Retrieving actions in movies. In: Proc. ICCV, pp. 1–8 (2007)
Google Scholar
Dollàr, P., Rabaud, V., Cottrell, G., Belongie, S.: Behavior recognition via sparse spatio-temporal features. In: Proc. IEEE Workshop on VS-PETS, pp. 65–72 (2005)
Google Scholar
Liu, J., Shah, M.: Learning human actions via information maximization. In: IEEE Int. Conference on Computer Vision and. Pattern Recognition (2008)
Google Scholar
Bobick, A.F., Davis, J.W.: The recognition of human movement using temporal templates. IEEE Transactions on PAMI 23, 257–267 (2001)
Article Google Scholar
Shechtman, E., Irani, M.: Space-time behavior based correlation. Computer Vision and. Pattern Recognition 1, 405–412 (2005)
Google Scholar
Rodriguez, M.D., Ahmed, J., Shah, M.: Action MACH: A spatio-temporal maximum average correlation height filter for action recognition. In: Proc. CVPR (2008)
Google Scholar
Jhuang, H., Serre, T., Wolf, L., Poggio, T.: A biologically inspired system for action recognition. In: IEEE International Conference on Computer Vision, pp. 257–267 (2007)
Google Scholar
Schindler, K., Van Gool, L.: Action snippets: How many frames does action recognition require? In: Proc. CVPR, pp. 1–8 (2008)
Google Scholar
Laxton, B., Lim, J., Kriegman, D.: Leveraging temporal, contextual and ordering constraints for recognizing complex activities in video. In: Proc. CVPR, pp. 1–8 (2007)
Google Scholar
Blei, D.M., Lafferty, J.D.: Correlated topic models. Advances in Neural Information Processing Systems (NIPS) 18, 147–154 (2006)
Google Scholar
Stauffer, C., Grimson, W.E.L.: Adaptive background mixture models for real-time tracking. In: CVPR, pp. 246–2527 (1999)
Google Scholar
Hu, M.: Visual pattern recognition by moment invariants. IRE Tr. on. Inf. Theory, 179–187 (1962)
Google Scholar
Alajlan, N., Kamel, M.S., Freeman, G.: Multi-object image retrieval based on shape and topology. Signal Processing: Image Communication 21, 904–918 (2006)
Google Scholar
Domingos, P., Pazzani, M.: On the optimality of the simple bayesian classifier under zero-one loss. Machine Learning 29, 103–137 (1997)
Article MATH Google Scholar
Wang, Y., Mori, G.: Max-margin hidden conditional random fields for human action recognition. In: CVPR (2009)
Google Scholar
Rapantzikos, K., Avrithis, Y., Kollias, S.: Dense saliency-based spatiotemporal feature points for action recognition. In: CVPR, pp. 1–8 (2009)
Google Scholar
Ke, Y., Sukthankar, R., Hebert, M.: Efficient visual event detection using volumetric features. In: Proc. ICCV, vol. 1, pp. 166–173 (2005)
Google Scholar

Download references

Author information

Authors and Affiliations

Institute for Electronics, Signal Processing and Communications, Otto-von-Guericke-University, Magdeburg, Germany
Samy Sadek, Ayoub Al-Hamadi & Bernd Michaelis
Department of Electrical Engineering, Assiut University, Assiut, Egypt
Usama Sayed

Authors

Samy Sadek
View author publications
You can also search for this author in PubMed Google Scholar
Ayoub Al-Hamadi
View author publications
You can also search for this author in PubMed Google Scholar
Bernd Michaelis
View author publications
You can also search for this author in PubMed Google Scholar
Usama Sayed
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science and Engineering, University of Nevada, 89557, Reno, NV, USA
George Bebis
Moffett Field, NASA Ames Research Center, 94035, CA, USA
Richard Boyle
Lawrence Berkeley National Laboratory, Berkeley, CA, USA
Bahram Parvin
Desert Research Institute, Reno, NV, USA
Darko Koracin
The Chinese University of Hong Kong, Shatin, Hong Kong, China
Ronald Chung
Dyna Vox Systems, Pittsburgh, PA, USA
Riad Hammound
King Saud University, Riyadh, Saudi Arabia
Muhammad Hussain
Hewlett Packard Labs, Paolo Alto, CA, USA
Tan Kar-Han
The Ohio State University, Columbus, OH, USA
Roger Crawfis
Virtual Reality Lab, EPFL, Lausanne, Switzerland
Daniel Thalmann
NASA Ames Research Center, Clifton Park, NY, USA
David Kao
Kitware, Clifton Park, NY, USA
Lisa Avila

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Sadek, S., Al-Hamadi, A., Michaelis, B., Sayed, U. (2010). Human Activity Recognition: A Scheme Using Multiple Cues. In: Bebis, G., et al. Advances in Visual Computing. ISVC 2010. Lecture Notes in Computer Science, vol 6454. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-17274-8_56

Download citation

DOI: https://doi.org/10.1007/978-3-642-17274-8_56
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-17273-1
Online ISBN: 978-3-642-17274-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics