Automatic Video Activity Recognition

Liu, Haowei; Sun, Ming-Ting; Feris, Rogerio

doi:10.1007/978-3-642-19551-8_11

Haowei Liu⁷,
Ming-Ting Sun⁷ &
Rogerio Feris⁸

Part of the book series: Studies in Computational Intelligence ((SCI,volume 346))

1604 Accesses

Abstract

With the demands on public security and the availability of large storage systems, an increasing number of video surveillance systems are being deployed all over the world to help people detect interesting target events. However, most of these systems require intensive human monitoring, or require human operators to review video footage corresponding to extended periods of time, only to find a few short clips that are of interest. The problem fosters a demand of an automatic computer surveillance system, which can assist the human operators in identifying possible interesting events. This challenge has attracted researchers from different domains, leading to a variety of proposed approaches, particularly in the field of human activity recognition. These approaches vary in the choice of representation and methodologies as well. This chapter gives a survey and reviews the state of the art approaches to automatic human activity recognition in videos.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 299.00; Price excludes VAT (USA)

Softcover Book: USD 379.99; Price excludes VAT (USA)

Hardcover Book: USD 379.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Ke, Y., Sukthankar, R., Hebert, M.: Efficient visual event detection using volumetric features. In: IEEE International Conference on Computer Vision (2005)
Google Scholar
Krueger, V., Kragic, D., Ude, A., Geib, C.: Meaning of action: a review on action recognition and mapping. Int. Journal on Advanced Robotics, Special issue on Imitative Robotics 21, 1473–1501 (2007)
Google Scholar
Dollar, P., Rabaud, V., Cottrellm, G., Belongie, S.: Behavior recognition via sparse spatiotemporal features. In: PETS (2005)
Google Scholar
Laptev, I., Lindeberg, T.: Space-time interest points. In: IEEE International Conference on Computer Vision (2003)
Google Scholar
Schuldt, C., Laptev, I., Caputo, B.: Recognizing human actions: A local svm approach. In: Proceedings of the International Conference on Pattern Recognition (2004)
Google Scholar
Niebles, J.C., Fei-Fei, L.: A hierarchical model of shape and appearance for human action classification. In: IEEE Conference on Computer Vision and Pattern Recognition (2007)
Google Scholar
Bobick, A.F., Davis, J.W.: The recognition of human movement using temporal templates. IEEE Transaction on Pattern Analysis and Machine Intelligence 23, 647–666 (2001)
Article Google Scholar
Efros, A., Berg, E., Mori, G., Malik, J.: Recognizing action at a distance. In: IEEE International Conference on Computer Vision (2003)
Google Scholar
Blank, M., Gorelick, L., Shechtman, E., Irani, M., Basri, R.: Actions as space time shapes. In: IEEE Conference on Computer Vision and Pattern Recognition (2005)
Google Scholar
Shechtman, E., Irani, M.: Space-time behavior based correlation. In: IEEE Conference on Computer Vision and Pattern Recognition (2005)
Google Scholar
Rodriguez, M.D., Ahmed, J., Shah, M.: Action mach: a spatiotemporal maximum average correlation height filter for action recognition. In: IEEE Conference on Computer Vision and Pattern Recognition (2008)
Google Scholar
Mahalanobis, A., Vijaya Kumar, B.V.K., Sims, S.R.F., Epperson, J.: Unconstrained correlation filters. Applied Optics 33, 3751–3759 (1994)
Article Google Scholar
Ebling, J., Scheuermann, G.: Clifford Fourier transform on vector fields. IEEE Transactions on Visualization and Computer Graphics 11(4), 469–479 (2005)
Article Google Scholar
Makris, D., Ellis, T.: Path detection in video surveillance. In: Image and Visual Computing (2002)
Google Scholar
Hu, W., Xie, D., Tan, T.: A Hierarchical Self-Organizing Approach for Learning the Patterns of Motion Trajectories. IEEE Transaction on Neural Network 15(1) (January 2004)
Google Scholar
Fu, Z., Hu, W., Tan, T.: Similarity based vehicle trajectory clustering and anomaly detection. In: IEEE Int. Conf. Image Processing, vol. 2, pp. 602–605 (2005)
Google Scholar
Junejo, I.N., Foroosh, H.: Trajectory Rectification and Path Modeling for Video Surveillance. In: International Conference on Computer Vision (2007)
Google Scholar
Hamid, R., Maddi, S., Bobick, A., Essa, I.: Structure from Statistics - Un-supervised Activity Analysis using Suffix Trees. In: International Conference on Computer Vision (2007)
Google Scholar
Bose, B., Grimson, E.: Improving Object Classification in Far-Field Video. In: IEEE Conference on Computer Vision and Pattern Recognition (2004)
Google Scholar
Ramanan, D., Forsyth, D.A.: Automatic annotation of everyday movements. In: Neural Information Processing Systems (2003)
Google Scholar
Fanti, C., Zelnik-Manor, L., Perona, P.: Hybrid models for human motion recognition. In: International Conference on Computer Vision (2005)
Google Scholar
Gavrila, D.: The visual analysis of human movement: a survey. Computer Vision and Image Understanding 73, 82–98 (1999)
Article MATH Google Scholar
Ikizler, N., Forsyth, D.: Searching video for complex activities with finite state models. In: IEEE Conference on Computer Vision and Pattern Recognition (2007)
Google Scholar
Ukkonen, E.: Constructing suffix trees on-line in linear time. In: Proc. Information Processing 92. IFIP Transactions A-12, vol. 1, pp. 484–492 (1994)
Google Scholar
Zhang, Z., Huang, K., Tan, T., Wang, L.: Trajectory Series Analysis based Event Rule Induction for Visual Surveillance. In: IEEE Conference on Computer Vision and Pattern Recognition (2007)
Google Scholar
Li, L.-J., Fei-Fei, L.: What, where and who? Classifying event by scene and object recog-nition. In: IEEE International Conference on Computer Vision (2007)
Google Scholar
Cao, L., Fei-Fei, L.: Spatially coherent latent topic model for concurrent object segmentation and classification. In: IEEE International Conference on Computer Vision (2007)
Google Scholar
Savarese, S., Pozo, A.D., Niebles, J., Fei-Fei, L.: Spatial-temporal correlations for unsupervised action classification. In: IEEE Workshop on Motion and Video Computing (2008)
Google Scholar
Wang, X., Ma, K.T., Ng, G.W., Grimson, E.: Trajectory analysis and semantic region modeling using a nonparametric Bayesian model. In: IEEE Conference on Computer Vision and Patter Recognition (2008)
Google Scholar
Wang, X., Ma, X., Grimson, E.: Unsupervised activity perception by hierarchical Bayesian models. In: IEEE Conference on Computer Vision and Patter Recognition (2007)
Google Scholar
Hofmann, T.: Probabilistic latent semantic analysis. In: Conference on Uncertainty in Artificial Intelligence (1999)
Google Scholar
Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent dirichlet allocation. Journal of Machine Learning Research 3, 993–1022 (2003)
Article MATH Google Scholar
Teh, Y., Jordan, M., Beal, M., Blei, D.: Hierarchical dirichlet process. Journal of the American Statistical Association (2006)
Google Scholar
Niebles, J.C., Wang, H., Fei-Fei, L.: Unsupervised learning of human action categories using spatial-temporal words. International Journal of Computer Vision 79, 299–318 (2008)
Article Google Scholar
Brand, M., Oliver, N., Pentland, A.: Coupled hidden markov models for complex action recognition. In: IEEE Conference on Computer Vision and Pattern Recognition (1997)
Google Scholar
Olivera, N., Gargb, A., Horvitz, E.: Layered representations for learning and inferring office activity from multiple sensory channels. Computer Vision and Image Understanding 96, 163–180 (2004)
Article Google Scholar
Bobick, A.F., Ivanov, Y.A.: Action recognition using probabilistic parsing. In: IEEE Conference on Computer Vision and Pattern Recognition (1998)
Google Scholar
Quattoni, A., Wang, S., Morency, L.-P., Collins, M., Darrell, T.: Hidden-state conditional random fields. IEEE Transactions on Pattern Analysis and Machine Intelligence 29, 1848–1852 (2007)
Article Google Scholar
Wang, Y., Jiang, H., Drew, M.S., Li, Z., Mori, G.: Unsupervised discovery of action classes. In: IEEE Conference on Computer Vision and Pattern Recognition (2006)
Google Scholar
Athitsos, V., Sclaroff, S.: Estimating 3D Hand Pose from a Cluttered Image. In: IEEE Conference on Computer Vision and Pattern Recognition (2003)
Google Scholar
Sudderth, E.B., Mandel, M.I., Freeman, W.T., Willsky, A.S.: Visual Hand Tracking Using Nonparametric Belief Propagation. In: IEEE Conference on Computer Vision and Pattern Recognition Workshop on Generative Model Based Vision (2004)
Google Scholar
Smeaton, A.F., Over, P., Kraaij, W.: Evaluation campaigns and TRECVid. In: Proceedings of the 8th ACM International Workshop on Multimedia Information Retrieval, MIR (2006)
Google Scholar
Liu, J., Luo, J., Shah, M.: Recognizing Realistic Actions from Videos in the Wild. In: IEEE Conference on Computer Vision and Pattern Recognition (2009)
Google Scholar
Lafferty, J., McCallum, A., Pereira, F.: Conditional random fields: probabilistic models for segmenting and labeling sequence data. In: IEEE International Conference on Machine Learning (2001)
Google Scholar
Vail, D., Veloso, M., Lafferty, J.: Conditional Random Fields for Activity Recognition. In: ACM International Conference on Autonomous Agents and Multiagent Systems (2007)
Google Scholar
Sminchisescu, C., Kanaujia, A., Li, Z., Metaxas, D.: Conditional models for contextual human motion recognition. In: IEEE International Conference on Computer Vision (2005)
Google Scholar
Quattoni, A., Collins, M., Darrell, T.: Conditional random fields for object recognition. In: Advances in Neural Information Processing Systems (2005)
Google Scholar
Wang, S., Quattoni, A., Morency, L.-P., Demirdjian, D., Darrell, T.: Hidden Conditional Random Fields for Gesture Recognition. In: IEEE Conference on Computer Vision and Pattern Recognition (2006)
Google Scholar
Wang, Y., Mori, G.: Learning a discriminative hidden part model for human action recognition. In: Advances in Neural Information Processing Systems (2008)
Google Scholar
Wang, Y., Mori, G.: Max-Margin Hidden Conditional Random Fields for Human Action Recognition. In: IEEE Conference on Computer Vision and Pattern Recognition (2009)
Google Scholar
Liu, J., Shah, M.: Learning Human Actions via Information Maximization. In: IEEE Conference on Computer Vision and Pattern Recognition (2008)
Google Scholar
Bobick, A.F.: Movement, activity, and action: The role of knowledge in the perception of motion. Philosoph. Trans. Roy. Soc. Lond. B 352, 1257–1265 (1997)
Article Google Scholar
Marszalek, M., Laptev, I., Schmid, C.: Actions in Context. In: IEEE Conference on Computer Vision and Pattern Recognition (2009)
Google Scholar
Liu, J., Luo, J., Shah, M.: Recognizing Realistic Actions from Videos in the Wild. In: IEEE Conference on Computer Vision and Pattern Recognition (2009)
Google Scholar

Download references

Author information

Authors and Affiliations

University of Washington, Seattle, WA, 98195
Haowei Liu & Ming-Ting Sun
IBM T.J Watson Research Center, Hawthorn, NY, 10532
Rogerio Feris

Authors

Haowei Liu
View author publications
You can also search for this author in PubMed Google Scholar
Ming-Ting Sun
View author publications
You can also search for this author in PubMed Google Scholar
Rogerio Feris
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

School of Computer Engineering , Nanyang Technological University, 639798, Singapore
Weisi Lin & Dacheng Tao &
Intelligent Systems Laboratory Systems Research Institute , Polish Academy of Sciences, Poland
Janusz Kacprzyk
Department of Computing , Hong Kong Polytechnic University, Hung Hom, Hong Kong
Zhu Li
School of Electronic Engineering and Computer Science, Queen Mary, University of London, London, U.K.
Ebroul Izquierdo
TCL-Thomson Electronics , Santa Clara, California
Haohong Wang

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Liu, H., Sun, MT., Feris, R. (2011). Automatic Video Activity Recognition. In: Lin, W., Tao, D., Kacprzyk, J., Li, Z., Izquierdo, E., Wang, H. (eds) Multimedia Analysis, Processing and Communications. Studies in Computational Intelligence, vol 346. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-19551-8_11

Download citation

DOI: https://doi.org/10.1007/978-3-642-19551-8_11
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-19550-1
Online ISBN: 978-3-642-19551-8
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics