Skip to main content

Part of the book series: Studies in Computational Intelligence ((SCI,volume 346))

  • 1604 Accesses

Abstract

With the demands on public security and the availability of large storage systems, an increasing number of video surveillance systems are being deployed all over the world to help people detect interesting target events. However, most of these systems require intensive human monitoring, or require human operators to review video footage corresponding to extended periods of time, only to find a few short clips that are of interest. The problem fosters a demand of an automatic computer surveillance system, which can assist the human operators in identifying possible interesting events. This challenge has attracted researchers from different domains, leading to a variety of proposed approaches, particularly in the field of human activity recognition. These approaches vary in the choice of representation and methodologies as well. This chapter gives a survey and reviews the state of the art approaches to automatic human activity recognition in videos.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 299.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 379.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 379.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Ke, Y., Sukthankar, R., Hebert, M.: Efficient visual event detection using volumetric features. In: IEEE International Conference on Computer Vision (2005)

    Google Scholar 

  2. Krueger, V., Kragic, D., Ude, A., Geib, C.: Meaning of action: a review on action recognition and mapping. Int. Journal on Advanced Robotics, Special issue on Imitative Robotics 21, 1473–1501 (2007)

    Google Scholar 

  3. Dollar, P., Rabaud, V., Cottrellm, G., Belongie, S.: Behavior recognition via sparse spatiotemporal features. In: PETS (2005)

    Google Scholar 

  4. Laptev, I., Lindeberg, T.: Space-time interest points. In: IEEE International Conference on Computer Vision (2003)

    Google Scholar 

  5. Schuldt, C., Laptev, I., Caputo, B.: Recognizing human actions: A local svm approach. In: Proceedings of the International Conference on Pattern Recognition (2004)

    Google Scholar 

  6. Niebles, J.C., Fei-Fei, L.: A hierarchical model of shape and appearance for human action classification. In: IEEE Conference on Computer Vision and Pattern Recognition (2007)

    Google Scholar 

  7. Bobick, A.F., Davis, J.W.: The recognition of human movement using temporal templates. IEEE Transaction on Pattern Analysis and Machine Intelligence 23, 647–666 (2001)

    Article  Google Scholar 

  8. Efros, A., Berg, E., Mori, G., Malik, J.: Recognizing action at a distance. In: IEEE International Conference on Computer Vision (2003)

    Google Scholar 

  9. Blank, M., Gorelick, L., Shechtman, E., Irani, M., Basri, R.: Actions as space time shapes. In: IEEE Conference on Computer Vision and Pattern Recognition (2005)

    Google Scholar 

  10. Shechtman, E., Irani, M.: Space-time behavior based correlation. In: IEEE Conference on Computer Vision and Pattern Recognition (2005)

    Google Scholar 

  11. Rodriguez, M.D., Ahmed, J., Shah, M.: Action mach: a spatiotemporal maximum average correlation height filter for action recognition. In: IEEE Conference on Computer Vision and Pattern Recognition (2008)

    Google Scholar 

  12. Mahalanobis, A., Vijaya Kumar, B.V.K., Sims, S.R.F., Epperson, J.: Unconstrained correlation filters. Applied Optics 33, 3751–3759 (1994)

    Article  Google Scholar 

  13. Ebling, J., Scheuermann, G.: Clifford Fourier transform on vector fields. IEEE Transactions on Visualization and Computer Graphics 11(4), 469–479 (2005)

    Article  Google Scholar 

  14. Makris, D., Ellis, T.: Path detection in video surveillance. In: Image and Visual Computing (2002)

    Google Scholar 

  15. Hu, W., Xie, D., Tan, T.: A Hierarchical Self-Organizing Approach for Learning the Patterns of Motion Trajectories. IEEE Transaction on Neural Network 15(1) (January 2004)

    Google Scholar 

  16. Fu, Z., Hu, W., Tan, T.: Similarity based vehicle trajectory clustering and anomaly detection. In: IEEE Int. Conf. Image Processing, vol. 2, pp. 602–605 (2005)

    Google Scholar 

  17. Junejo, I.N., Foroosh, H.: Trajectory Rectification and Path Modeling for Video Surveillance. In: International Conference on Computer Vision (2007)

    Google Scholar 

  18. Hamid, R., Maddi, S., Bobick, A., Essa, I.: Structure from Statistics - Un-supervised Activity Analysis using Suffix Trees. In: International Conference on Computer Vision (2007)

    Google Scholar 

  19. Bose, B., Grimson, E.: Improving Object Classification in Far-Field Video. In: IEEE Conference on Computer Vision and Pattern Recognition (2004)

    Google Scholar 

  20. Ramanan, D., Forsyth, D.A.: Automatic annotation of everyday movements. In: Neural Information Processing Systems (2003)

    Google Scholar 

  21. Fanti, C., Zelnik-Manor, L., Perona, P.: Hybrid models for human motion recognition. In: International Conference on Computer Vision (2005)

    Google Scholar 

  22. Gavrila, D.: The visual analysis of human movement: a survey. Computer Vision and Image Understanding 73, 82–98 (1999)

    Article  MATH  Google Scholar 

  23. Ikizler, N., Forsyth, D.: Searching video for complex activities with finite state models. In: IEEE Conference on Computer Vision and Pattern Recognition (2007)

    Google Scholar 

  24. Ukkonen, E.: Constructing suffix trees on-line in linear time. In: Proc. Information Processing 92. IFIP Transactions A-12, vol. 1, pp. 484–492 (1994)

    Google Scholar 

  25. Zhang, Z., Huang, K., Tan, T., Wang, L.: Trajectory Series Analysis based Event Rule Induction for Visual Surveillance. In: IEEE Conference on Computer Vision and Pattern Recognition (2007)

    Google Scholar 

  26. Li, L.-J., Fei-Fei, L.: What, where and who? Classifying event by scene and object recog-nition. In: IEEE International Conference on Computer Vision (2007)

    Google Scholar 

  27. Cao, L., Fei-Fei, L.: Spatially coherent latent topic model for concurrent object segmentation and classification. In: IEEE International Conference on Computer Vision (2007)

    Google Scholar 

  28. Savarese, S., Pozo, A.D., Niebles, J., Fei-Fei, L.: Spatial-temporal correlations for unsupervised action classification. In: IEEE Workshop on Motion and Video Computing (2008)

    Google Scholar 

  29. Wang, X., Ma, K.T., Ng, G.W., Grimson, E.: Trajectory analysis and semantic region modeling using a nonparametric Bayesian model. In: IEEE Conference on Computer Vision and Patter Recognition (2008)

    Google Scholar 

  30. Wang, X., Ma, X., Grimson, E.: Unsupervised activity perception by hierarchical Bayesian models. In: IEEE Conference on Computer Vision and Patter Recognition (2007)

    Google Scholar 

  31. Hofmann, T.: Probabilistic latent semantic analysis. In: Conference on Uncertainty in Artificial Intelligence (1999)

    Google Scholar 

  32. Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent dirichlet allocation. Journal of Machine Learning Research 3, 993–1022 (2003)

    Article  MATH  Google Scholar 

  33. Teh, Y., Jordan, M., Beal, M., Blei, D.: Hierarchical dirichlet process. Journal of the American Statistical Association (2006)

    Google Scholar 

  34. Niebles, J.C., Wang, H., Fei-Fei, L.: Unsupervised learning of human action categories using spatial-temporal words. International Journal of Computer Vision 79, 299–318 (2008)

    Article  Google Scholar 

  35. Brand, M., Oliver, N., Pentland, A.: Coupled hidden markov models for complex action recognition. In: IEEE Conference on Computer Vision and Pattern Recognition (1997)

    Google Scholar 

  36. Olivera, N., Gargb, A., Horvitz, E.: Layered representations for learning and inferring office activity from multiple sensory channels. Computer Vision and Image Understanding 96, 163–180 (2004)

    Article  Google Scholar 

  37. Bobick, A.F., Ivanov, Y.A.: Action recognition using probabilistic parsing. In: IEEE Conference on Computer Vision and Pattern Recognition (1998)

    Google Scholar 

  38. Quattoni, A., Wang, S., Morency, L.-P., Collins, M., Darrell, T.: Hidden-state conditional random fields. IEEE Transactions on Pattern Analysis and Machine Intelligence 29, 1848–1852 (2007)

    Article  Google Scholar 

  39. Wang, Y., Jiang, H., Drew, M.S., Li, Z., Mori, G.: Unsupervised discovery of action classes. In: IEEE Conference on Computer Vision and Pattern Recognition (2006)

    Google Scholar 

  40. Athitsos, V., Sclaroff, S.: Estimating 3D Hand Pose from a Cluttered Image. In: IEEE Conference on Computer Vision and Pattern Recognition (2003)

    Google Scholar 

  41. Sudderth, E.B., Mandel, M.I., Freeman, W.T., Willsky, A.S.: Visual Hand Tracking Using Nonparametric Belief Propagation. In: IEEE Conference on Computer Vision and Pattern Recognition Workshop on Generative Model Based Vision (2004)

    Google Scholar 

  42. Smeaton, A.F., Over, P., Kraaij, W.: Evaluation campaigns and TRECVid. In: Proceedings of the 8th ACM International Workshop on Multimedia Information Retrieval, MIR (2006)

    Google Scholar 

  43. Liu, J., Luo, J., Shah, M.: Recognizing Realistic Actions from Videos in the Wild. In: IEEE Conference on Computer Vision and Pattern Recognition (2009)

    Google Scholar 

  44. Lafferty, J., McCallum, A., Pereira, F.: Conditional random fields: probabilistic models for segmenting and labeling sequence data. In: IEEE International Conference on Machine Learning (2001)

    Google Scholar 

  45. Vail, D., Veloso, M., Lafferty, J.: Conditional Random Fields for Activity Recognition. In: ACM International Conference on Autonomous Agents and Multiagent Systems (2007)

    Google Scholar 

  46. Sminchisescu, C., Kanaujia, A., Li, Z., Metaxas, D.: Conditional models for contextual human motion recognition. In: IEEE International Conference on Computer Vision (2005)

    Google Scholar 

  47. Quattoni, A., Collins, M., Darrell, T.: Conditional random fields for object recognition. In: Advances in Neural Information Processing Systems (2005)

    Google Scholar 

  48. Wang, S., Quattoni, A., Morency, L.-P., Demirdjian, D., Darrell, T.: Hidden Conditional Random Fields for Gesture Recognition. In: IEEE Conference on Computer Vision and Pattern Recognition (2006)

    Google Scholar 

  49. Wang, Y., Mori, G.: Learning a discriminative hidden part model for human action recognition. In: Advances in Neural Information Processing Systems (2008)

    Google Scholar 

  50. Wang, Y., Mori, G.: Max-Margin Hidden Conditional Random Fields for Human Action Recognition. In: IEEE Conference on Computer Vision and Pattern Recognition (2009)

    Google Scholar 

  51. Liu, J., Shah, M.: Learning Human Actions via Information Maximization. In: IEEE Conference on Computer Vision and Pattern Recognition (2008)

    Google Scholar 

  52. Bobick, A.F.: Movement, activity, and action: The role of knowledge in the perception of motion. Philosoph. Trans. Roy. Soc. Lond. B 352, 1257–1265 (1997)

    Article  Google Scholar 

  53. Marszalek, M., Laptev, I., Schmid, C.: Actions in Context. In: IEEE Conference on Computer Vision and Pattern Recognition (2009)

    Google Scholar 

  54. Liu, J., Luo, J., Shah, M.: Recognizing Realistic Actions from Videos in the Wild. In: IEEE Conference on Computer Vision and Pattern Recognition (2009)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Liu, H., Sun, MT., Feris, R. (2011). Automatic Video Activity Recognition. In: Lin, W., Tao, D., Kacprzyk, J., Li, Z., Izquierdo, E., Wang, H. (eds) Multimedia Analysis, Processing and Communications. Studies in Computational Intelligence, vol 346. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-19551-8_11

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-19551-8_11

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-19550-1

  • Online ISBN: 978-3-642-19551-8

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics