On Importance of Interactions and Context in Human Action Recognition

Shapovalova, Nataliya; Gong, Wenjuan; Pedersoli, Marco; Roca, Francesc Xavier; Gonzàlez, Jordi

doi:10.1007/978-3-642-21257-4_8

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 6669)

Included in the following conference series:

Iberian Conference on Pattern Recognition and Image Analysis

Abstract

This paper addresses the automatic recognition of human events in static images. Popular techniques infer the action from knowledge of the human pose, and the most recent approaches tend to combine pose information with knowledge of either the scene or the objects with which the human interacts. Our approach takes a step forward in this direction by combining the human pose with the scene in which the human is placed, together with the spatial relationships between humans and objects. Using standard, simple descriptors such as HOG and SIFT, recognition performance improves when these three types of knowledge are taken into account. Results on the PASCAL 2010 Action Recognition Dataset show that our technique reaches state-of-the-art performance using simple descriptors and classifiers.
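
As an illustration of the idea of combining three cues (pose, scene, and human-object spatial relationships), the following is a minimal sketch and not the authors' implementation. It assumes the pose and scene descriptors have already been computed elsewhere (for example as HOG- or SIFT-based vectors) and that bounding boxes for the human and the object are available. The function names `spatial_relation` and `combine_cues`, the specific spatial features, and the concatenation-based fusion are all hypothetical choices made for this example.

```python
import numpy as np


def l2_normalize(v, eps=1e-12):
    """Scale a vector to unit L2 norm (eps avoids division by zero)."""
    v = np.asarray(v, dtype=float)
    return v / (np.linalg.norm(v) + eps)


def spatial_relation(human_box, object_box):
    """Hypothetical human-object spatial feature.

    Boxes are (x_min, y_min, x_max, y_max). Returns the offset of the
    object centre from the human centre, normalised by the human box
    size, plus the log of the object/human area ratio.
    """
    hx0, hy0, hx1, hy1 = human_box
    ox0, oy0, ox1, oy1 = object_box
    hw, hh = hx1 - hx0, hy1 - hy0
    dx = ((ox0 + ox1) / 2.0 - (hx0 + hx1) / 2.0) / hw
    dy = ((oy0 + oy1) / 2.0 - (hy0 + hy1) / 2.0) / hh
    log_area_ratio = np.log(((ox1 - ox0) * (oy1 - oy0)) / (hw * hh))
    return np.array([dx, dy, log_area_ratio])


def combine_cues(pose_desc, scene_desc, interaction_desc):
    """Assumed fusion scheme: normalise pose and scene descriptors,
    then concatenate all three cues into one feature vector that a
    simple classifier could consume."""
    return np.concatenate([
        l2_normalize(pose_desc),
        l2_normalize(scene_desc),
        np.asarray(interaction_desc, dtype=float),
    ])


# Usage with placeholder descriptors (real ones would come from HOG/SIFT).
rel = spatial_relation((0, 0, 10, 20), (10, 5, 20, 15))
feature = combine_cues(np.ones(4), np.ones(3), rel)
```

Concatenation is only one possible way to fuse the cues; the abstract does not state which fusion or classifier the paper uses.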

Author information

Authors and Affiliations

Computer Science Department and Computer Vision Center, Universitat Autònoma de Barcelona (UAB), 08193, Barcelona, Catalonia, Spain
Nataliya Shapovalova, Wenjuan Gong, Marco Pedersoli, Francesc Xavier Roca & Jordi Gonzàlez

Editor information

Editors and Affiliations

Departament de Matemàtica Aplicada i Anàlisi, Universitat de Barcelona, Facultat de Matemàtiques, Gran Via de les Corts Catalanes 585, 08007, Barcelona, Spain
Jordi Vitrià
Instituto de Sistemas e Robótica / Instituto Superior Técnico, Av. Rovisco Pais, 1, 1049-001, Lisbon, Portugal
João Miguel Sanches
Institute for Intelligent Systems and Numerical Applications in Engineering (SIANI), Edificio de Informática y Matemáticas, University of Las Palmas de Gran Canaria, Campus Universitario de Tafira, 35017, Las Palmas, Spain
Mario Hernández

About this paper

Cite this paper

Shapovalova, N., Gong, W., Pedersoli, M., Roca, F.X., Gonzàlez, J. (2011). On Importance of Interactions and Context in Human Action Recognition. In: Vitrià, J., Sanches, J.M., Hernández, M. (eds) Pattern Recognition and Image Analysis. IbPRIA 2011. Lecture Notes in Computer Science, vol 6669. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-21257-4_8

DOI: https://doi.org/10.1007/978-3-642-21257-4_8
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-21256-7
Online ISBN: 978-3-642-21257-4
eBook Packages: Computer Science, Computer Science (R0)