Human Context: Modeling Human-Human Interactions for Monocular 3D Pose Estimation

Andriluka, Mykhaylo; Sigal, Leonid

doi:10.1007/978-3-642-31567-1_26


Part of the book series: Lecture Notes in Computer Science (LNIP, volume 7378)

Included in the following conference series:

International Conference on Articulated Motion and Deformable Objects


Abstract

Automatic recovery of the 3D pose of multiple interacting subjects from an unconstrained monocular image sequence is a challenging and largely unaddressed problem. We observe, however, that by taking the interactions explicitly into account, treating individual subjects as mutual “context” for one another, performance on this challenging problem can be improved. Building on this observation, in this paper we develop an approach that first jointly estimates the 2D poses of people using a multi-person extension of the pictorial structures model and then lifts them to 3D. We illustrate the effectiveness of our method on a new dataset of dancing couples and on challenging videos from dance competitions.

Author information

Authors and Affiliations

Max Planck Institute for Informatics, Saarbrücken, Germany
Mykhaylo Andriluka
Disney Research, Pittsburgh, USA
Leonid Sigal

Editor information

Editors and Affiliations

Dept. of Computer Science and Mathematics, UIB – Universitat de les Illes Balears, C/ Valldemossa km 7.5, PC 07122, Palma de Mallorca, Spain
Francisco J. Perales
School of Informatics, University of Edinburgh, 1.26 Informatics Forum, 10 Crichton St., EH8 9AB, Edinburgh, UK
Robert B. Fisher
Dept. for Architecture, Design and Media Technology, Aalborg University, Niels Jernes Vej 14, 9220, Aalborg East, Denmark
Thomas B. Moeslund

About this paper

Cite this paper

Andriluka, M., Sigal, L. (2012). Human Context: Modeling Human-Human Interactions for Monocular 3D Pose Estimation. In: Perales, F.J., Fisher, R.B., Moeslund, T.B. (eds) Articulated Motion and Deformable Objects. AMDO 2012. Lecture Notes in Computer Science, vol 7378. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-31567-1_26

DOI: https://doi.org/10.1007/978-3-642-31567-1_26
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-31566-4
Online ISBN: 978-3-642-31567-1
eBook Packages: Computer Science, Computer Science (R0)
