2D Human Pose Estimation and Tracking in Non-overlapping Cameras

Taj, Murtaza; Hassan, Ali; Khalid, Abdul Rafay

doi:10.1007/978-3-319-10807-0_12

Murtaza Taj⁴,
Ali Hassan⁴ &
Abdul Rafay Khalid⁴

821 Accesses

Abstract

This chapter will discuss approaches to 2D human pose estimation and tracking in a non-overlapping camera network. It will demonstrate the limitations of current approaches and suggest strategies to overcome them. In particular, computational intractability due to high dimensional limb space, violation of articulation constraints, and view-point dependence. The chapter is divided into three major components; namely, search space reduction, pose validation, and view-invariant pose tracking in a non-overlapping camera network. Firstly, we present approaches for search space reduction, such as Kinematic Tree based sub-region selection for each limb, Mean-Shift based maxima search on the likelihood surface, and temporal based reduction of search in parameter space. Secondly, we devise a PCA based Pose Validation strategy to prune out anatomically incorrect hypotheses. Thirdly, we propose to incorporate articulation constraints while keeping the problem tractable. Finally, we enable view-invariance through the fusion of only two pose detectors and an articulated skeleton tracker.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Hardcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
In this work, by articulated skeleton we refer to a skeleton in which each pair of adjacent limbs shares a common point (a joint) called articulation point. This common point introduces a joint constraint on the movement of these limbs called articulation constraint.
2.
http://groups.inf.ed.ac.uk/calvin/calvin_upperbody_detector/.
3.
http://www.robots.ox.ac.uk/~vgg/data/stickmen/buffy_stickmen_v3.01.tgz.
4.
PCP computes the distance between the estimated skeleton and the ground truth, skeletons found closer than a set threshold (commonly set to \(0.5\)) are considered correct.

References

Andriluka M, Roth S, Schiele B (2010) Monocular 3d pose estimation and tracking by detection. In: CVPR, San Francisco, pp 623–630
Google Scholar
Andriluka M, Roth S, Schiele B (2012) Discriminative appearance models for pictorial structures. Int J Comp Vis 99(3):259–280
Article MathSciNet Google Scholar
Dalal N, Triggs B (2005) Histograms of oriented gradients for human detection. In: Proceedings of IEEE international conference on computer vision and pattern recognition, vol 1, pp 886–893
Google Scholar
Datta A, Sheikh YA, Kanade T (2008) Linear motion estimation for systems of articulated planes. In: IEEE CVPR, June 2008
Google Scholar
Eichner M, Marin-Jimenez M, Zisserman A, Ferrari V (2012) 2D articulated human pose estimation and retrieval in (almost) unconstrained still images. Int J Comp Vis 99:190–214
Article MathSciNet Google Scholar
Ess A, Leibe B, Van Gool L (2007) Depth and appearance for mobile scene analysis. In: IEEE ICCV, Oct 2007
Google Scholar
Felzenszwalb PF, Girshick RB, McAllester D, Ramanan D (2010) Object detection with discriminatively trained part-based models. IEEE Trans PAMI 32:1627–1645
Article Google Scholar
Fischler MA, Elschlager RA (1973) The representation and matching of PS. IEEE Trans Comput 22(1):67–92
Article Google Scholar
Hassan A, Taj M (2014) 2D articulated pose tracking: a hybrid approach. In: Proceedings of IEEE international conference on image processing, Paris, Oct 2014
Google Scholar
Lucey S, Saragih J, Cohn J (2011) Deformable model fitting by regularized landmark mean-shifts. Int J Comp Vis 91(2):200–215
Article MATH MathSciNet Google Scholar
Johnson S, Everingham M (2010) Clustered pose and nonlinear appearance models for human pose estimation. In: Proceedings of British machine vision conference, pp 1–11
Google Scholar
Kalal Z, Mikolajczyk K, Matas J (2012) Tracking-learning-detection. IEEE Trans PAMI 34:1409–1422
Article Google Scholar
Khalid AR, Hassan A, Taj M (2014) Efficient 2D pose estimation using mean-shift. In: Proceedings of IEEE international conference on image processing, Paris, Oct 2014
Google Scholar
Maji S, Berg AC, Malik J (2008) Classification using intersection kernel support vector machines is efficient. In: IEEE CVPR, Anchorage, Alaska
Google Scholar
Pishchulin L, Jain A, Andriluka M, Thormaehlen T, Schiele B (2012) Articulated people detection and pose estimation: reshaping the future. In: CVPR, Providence
Google Scholar
Ramanan D (2006) Learning to parse images of articulated objects. In: NIPS, Vancouver
Google Scholar
Ramanan D (2011) Visual analysis of humans, Chapter 11: part-based models for finding people and estimating their pose, Springer, pp 199–224
Google Scholar
Sadeghi MA, Farhadi A (2011) Recognition using visual phrases. In: IEEE CVPR, Colorado Springs, USA, June 2011
Google Scholar
Sapp B, Toshev A, Taskar B (2010) Cascaded models for articulated pose estimation. In: Proceedings of the European conference on computer vision, Berlin, Heidelberg, pp 406–420
Google Scholar
Shotton J, Fitzgibbon A, Cook M, Sharp T, Finocchio M, Moore R, Kipman A, Blake A (2011) Real-time human pose recognition in parts from single depth images. In: Proceedings of IEEE international conference on computer vision and pattern recognition, pp 1297–1304
Google Scholar
Sigal L, Balan AO, Black MJ (2006) Humaneva: synchronized video and motion capture dataset for evaluation of articulated human motion. Int J Comp Vis 87(1):4–27
Google Scholar
Simo-Serra E, Quattoni A, Torras C, Moreno-Noguer F (2013) A joint model for 2D and 3D pose estimation from a single image. In: Proceedings of the conference on computer vision and pattern recognition (CVPR)
Google Scholar
Wu C, Aghajan H (2008) Human pose estimation in vision networks via distributed local processing and nonparametric belief propagation. In: Proceedings of the 10th international conference on advanced concepts for intelligent vision systems, ACIVS’08, Springer, Berlin, pp 1006–1017
Google Scholar
Yang Y, Ramanan D (2011) Articulated pose estimation with flexible mixtures-of-parts. In: IEEE CVPR, Washington
Google Scholar
Zuffi S, Freifeld O, Black MJ (2012) From pictorial structures to deformable structures. In: IEEE CVPR, Washington
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, LUMS School of Science and Engineering, Lahore, Pakistan
Murtaza Taj, Ali Hassan & Abdul Rafay Khalid

Authors

Murtaza Taj
View author publications
You can also search for this author in PubMed Google Scholar
Ali Hassan
View author publications
You can also search for this author in PubMed Google Scholar
Abdul Rafay Khalid
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Murtaza Taj .

Editor information

Editors and Affiliations

National Research Council, Lecce, Lecce, Italy
Paolo Spagnolo
National Research Council, Lecce, Lecce, Italy
Pier Luigi Mazzeo
National Research Council, Lecce, Lecce, Italy
Cosimo Distante

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Taj, M., Hassan, A., Khalid, A.R. (2014). 2D Human Pose Estimation and Tracking in Non-overlapping Cameras. In: Spagnolo, P., Mazzeo, P., Distante, C. (eds) Human Behavior Understanding in Networked Sensing. Springer, Cham. https://doi.org/10.1007/978-3-319-10807-0_12

Download citation

DOI: https://doi.org/10.1007/978-3-319-10807-0_12
Published: 07 November 2014
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-10806-3
Online ISBN: 978-3-319-10807-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics