Skip to main content
Log in

Human Pose Estimation Using Partial Configurations and Probabilistic Regions

  • Published:
International Journal of Computer Vision Aims and scope Submit manuscript

Abstract

A method for recovering a part-based description of human pose from single images of people is described. It is able to perform estimation efficiently in the presence significant background clutter, large foreground variation, self-occlusion and occlusion by other objects. This is achieved through two key developments. Firstly, a new formulation is proposed that allows partial configurations, hypotheses with differing numbers of parts, to be made and compared. This permits efficient global sampling in the presence of self and other object occlusions without prior knowledge of body part visibility. Secondly, a highly discriminatory likelihood model is proposed comprising two complementary components. A boundary component improves upon previous appearance distribution divergence methods by incorporating high-level shape and appearance information and hence better discriminates textured, overlapping body parts. An inter-part component uses appearance similarity of body parts to reduce the number of false-positive, multi-part hypotheses, hence increasing estimation efficiency. Results are presented for challenging images with unknown subject and large variations in subject appearance, scale and pose.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Baerlocher, P. and Boulic, R. 2001. Deformable Avatars, chapter Parametrization and range of motion of the ball-and-socket joint. Kluwer Academic, pp. 180–190.

  • Barron, C. and Kakadiaris, I.A. 2001. Estimating anthropometry and pose from a single uncalibrated image. Computer Vision and Image Understanding, 81(3):269–284.

    Article  Google Scholar 

  • Baumberg, A. and Hogg, D. 1994. An efficient method for contour tracking using active shape models. In Proc. of IEEE Workshop on Motion of Non-Rigid and Articulated Objects, pp. 194–199.

  • Bowden, R., Mitchell, T.A., and Sarhadi, M. 2000. Non-linear statistical models for the 3D reconstruction of human pose and motion from monocular image sequences. Image and Vision Computing, 18(9):729–737.

    Article  Google Scholar 

  • Bregler, C. and Malik, J. 1998. Tracking people with twists and exponential maps. In IEEE Conference on Computer Vision and Pattern Recognition. Santa Barbara, CA, pp. 8–15.

  • Cham, T.J. and Rehg, J.M. 1999. A multiple hypothesis approach to figure tracking. In IEEE Conference on Computer Vision and Pattern Recognition. Fort Collins, Colorado, USA, vol. 2, pp. 239–245.

  • Choo, K. and Fleet, D.J. 2001. People tracking using hybrid Monte Carlo filtering. In IEEE International Conference on Computer Vision. Vancouver, pp. 321–328.

  • Comaniciu, D., Ramesh, V., and Meer, P. 2000. Real-time tracking of nonrigid objects using mean shift. In IEEE Conference on Computer Vision and Pattern Recognition, pp. 673–678.

  • Deutscher, J., Blake, A., and Reid, I. 2000. Articulated body motion capture by annealed particle filtering. In IEEE Conference on Computer Vision and Pattern Recognition. South Carolina, USA, vol. 2, pp. 126–133.

  • Deutscher, J., Davison, A., and Reid, I. 2001. Automatic partitioning of high dimensional search spaces associated with articulated body motion capture. In IEEE Conference on Computer Vision and Pattern Recognition. Hawaii, vol. 2, pp. 669–676.

  • Deutscher, J., North, B., Bascle, B., and Blake, A. 1999. Tracking through singularities and discontinuities by random sampling. In IEEE International Conference on Computer Vision, pp. 1144–1149.

  • Duda, R., Hart, P., and Stork, D. 2001. Pattern Classification. 2nd edition, Wiley.

  • Felzenszwalb, P.F. and Huttenlocher, D.P. 2000. Efficient matching of pictorial structures. In IEEE Conference on Computer Vision and Pattern Recognition, vol. 2, pp. 66–73.

  • Forsyth, D.A. and Fleck, M.M. 1997. Body plans. In IEEE Conference on Computer Vision and Pattern Recognition. Puerto Rico, pp. 678–683.

  • Forsyth, D.A. and Fleck, M.M. 1999. Automatic detection of human nudes. International Journal of Computer Vision, 32(1):63–77.

    Article  Google Scholar 

  • Gavrila, D.M. 1999. The visual analysis of human movement: A survey. Computer Vision and Image Understanding, 73(1):82–98.

    Article  Google Scholar 

  • Gavrila, D.M. and Davis, L.S. 1996. 3D model-based tracking of humans in action: A multi-view approach. In IEEE Conference on Computer Vision and Pattern Recognition. San Francisco, USA, pp. 73–80.

  • Grosso, M., Quach, R., and Badler, N. 1989. Anthropometry for computer animated human figures. In Proceedings of Computer Animation Human Figure, Spring Verlag, pp. 83–96.

  • Haritaoglu, I., Harwood, D., and Davis, L.S. 1999. Hydra: Multiple people detection and tracking using silhouettes. In IEEE International Workshop on Visual Surveillance, pp. 6–13.

  • Haritaoglu, I., Harwood, D., and Davis, L.S. 2000. W4: Real-time surveillance of people and their activities. IEEE Transactions on Pattern Analysis and Machine Intelligence, 22(8):809–830.

    Article  Google Scholar 

  • Hogg, D. 1983. Model-based vision: A program to see a walking person. Image and Vision Computing, 1(1):5–20.

    Article  Google Scholar 

  • Ioffe, S. and Forsyth, D.A. 2001a. Probabilistic methods for finding people. International Journal of Computer Vision, 43:45–68.

    Article  Google Scholar 

  • Ioffe, S. and Forsyth, D.A. 2001b. Human tracking with mixtures of trees. In IEEE International Conference on Computer Vision, vol. 1, pp. 690–695.

  • Isard, M. and Blake, A. 1996. Contour tracking by stochastic propagation of conditional density. In European Conference on Computer Vision. Cambridge, vol. 1, pp. 343–356.

  • Ju, S., Black, M., and Yacoob, Y. 1996. Cardboard people: A parameterized model of articulated image motion. In IEEE International Conference on Face and Gesture Recognition. Killington, VT, USA, pp. 38–44.

  • Kakadiaris, I.A. and Metaxas, D. 1996. Model-based estimation of 3D human motion with occlusion based on active multi-viewpoint selection. In IEEE Conference on Computer Vision and Pattern Recognition. San Francisco, USA, pp. 81–87.

  • Karaulova, I., Hall, P., and Marshall, A. 2000. A hierarchical model of dynamics for tracking people with a single video camera. In British Machine Vision Conference. Bristol, pp. 352–361.

  • Konishi, S., Yuille, A.L., Coughlan, J.M., and Zhu, S.C. 2003. Statistical edge detection: Learning and evaluating edge cues. IEEE Transactions on Pattern Analysis and Machine Intelligence, 25(1):57–74.

    Article  Google Scholar 

  • Lee, M.W. and Cohen I. 2004. Human upper body pose estimation from static images. In European Conference on Computer Vision, pp. 126–138.

  • Leung, M.K. and Yang, Y.H. 1995. First sight: A human body outline labeling system. IEEE Transactions on Pattern Analysis and Machine Intelligence, 359–377.

  • MacCormick, J. and Blake, A. 1998a. A probabilistic contour discriminant for object localisation. In IEEE International Conference on Computer Vision, pp. 390–395.

  • MacCormick, J. and Blake, A. 1998b. Spatial dependence in the observation of visual contours. In European Conference on Computer Vision.

  • Malik, J., Belongie, S., Leung, T., and Shi, J. 2001. Contour and texture analysis for image segmentation. International Journal of Computer Vision, 43(1):7–27.

    Article  Google Scholar 

  • Martin, D.R., Fowlkes, C.C., and Malik, J. 2004. Learning to detect natural image boundaries using local brightness, color, and texture cues. IEEE Transactions on Pattern Analysis and Machine Intelligence, 26(5):530–549.

    Article  Google Scholar 

  • McKenna, S.J., Jabri, S., Duric, Z., Rosenfeld, A., and Wechsler, H. 2000. Tracking groups of people. Computer Vision and Image Understanding, 80(1):42–56.

    Article  Google Scholar 

  • Metaxas, D. and Terzopoulos, D. 1993. Shape and non rigid motion estimation through physics based synthesis. IEEE Transactions on Pattern Analysis and Machine Intelligence, 15(6):580–591.

    Article  Google Scholar 

  • Mikolajczyk, K., Schmid, C., and Zisserman, A. 2004. Human detection based on a probabilistic assembly of robust part detectors. In European Conference on Computer Vision, vol. I, pp. 69–81.

  • Moeslund, T.B. and Bajers, F. 1999. Summaries of 107 computer vision-based human motion capture papers. Technical Report LIA99-01, University of Aalborg.

  • Moeslund, T.B. and Granum, E. 2000. 3D human pose estimation using 2D-Data and an alternative phase space representation. In IEEE Workshop on Human Modeling, Analysis and Synthesis.

  • Moeslund, T.B. and Granum, E. 2001. A survey of computer vision-based human motion capture. Computer Vision and Image Understanding, 81(3):231–268.

    Article  Google Scholar 

  • Mori, G. and Malik, J. 2002. Estimating human body configurations using shape context matching. In European Conference on Computer Vision, pp. 666–680.

  • Mori, G., Ren, X., Efros, A.A., and Malik, J. 2004. Recovering human body configurations: Combining segmentation and recognition. pp. II: 326–333.

  • Morris, D.D. and Rehg, J.M. 1998. Singularity analysis for articulated object tracking. In IEEE Conference on Computer Vision and Pattern Recognition, pp. 289–296.

  • Ong, E. and Gong, S. 1999. A dynamic human model using hybrid 2D-3D representations in hierarchical PCA space. In British Machine Vision Conference. Nottingham, pp. 33–42.

  • O’Rourke, J. and Badler, N.I. 1980. Model-based image analysis of human motion using constraint propagation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2(6):522–536.

    Google Scholar 

  • Park, J., Hwang-Seok, O., Chang, D., and Lee, E. 2000. Human posture recognition using curve segments for image retrieval. In SPIE Conference on Storage and Retrieval for Media Databases, vol. 3972, pp. 2–11.

  • Puzicha, J., Rubner, Y., Tomasi, C., and Buhmann, J.M. 1999. Empirical evaluation of dissimilarity measures for color and texture. IEEE International Conference on Computer Vision, 1165–1173.

  • Ramanan, D. and Forsyth, D.A. 2003. Finding and tracking people from the bottom up. In IEEE Conference on Computer Vision and Pattern Recognition. Madison, Wisconsin, vol. II, pp. 467–474.

  • Rehg, J.M. and Kanade, T. 1995. Model based tracking of self occluding articulated objects. In IEEE International Conference on Computer Vision, pp. 612–617.

  • Roberts, T.J., McKenna, S.J., and Ricketts, I.W. 2004. Human pose estimation using learnt probabilistic region similarities and partial configurations. In European Conference on Computer Vision.

  • Ronfard, R., Schud, C., and Triggs, B. 2002. Learning to parse pictures of people. In European Conference on Computer Vision. Copenhagen, pp. 700–714.

  • Rosales, R. and Sclaroff, S. 2000. Inferring body pose without tracking body parts. In IEEE Conference on Computer Vision and Pattern Recognition, vol. 2, pp. 721–727.

  • Rosales, R., Siddiqui, M., Alon, J., and Sclaroff, S. 2001. Estimating 3D body pose using uncalibrated cameras. In IEEE Conference on Computer Vision and Pattern Recognition, vol. I, pp. 821–827.

  • Ruzon, M.A. and Tomasi, C. 1999. Color edge detection with the compass operator. In IEEE Conference on Computer Vision and Pattern Recognition, pp. II: 160–166.

  • Schiele, B. and Crowley, J.L. 2000. Recognition without correspondence using multidimensional receptive field histograms. International Journal of Computer Vision, 36(1):31–50

    Article  Google Scholar 

  • Shahrokni, A., Drummond, T., and Fua, P. 2004. Texture boundary detection for real-time tracking. In European Conference on Computer Vision, vol. II, pp. 566–577.

  • Sidenbladh, H. and Black, M.J. 2001. Learning image statistics for Bayesian tracking. In IEEE International Conference on Computer Vision. Vancouver, vol. 2, pp. 709–716.

  • Sidenbladh, H., de la Torre, F., and Black, M.J. 2000. A framework for modeling the appearance of 3D articulated figures. In IEEE International Conference on Face and Gesture Recognition. Grenoble, pp. 368–375.

  • Sminchisescu, C. and Triggs, B. 2001. Covariance scaled sampling for monocular 3D body tracking. In IEEE Conference on Computer Vision and Pattern Recognition. Hawaii, vol. 1, pp. 447–454.

  • Sminchisescu, C. and Triggs, B. 2003. Kinematic jump processes for monocular 3D human tracking. In IEEE Conference on Computer Vision and Pattern Recognition, pp. I: 69–76.

  • Stauffer, C. and Grimson, E. 2001. Similarity templates for detection and recognition. In IEEE Conference on Computer Vision and Pattern Recognition, vol. I, pp. 221–228.

  • Taylor, C. 2000. Reconstruction of articulated objects from point correspondences in a single uncalibrated image. Computer Vision and Image Understanding, 80:349–363.

    Article  Google Scholar 

  • Wachter, S. and Nagel, H.H. 1999. Tracking persons in monocular image sequences. Computer Vision and Image Understanding, 74(3):174–192.

    Article  Google Scholar 

  • Wilhelms, J., van Gelder, A., Atkinson-Derman, L., and Luo, A. 2000. Human motion from active contours. In IEEE Workshop on Human Motion, pp. 155–160.

  • Wren, C.R., Azarbayejani, A., Darrell, T.J., and Pentland, A.P. 1997. Pfinder: Real-time tracking of the human body. IEEE Transactions on Pattern Analysis and Machine Intelligence, 19(7):780–785.

    Article  Google Scholar 

  • Zhao, J. and Badler, N.I. 1994. Inverse kinematics positioning using nonlinear programming for highly articulated figures. ACM Trans. Graph., 13(4):313–336.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Timothy J. Roberts.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Roberts, T.J., McKenna, S.J. & Ricketts, I.W. Human Pose Estimation Using Partial Configurations and Probabilistic Regions. Int J Comput Vision 73, 285–306 (2007). https://doi.org/10.1007/s11263-006-9781-9

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11263-006-9781-9

Keywords

Navigation