Skip to main content

Advertisement

Log in

Monocular human motion tracking

  • Sp.lss. on Video Surveillance
  • Published:
Multimedia Systems Aims and scope Submit manuscript

Abstract.

Human motion tracking from monocular image sequences has been explored widely. However, a framework that addresses the variety of sensing conditions is lacking. In this paper, we present a simple, efficient, and robust method for recovering plausible 3D motion from a video without knowledge of the camera’s parameters. Our method transforms the motion capture problem into a convex problem and employs a hierarchical geometrical solver for the minimization. This algorithm was applied to numerous synthetic and real image sequences with very encouraging results. Specifically, our results indicate that it can handle challenges posed by variation of lighting, partial self-occlusion, and rapid motion.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Aggarwal JK, Cai Q (1999) Human motion analysis: a review. Comput Vision Image Understand 73(3):428-440

    Article  Google Scholar 

  2. Barrón C (2003) Human motion tracking from an uncalibrated camera. PhD thesis, University of Houston, December 2003

  3. Barrón C, Kakadiaris I (2001) Estimating anthropometry and pose from a single image. Comput Vision Image Understand 81(3):269-284

    Article  Google Scholar 

  4. Barrón C, Kakadiaris I (2003) A convex penalty method for optical human motion tracking. In: ACM international workshop on video surveillance, Berkeley, CA, 7 November 2003, pp 1-10

  5. Barrón C, Kakadiaris I (2003) On the improvement of anthropometry and pose estimation from a single uncalibrated image. Mach Vision Appl 14(4):229-236

    Article  Google Scholar 

  6. Bowden R, Mitchell TA, Sarhadi M (2000) Non-linear statistical models for the 3D reconstruction of human pose and motion from monocular image sequences. Image Vision Comput 18(9):729-737

    Article  Google Scholar 

  7. Bregler C, Malik J (1998) Tracking people with twists and exponential maps. In: IEEE conference on computer vision and pattern recognition, Santa Barbara, CA, 23-25 June 1998, pp 8-15

  8. Bregler C, Malik J, Pullen K (2004) Twist based acquisition and tracking of animal and human kinematics. Int J Comput Vision 56(3):179-194

    Article  Google Scholar 

  9. Cai Q, Aggarwal J (1998) Automatic tracking of human motion in indoor scenes across multiple synchronized video streams. In: Proc. international conference on computer vision, pp 356-362

  10. Cai Q, Aggarwal J (1999) Tracking human motion in a structured environment using a distributed camera system. IEEE Trans Pattern Anal Mach Intell 21(11):1241-1247

    Article  Google Scholar 

  11. Cham T, Rehg J (1999) A multiple hypothesis approach to figure tracking. In: Proc. IEEE conference on computer vision and pattern recognition, Fort Collins, CO, 23-25 June 1999. IEEE Press, New York, 2:239-245

  12. Cheung GKM, Kanade T, Bouguet JY, Holler M (2000) A real time system for robust 3D voxel reconstruction of human motions. In: Proc. IEEE conference on computer vision and pattern recognition, Hilton Head Island, SC, 13-15 June 2000. IEEE Press, 2:714-720

  13. Deutscher J, Davison AJ, Reid ID (2001) Automatic partitioning of high dimensional search spaces associated with articulated body motion capture. In: Proc. IEEE conference on computer vision and pattern recognition, Kauai, HI, 8-14 December 2001. IEEE Press, New York, 2:669-676

  14. Fablet R, Black MJ (2002) Automatic detection and tracking of human motion with a view-based representation. In: Proc. IEEE conference on computer vision (ECCV 2002), Copenhagen, 28-31 May 2002, 1:476-491

  15. DiFranco DE. Cham T, Rehg JM (2001) Reconstruction of 3-D figure motion from 2-D correspondences. In: Proc. IEEE conference on computer vision and pattern recognition, Kauai, HI, 8-14 December 2001. IEEE Press, New York, 1:307-314

  16. Gleicher M (1999) Animation from observation: Motion capture and motion editing. Comput Graph 33(4):51-55

    Google Scholar 

  17. Isard M, Blake A (1998) Condensation - conditional density propagation for visual tracking. Int J Comput Vision 29(1):5-28

    Article  Google Scholar 

  18. Ju SX, Black MJ, Yacoob Y (1996) Cardboard people: a parameterized model of articulated motion. In: Proc. 2nd international conference on automatic face- and gesture-recognition, Killington, VT, October 1996, pp 38-44

  19. Kakadiaris I, Metaxas D (2000) Model-based estimation of 3D human motion. IEEE Trans Pattern Anal Mach Intell 22(12):1453-1459

    Article  Google Scholar 

  20. Liebowitz D, Carlsson S (2001) Uncalibrated motion capture exploiting articulated structure constraints. In: Proc. 8th IEEE international conference on computer vision (ICCV’01), Vancouver, BC, Canada, 9-12 July 2001, 2:230-237

  21. Luck JP (2003) Real-time markerless human motion tracking using linked kinematics chains. PhD thesis, Colorado School of Mines, Golden, CO

  22. Martinez G, Kakadiaris I, Magruder D (2002) Teleoperating robonaut: a case study. In: Proc. British machine vision conference, Cardiff, UK, 2-5 September 2002

  23. Moeslund TB, Granum E (2001) A survey of computer vision-based human motion capture. Comput Vision Image Understand 81(3):231-268

    Article  MATH  Google Scholar 

  24. Park S, Aggarwal J (2002) Segmentation and tracking of interacting human body parts under occlusion and shadowing. In: IEEE workshop on motion and video computing, Orlando, FL, 5-6 December 2002

  25. Pläenkers R, Fua P (2001) Tracking and modeling people in video sequences. Comput Vision Image Understand 81(3):285-302

    Article  Google Scholar 

  26. Pläenkers R, Fua P (2003) Articulated soft objects for multi-view shape and motion capture. IEEE Trans Pattern Anal Mach Intell 25(10):1182-1187

    Article  Google Scholar 

  27. Plaenkers R Fua P (2002) Model-based silhouette extraction for accurate people tracking. In: European conference on computer vision, Copenhagen, May 2002

  28. Rehg J (2000) Motion capture from movies. In: Asian conference in computer vision, Taipei, Taiwan, 2:1125-1131

  29. Roberts T, McKenna SJ, Ricketts IW (2002) Adaptive learning of statistical appearance models for 3D human tracking. In: British machine vision conference, Cardiff University, 2-5 September 2002

  30. Rosales R (2002) The specialized mappings architecture with applications to vision-based estimation of articulated body pose. PhD thesis, Boston University, January 2002

  31. Rosenfeld A (2000) Survey image analysis and computer vision: 1999. Comput Vision Image Understand 78(2):222-302

    Article  Google Scholar 

  32. Sato K, Aggarwal J (2002) Tracking and recognizing two-person interactions in outdoor image sequences. In: IEEE workshop on multi-object tracking, Vancouver, BC, Canada, 8 July 2002

  33. Sidenbladh J, Black MJ, Sigal L (2002) Implicit probabilistic models of human motion for synthesis and tracking. In: European conference on computer vision (ECCV 2002), Copenhagen, 28-31 May 2002, 1:784-800

  34. Silaghi M, Plaenkers R, Boulic R, Fua O, Thalmann D (1998) Local and global skeleton fitting techniques for optical motion capture. In: Modeling and motion capture techniques for virtual environments. Lecture notes in artificial intelligence, vol 1537. International Workshop, CAPTECH’98, Geneva, Switzerland, November 1998. Springer, Berlin Heidelberg New York, pp 26-40

  35. Ude A, Riley M (1999) Prediction of body configurations and appearance for model-based estimation of articulated human motions. In: IEEE international conference on systems, man and cybernetics, Tokyo, 12-15 October 1999, 2:687-691

  36. Weng J, Liu Y, Huang T, Ahuja N (1988) Estimating motion/structure from line correspondences: a robust linear algorithm and uniqueness theorems. In: IEEE conference on computer vision and pattern recognition, Ann Arbor, MI, 5-6 June 1988, pp 387-392

  37. Zhuang Y, Liu X, Pan Y (1999) Video motion capture using feature tracking and skeleton reconstruction. In: IEEE international conference on image processing, Kobe, Japan, 24-28 October 1999, pp 232-236

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to C. Barrón.

Additional information

Published online: 21 October 2004

Rights and permissions

Reprints and permissions

About this article

Cite this article

Barrón, C., Kakadiaris, I.A. Monocular human motion tracking. Multimedia Systems 10, 118–130 (2004). https://doi.org/10.1007/s00530-004-0145-4

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00530-004-0145-4

Keywords:

Navigation