Abstract.
Human motion tracking from monocular image sequences has been widely explored. However, a framework that addresses the variety of sensing conditions is lacking. In this paper, we present a simple, efficient, and robust method for recovering plausible 3D motion from a video without knowledge of the camera’s parameters. Our method transforms the motion capture problem into a convex optimization problem and employs a hierarchical geometrical solver for the minimization. The algorithm was applied to numerous synthetic and real image sequences with very encouraging results. Specifically, our results indicate that it can handle the challenges posed by variations in lighting, partial self-occlusion, and rapid motion.
Published online: 21 October 2004
About this article
Cite this article
Barrón, C., Kakadiaris, I.A. Monocular human motion tracking. Multimedia Systems 10, 118–130 (2004). https://doi.org/10.1007/s00530-004-0145-4
DOI: https://doi.org/10.1007/s00530-004-0145-4