Abstract
We describe innovative methods for extracting three-dimensional motion of humans and animals from unrestricted monocular video, using a combination of new and established computer vision and computer graphics techniques. We identity features using image processing and active contours. Active contours become anchored to model segments in specified image frames and automatically “pull” the segments to feature positions in other frames. Adjustments are subject to joint limits and may use inverse kinematics. Occluded contour points are detected using object geometry and do not participate in feature tracking. Interactive adjustments are possible at any time in the process, allowing extraction of complicated movements regardless of background, camera movement, or feature clarity.
Similar content being viewed by others
References
Aggarwal JK, Cai Q, Liao W, Sabata B (1998) Nonrigid motion analysis: Articulated and elastic motion. Comput Vision Image Understanding 70(2):142–156
Akita K (1984) Image sequence analysis of real world human motion. Pattern Recognition 17(1):73–83
Atkinson-Derman L (2000) Tracking on the wild side – using active contours to track fauna in noisy image sequences. Master’s thesis, University of California, Santa Cruz, Santa Cruz, CA 95064
Blake A, Isard M (1998) Active Contours. Springer
Bregler C, Malik J (1998) Estimating and tracking kinematic chains. In: Computer Vision and Pattern Recognition, Santa Barbara, CA
Cham T-J, Rehg JM (1999) A multiple hypothesis approach to figure tracking. In: Proc. Computer Vision and Pattern Recognition, pp 239–245, Ft. Collins, CO
Chen Z, Lee HJ (1992) Knowledge-guided visual perception of 3D human gait from single image sequence. IEEE Trans Syst Man Cyber 22(2):336–342
Goncalves L, Di Bernardo E, Ursella E, Perona P (1995) Monocular tracking of the human arm in 3D. In: Proc. of the IEEE Fifth International Conference on Computer Vision, pp 764–770. Cambridge, MA
Goody PC (1983) Horse Anatomy: A Pictorial Approach to Equine Structure. J.A. Allen, London
Hel-Or Y, Werman M (1996) Constraint fusion for recognition and localization of articulated and constrained objects. Int J Comput Vision 19(1):5–28
Isard M, Blake A (1998) Condensation – conditional density propagation for visual tracking. Int J Comput Vision 29(1):5–28
Jain A (1989) Fundamentals of Digital Image Processing. Prentice-Hall International Editions
Kakadiaris IA, Metaxas D, Bajcsy R (1995) 3D human body model acquisition from multiple views. In Proc. of the IEEE Workshop on Non-Rigid and Articulated Objects, pp 618–623, Boston, MA
Kass M, Witkin A, Terzopoulos D (1988) Snakes: Active contour models. Int J Comput Vision 1(4):321–331
Lapierre J, Wilhelms J (1999) Matching anatomy to model for articulated body animation. In Proceedings of 1999 IASTED Computer Graphics and Imaging Conference, Palm Springs, CA
Morris DD, Rehg JM (1998) Singularity analysis for articulated object tracking. In Proc. Computer Vision and Pattern Recognition, pp 289–296, Santa Barbara, CA
Pentland A, Horowitz B (1991) Recovery of nonrigid motion and structure. IEEE Trans Patt Anal Machine Intel 13(7):730–742
Perales FJ, Torres J (1994) A system for human motion matching between synthetic and real images based on a biomechanic graphical model. Proceedings of the 1994 IEEE Workshop on Motion of Non-Rigid and Articulated Objects, Austin, TX, USA. IEEE Computer Society Press, Los Alamitos, CA, USA, pp 83–88
Rehg JM, Kanade T (1994) Digiteyes: Vision-based hand tracking for human-computer interaction. Proceedings of the 1994 IEEE Workshop on Motion of Non-Rigid and Articulated Objects, Austin, TX, USA. IEEE Computer Society Press, Los Alamitos, CA, USA, pp 16–22
Schneider PJ, Wilhelms J (1998) Hybrid anatomically based modeling of animals. In Computer Animation ’98, Philadelphia, PA, USA, June 1998. IEEE Computer Society, Los Alamitos, CA, USA, pp 161–169
Simmons M, Wilhelms J, Van Gelder A (2002) Model-based reconstruction for creature animation. ACM Symposium on Computer Animation, San Antonio, TX, USA. ACM SIGGRAPH, pp 139–146
Sobel I (1990) An isotropic 3x3 image gradient operator. Machine Vision for Three-Dimensional Scenes. Academic Press, Boston, USA, pp 376–379
Stubbs G (1976) The Anatomy of the Horse. Dover Publications, New York
Wilhelms J, Van Gelder A (1997) Anatomically based modeling. In Computer Graphics (ACM SIGGRAPH ’97 Proceedings), Los Angeles, CA, USA. ACM, pp 173–180
Wilhelms J, Van Gelder A (2001) Fast and easy reach-cone joint limits. J Graph Tools 6(2):27–41. Available at ftp://ftp.cse.ucsc.edu/pub/avg/jtl.pdf
Wilhelms J, Van Gelder A, Atkinson-Derman L, Luo H (2000) Human motion from active contours. In: IEEE Workshop on Human Motion, Austin, TX, USA. IEEE Computer Society, Los Alamitos, CA, USA, pp 155–260
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Wilhelms, J., Van Gelder, A. Combining vision and computer graphics for video motion capture. Vis Comput 19, 360–376 (2003). https://doi.org/10.1007/s00371-003-0201-7
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00371-003-0201-7