Abstract
Optical motion capture can be classified as an inference problem: given the data produced by a set of cameras, the aim is to extract the hidden state, which in this case encodes the posture of the subject’s body. Problems with motion capture arise due to the multi-modal nature of the likelihood distribution, the extremely large dimensionality of its state-space, and the narrow region of support of local modes. There are also problems with the size of the data and the difficulty with which useful visual cues can be extracted from it, as well as how informative these cues might be. Several algorithms exist that use stochastic methods to extract the hidden state, but although highly parallelisable in theory, such methods produce a heavy computational overhead even with the power of today’s computers. In this paper we assume a set of pre-calibrated cameras and only extract the subject’s silhouette as a visual cue. In order to describe the 2D silhouette data we define a 2D model consisting of conic fields. The resulting likelihood distribution is differentiable w.r.t. the state, meaning that its global maximum can be located fast using gradient ascent search, given manual initialisation at the first frame. In this paper we explain the construction of the model for tracking a human hand; we describe the formulation of the derivatives needed, and present initial results on both real and simulated data.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Vicon Motion Systems and Peak Performance Inc.: Motion capture systems (Online Material), http://www.vicon.com
PhaseSpace Inc.: Optical motion capture systems (Online Material), http://www.phasespace.com
Isard, M., Blake, A.: CONDENSATION – conditional density propagation for visual tracking. Trans. International Journal of Computer Vision (IJCV) 29, 5–28 (1998)
Deutscher, J., Blake, A., Reid, I.: Articulated body motion capture by annealed particle filtering. In: CVPR 2000. Proc. IEEE International Conference on Computer Vision and Pattern Recognition, Hilton Head, SC, USA, pp. 2126–2133. IEEE Computer Society Press, Los Alamitos (2000)
Wu, Y., Lin, J., Huang, T.S.: Analyzing and capturing articulated hand motion in image sequences. Trans. IEEE Pattern Analalysis and Machine Intelligence (PAMI) 27 (2005)
Sudderth, E.B., Mandel, M.I., Freeman, W.T., Willsky, A.S.: Visual hand tracking using nonparametric belief propagation. In: Proc. CVPR Workshop, Washington, DC, USA (2004)
Urtasun, R., Fua, P.: 3D tracking for gait characterization and recognition. In: Proc. IEEE Automatic Face and Gesture Recognition, Seoul, Korea, pp. 17–22. IEEE Computer Society Press, Los Alamitos (2004)
Campos, T.E.D., Murray, D.W.: Regression-based hand pose estimation from multiple cameras. In: Proc. CVPR, New York, NY, USA, vol. 1, pp. 782–789 (2006)
Mikić, I., Trivedi, M.M., Hunter, E., Cosman, P.C.: Human body model acquisition and tracking using voxel data. Trans. IJCV 53, 199–223 (2003)
Plänkers, R., Fua, P.: Tracking and modeling people in video sequences. Trans. Computer Vision and Image Understanding (CVIU) 81, 285–302 (2001)
Bray, M., Koller-Meier, E., Gool, L.V.: Smart particle filtering for high-dimensional tracking. Trans. CVIU 106, 116–129 (2007)
Stenger, B., Mendonça, P.R.S., Cipolla, R.: Model-based 3D tracking of an articulated hand. In: Proc. CVPR, Kauai, HI, USA, vol. 2, pp. 310–315 (2001)
Stenger, B., Thayananthan, A., Torr, P.H.S., Cipolla, R.: Model-based hand tracking using a hierarchical bayesian filter. Trans. PAMI 28, 1372–1384 (2006)
Kaimakis, P., Lasenby, J.: Markerless motion capture with single and multiple cameras. In: Proc. IEEE International Conference on Image Processing, Singapore, pp. 2607–2610. IEEE Computer Society Press, Los Alamitos (2004)
Horprasert, T., Harwood, D., Davis, L.S.: A statistical approach for real-time robust background subtraction and shadow detection. In: Proc. IEEE International Conference on Computer Vision FRAME-RATE Workshop, Kerkyra, Greece, IEEE Computer Society Press, Los Alamitos (1999)
Hartley, R., Zisserman, A.: Multiple View Geometry in Computer Vision, 2nd edn. Cambridge University Press, Cambridge (2003)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kaimakis, P., Lasenby, J. (2007). Gradient-Based Hand Tracking Using Silhouette Data. In: Bebis, G., et al. Advances in Visual Computing. ISVC 2007. Lecture Notes in Computer Science, vol 4841. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-76858-6_3
Download citation
DOI: https://doi.org/10.1007/978-3-540-76858-6_3
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-76857-9
Online ISBN: 978-3-540-76858-6
eBook Packages: Computer ScienceComputer Science (R0)