Abstract
Recently, model-based approaches have produced very promising results to the problems of 3D hand tracking. The current state of the art method recovers the 3D position, orientation and 20 DOF articulation of a human hand from markerless visual observations obtained by an RGB-D sensor. Hand pose estimation is formulated as an optimization problem, seeking for the hand model parameters that minimize an objective function that quantifies the discrepancy between the appearance of hand hypotheses and the actual hand observation. The design of such a function is a complicated process that requires a lot of prior experience with the problem. In this paper we automate the definition of the objective function in such optimization problems. First, a set of relevant, candidate image features is computed. Then, given synthetic data sets with ground truth information, regression analysis is used to combine these features in an objective function that seeks to maximize optimization performance. Extensive experiments study the performance of the proposed approach based on various dataset generation strategies and feature selection techniques.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Albrecht, I., Haber, J., Seidel, H.P.: Construction and animation of anatomically based human hand models. In: SIGGRAPH, pp. 98–109. San Diego, USA (2003)
Argyros, A.A., Lourakis, M.I.: Real-time tracking of multiple skin-colored objects with a possibly moving camera. In: Pajdla, T., Matas, J. (eds.) Computer Vision–ECCV 2004. LNCS, vol. 3023, pp. 368–379. Springer, Berlin (2004)
Canny, J.: A computational approach to edge detection. IEEE Trans. PAMI-Pattern Anal. Mach. Intell. 8(6), 679–698 (1986)
Cao, C., Weng, Y., Lin, S., Zhou, K.: 3D shape regression for real-time facial animation. ACM Trans. Graphics 32(4), 41:1–41:10 (2013)
de La Gorce, M., Paragios, N., Fleet, D.J.: Model-based hand tracking with texture, shading and self-occlusions. In: CVPR, pp. 1–8. Anchorage, USA (2008)
Erol, A., Bebis, G., Nicolescu, M., Boyle, R.D., Twombly, X.: Vision-based hand pose estimation: a review. Comput. Vis. Image Underst. 108(1), 52–73 (2007)
Felzenszwalb, P.F., Huttenlocher, D.P.: Efficient belief propagation for early vision. Int. J. Comput. Vision 70(1), 41–54 (2006)
Hamer, H., Schindler, K., Koller-Meier, E., Van Gool, L.: Tracking a hand manipulating an object. In: ICCV, pp. 1475–1482. Kyoto, Japan (2009)
Kennedy, J., Eberhart, R.: Particle swarm optimization. In: ICNN, vol. 4, pp. 1942–1948. Perth, Australia (1995)
Kyriazis, N., Oikonomidis, I., Argyros, A.: A GPU-powered computational framework for efficient 3D model-based vision. Techincal report, ICS-FORTH (2011)
Moeslund, T.B., Hilton, A., Krüger, V.: A survey of advances in vision-based human motion capture and analysis. Comput. Vis. Image Underst. 104(2), 90–126 (2006)
Oikonomidis, I., Kyriazis, N., Argyros, A.: Efficient model-based 3D tracking of hand articulations using Kinect. In: BMVC, pp. 101.1-101.11. Dundee, UK (2011)
Oikonomidis, I., Kyriazis, N., Argyros, A.A.: Markerless and efficient 26-DOF hand pose recovery. In: Kimmel, R., Klette, R., Sugimoto, A. (eds.) Computer Vision–ACCV 2010. LNCS, vol. 6494, pp. 744–757. Springer, Berlin (2011)
Romero, J., Kjellstrom, H., Kragic, D.: Monocular real-time 3d articulated hand pose estimation. In: Humanoids, pp. 87–92. Paris, France (2009)
Shotton, J., Sharp, T., Kipman, A., Fitzgibbon, A., Finocchio, M., Blake, A., Cook, M., Moore, R.: Real-time human pose recognition in parts from single depth images. Commun. ACM 56(1), 116–124 (2013)
Sobol, I.M.: On the distribution of points in a cube and the approximate evaluation of integrals. USSR Comput. Math. Mathematical Phys. 7(4), 86–112 (1967)
Wu, Y., Huang, T.S.: View-independent recognition of hand postures. In: CVPR, vol. 2, pp. 88–94. Hilton Head Island, USA (2000)
Xiong, X., De la Torre, F.: Supervised descent method and its applications to face alignment. In: CVPR, pp. 532–539. Portland, USA (2013)
Acknowledgments
This work was partially supported by the EU project FP7-IP- 288533 Robohow. The contributions of Iason Oikonomidis and Nikolaos Kyriazis, members of FORTH/CVRL, are gratefully acknowledged.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Paliouras, K., Argyros, A.A. (2016). Towards the Automatic Definition of the Objective Function for Model-Based 3D Hand Tracking. In: Gruca, A., Brachman, A., Kozielski, S., Czachórski, T. (eds) Man–Machine Interactions 4. Advances in Intelligent Systems and Computing, vol 391. Springer, Cham. https://doi.org/10.1007/978-3-319-23437-3_30
Download citation
DOI: https://doi.org/10.1007/978-3-319-23437-3_30
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-23436-6
Online ISBN: 978-3-319-23437-3
eBook Packages: EngineeringEngineering (R0)