Abstract
The 3D shape of the human body is useful for applications in fitness, games, and apparel. Accurate body scanners, however, are expensive, limiting the availability of 3D body models. Although there has been a great deal of interest recently in the use of active depth sensing cameras, such as the Microsoft Kinect, for human pose tracking, little has been said about the related problem of human shape estimation. We present a method for human shape reconstruction from noisy monocular image and range data using a single inexpensive commodity sensor. The approach combines low-resolution image silhouettes with coarse range data to estimate a parametric model of the body. Accurate 3D shape estimates are obtained by combining multiple monocular views of a person moving in front of the sensor. To cope with varying body pose, we use a SCAPE body model which factors 3D body shape and pose variations. This enables the estimation of a single consistent shape, while allowing pose to vary. Additionally, we describe a novel method to minimize the distance between the projected 3D body contour and the image silhouette that uses analytic derivatives of the objective function. We use a simple method to estimate standard body measurements from the recovered SCAPE model and show that the accuracy of our method is competitive with commercial body scanning systems costing orders of magnitude more.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Allen, B., Curless, B., Popovic, Z.: The space of human body shapes: reconstruction and parameterization from range scans. ACM Trans. Graph. 22(3), 587–594 (2003)
Anguelov, D., Srinivasan, P., Koller, D., Thrun, S., Rodgers, J., Davis, J.: SCAPE: shape completion and animation of people. ACM Trans. Graph. 24(3), 408–416 (2005)
Balan, A.: Detailed human shape and pose from images. Ph.D. thesis, Brown University (2010)
Balan, A., Sigal, L., Black, M., Davis, J., Haussecker, H.: Detailed human shape and pose from images. In: IEEE Conference on Computer Vision and Pattern Recognition (2007)
Balan, A., Black, M.: The naked truth: estimating body shape under clothing. In: European Conference on Computer Vision (2007)
Bouguet, J.: Camera calibration toolbox for Matlab. http://www.vision.caltech.edu/bouguetj/calib_doc (2007)
Charpiat, G., Faugeras, O., Keriven, R.: Approximations of shape metrics and application to shape warping and empirical shape statistics. Found. Comput. Math. 5(1), 1–58 (2005)
Delamarre, Q., Faugeras, O.: 3D articulated models and multi-view tracking with silhouettes. In: International Conference on Computer Vision (1999)
Flanders, H.: Differentiation under the integral sign. Am. Math. Mon. 80(6), 615–627 (1973)
Ganapathi, V., Plagemann, C., Koller, D., Thrun, S.: Real time motion capture using a single time-of-flight camera. In: IEEE Conference on Computer Vision and Pattern Recognition (2010)
Geman, S., McClure, D.: Statistical methods for tomographic image reconstruction. Bull. Int. Stat. Inst. LII(4), 5–21 (1987)
Grest, D., Herzog, D., Koch, R.: Human model fitting from monocular posture images. In: Vision, Modeling, and Visualization (2005)
Guan, P., Weiss, A., Balan, A., Black, M.: Estimating human shape and pose from a single image. In: International Conference on Computer Vision (2009)
Hasler, N., Rosenhahn, B., Thormählen, T., Wand, M., Gall, J., Seidel, H.P.: Markerless motion capture with unsynchronized moving cameras. In: IEEE Conference on Computer Vision and Pattern Recognition (2009)
Hasler, N., Stoll, C., Rosenhahn, B., Thormählen, T., Seidel, H.P.: Estimating body shape of dressed humans. Comput. Graph. 33(3), 211–216 (2009)
Hasler, N., Ackermann, H., Rosenhahn, B., Thormählen, T., Seidel, H.P.: Multilinear pose and body shape estimation of dressed subjects from image sets. In: IEEE Conference on Computer Vision and Pattern Recognition (2010)
Jain, A., Thormählen, T., Seidel, H.P., Theobalt, C.: Moviereshape: tracking and reshaping of humans in videos. ACM Trans. Graph. 29(6), 148 (2010)
Knossow, D., Ronfard, R., Horaud, R.: Human motion tracking with a kinematic parameterization of extremal contours. Int. J. Comput. Vis. 79(3), 247–269 (2008)
Konolige, K., Mihelich, P.: ROS.org Wiki: Kinect_calibration/technical. http://www.ros.org/wiki/kinect_calibration/technical (2011)
de La Gorce, M., Paragios, N., Fleet, D.: Model-based hand tracking with texture, shading and self-occlusions. In: IEEE Conference on Computer Vision and Pattern Recognition (2008)
Microsoft: Kinect for X-BOX 360. http://www.xbox.com/kinect (2010)
Newcombe, R., Izadi, S., Hilliges, O., Molyneaux, D., Kim, D., Davison, A., Kohli, P., Shotton, J., Hodges, S., Fitzgibbon, A.: KinectFusion: real-time dense surface mapping and tracking. In: International Symposium on Mixed and Augmented Reality (2011)
OpenKinect project. http://www.openkinect.org (2011)
Plänkers, R., Fua, P.: Model-based silhouette extraction for accurate people tracking. In: European Conference on Computer Vision (2002)
Shotton, J., Fitzgibbon, A., Cook, M., Sharp, T., Finocchio, M., Moore, R., Kipman, A., Blake, A.: Real-time human pose recognition in parts from single depth images. In: IEEE Conference on Computer Vision and Pattern Recognition (2011)
Sminchisescu, C., Telea, A.: Human pose estimation from silhouettes. a consistent approach using distance level sets. In: WSCG International Conference on Computer Graphics, Visualization and Computer Vision (2002)
Sumner, R., Popovic, J.: Deformation transfer for triangle meshes. ACM Trans. Graph. 23(3), 399–405 (2004)
Tong, J., Zhou, J., Liu, L., Pan, Z., Yan, H.: Scanning 3D full human bodies using Kinects. IEEE Trans. Vis. Comput. Graph. 18(4), 643–650 (2012)
Zhou, S., Fu, H., Liu, L., Cohen-Or, D., Han, X.: Parametric reshaping of human bodies in images. ACM Trans. Graph. 29(4), 126 (2010)
Acknowledgements
We thank Loretta Reiss for her measurement expertise and Lisa Wang for mathematical discussions. This work was supported in part by NIH EUREKA award 1R01NS066311–01 and NSF award IIS–0812364.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag London
About this chapter
Cite this chapter
Weiss, A., Hirshberg, D., Black, M.J. (2013). Home 3D Body Scans from a Single Kinect. In: Fossati, A., Gall, J., Grabner, H., Ren, X., Konolige, K. (eds) Consumer Depth Cameras for Computer Vision. Advances in Computer Vision and Pattern Recognition. Springer, London. https://doi.org/10.1007/978-1-4471-4640-7_6
Download citation
DOI: https://doi.org/10.1007/978-1-4471-4640-7_6
Publisher Name: Springer, London
Print ISBN: 978-1-4471-4639-1
Online ISBN: 978-1-4471-4640-7
eBook Packages: Computer ScienceComputer Science (R0)