Abstract
The 3D scanning of time-varying objects based on multi-view geometry has been a hot topic in computer vision in recent years. This paper presents an active approach to acquiring 3D shapes of moving foot from multi-view video sequences based on the explicit marker points woven in socks. The reference 3D model of foot is firstly recovered from the starting frames in the multi-view video clips, and then the markers in each view are traced as a whole based on a continuous motion vector field built from the reference 3D model. The missing vertices in the candidate 3D model caused by self-occlusion, non-uniform illumination and random noises are reliably estimated and reconstructed in 3D space by minimizing the changes of differential features in 3D geometry of the reference model. The experimental results show that our method can robustly recover the 3D model of foot in complex background even if there is a relatively large movement between the adjacent frames, with an acceptable accuracy of the resulting 3D model.
Similar content being viewed by others
References
Makiko K, Masaaki M. Development of a low cost foot-scanner for a custom shoe making system. In: Proceedings of the 5th Symposium on Footwear Biomechanics. Paris: Footwear Biomechanics Group, 2001. 58–59
Luximon A, Goonetilleke R S, Zhang M. 3D foot shape generation from 2D information. Ergonomics, 2005, 48: 625–641
Witana C P, Xiong S P, Zhao J H, et al. Foot measurements from three-dimensional scans: a comparison and evaluation of different methods. Int J Industr Ergonom, 2006, 36: 789–807
MacWilliams B A, Cowley M, Nicholson D E. Foot kinematics and kinetics during adolescent gait. Gait Post, 2003, 17: 214–224
Boehnen C, Flynn P J. Accuracy of 3D scanning technologies in a face scanning context. In: Fifth International Conference on 3D Imaging and Modeling(3DIM2005). Washington: IEEE Computer Society Press, 2005. 310–317
Gao F, Chu J, Li M J, et al. 3D Scanning of foot by stereo vision based on explicit markers. J Comput Aid Des Comput Graph, 2009, 21: 1412–1419
Kutulakos K N, Seitz S M. A theory of shape by space carving. Int J Comput Vis, 2000, 38: 197–216
Culbertson W, Malzbender T, Slabaugh G. Vision Algorithms: Theory and Practice. 1st ed. Berlin: Springer, 1999. 100–115
Matusik W, Buehler C, McMillan L. Polyhedral visual hulls for real-time rendering. In: Proceedings of the 12th Eurographics Workshop on Rendering Techniques. London: Springer-Verlag, 2001. 116–126
Moezzi S, Tai L C, Gerard P. Virtual view generation for 3D digital video. IEEE Multimed, 1997, 4: 18–26
Laurentini A. The visual hull concept for silhouette-based image understanding. IEEE Trans PAMI, 1994, 16: 150–162
Decarlo D, Metaxas D. The integration of optical flow and deformable models with applications to human face shape and motion estimation. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Washington: IEEE Computer Society Press, 1996. 231–238
Pighin F H, Szeliski R, Salesin D. Resynthesizing facial animation through 3D model-based tracking. In: Proceedings of International Conference on Computer Vision (ICCV). Washington: IEEE Computer Society Press, 1999. 143–150
Blanz V, Basso C, Poggio T, et al. Reanimating faces in images and video. Comput Graph Forum, 2003, 22: 641–650
Vlasic D, Brand M, Pfister H, et al. Face transfer with multilinear models. ACM Trans Graph, 2005, 24: 426–433
De Aguiar E, Theobalt C, Stoll C, et al. Marker-less deformable mesh tracking for human shape and motion capture. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition(CVPR). Washington: IEEE Computer Society Press, 2007. 1–8
De Aguiar E, Stoll C, Theobalt C, et al. Performance capture from sparse multi-view video. ACM Trans Graph (TOG), 2008, 27: 98
Pighin F, Hecker J, Lischinski D, et al. Synthesizing realistic facial expressions from photographs. In: Proceedings of ACM SIGGRAPH1998. New York: Association for Computing Machinery SIGGRAPH, 1998. 75–84
Herda L, Fua P, Plankers R, et al. Skeleton-based motion capture for robust reconstruction of human motion. In: Proceedings of Computer Animation 2000(CA’00). Washington: IEEE Computer Society Press, 2000. 77–93
Moeslund T B, Granum E. A survey of computer vision-based human motion capture. Comput Vis Image Understand, 2001, 81: 231–268
Park S I, Hodgins J K. Capturing and animating skin deformation in human motion. ACM Trans Graph (TOG), 2006, 25: 881–889
Zhang Z. A flexible new technique for camera calibration. IEEE Trans Patt Anal Mach Intell, 2000, 22: 1330–1334
Battiato S, Gallo G, Puglisi G, et al. SIFT features tracking for video stabilization. In: Proceedings of 14th International Conference on Image Analysis and Processing. Berlin: Springer, 2007. 825–830
Wagner D, Reitmayr G, Mulloni A, et al. Pose tracking from natural features on mobile phones. In: Proceedings of the 7th IEEE/ACM International Symposium on Mixed and Augmented Reality. Washington: IEEE Computer Society Press, 2008. 125–134
David G L. Distinctive image features from scale-invariant keypoints. Int J Comput Vis, 2004, 60: 91–110
Tsutsui H, Miura J, Shirai Y. Optical flow-based person tracking by multiple cameras. In: Proceedings of International Conference on Multisensor Fusion and Integration for Intelligent Systems. Berlin: Springer, 2001. 91–96
Brox T, Rosenhahn B, Cremers D, et al. High accuracy optical flow serves 3-D pose tracking: exploiting contour and flow based constraints. In: Proceedings of European Conference on Computer Vision, ECCV’06(vol. 3952 of LNCS). Berlin: Springer, 2006. 98–111
Lucas B, Kanade T. An iterative image registration technique with an application to stereo vision. In: Proceedings of the International Joint Conference on Artificial Intelligence. San Francisco: Morgan Kaufmann, 1981. 674–679
Bergen J R, Anandan P, Hanna K J, et al. Hierarchical model-based motion estimation. In: Poceedings of the European Conference on Computer Vision(vol. LNCS 588). Berlin: Springer, 1992. 237–252
Harris C, Stephens M. A combined corner and edge detector. In: Proceedings of the Fourth Alvey Vision Conference. Manchester: Organising Committee, AVC 88, 1988. 147–151
Smith S M, Brady J M. SUSAN-A new approach to low level image processing. Int J Comput Vis, 1997, 23: 45–78
Parker J R, Jennings C, Salkauskas A G. Thresholding using an illumination model. In: Proceedings of the 2nd International Conference on Document Analysis and Recognition. Washington: IEEE Computer Society Press, 1993. 270–273
Au O K C, Tai C L, Liu L G, et al. Dual Laplacian editing for meshes. IEEE Trans Visual Comput Graph, 2006, 12: 386–395
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Gao, F., Wang, Q., Geng, W. et al. Acquisition of time-varying 3D foot shapes from video. Sci. China Inf. Sci. 54, 2256–2268 (2011). https://doi.org/10.1007/s11432-011-4361-1
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11432-011-4361-1