Abstract
Human weight estimation is useful in a variety of potential applications, e.g., targeted advertisement, entertainment scenarios and forensic science. However, estimating weight only from color cues is particularly challenging since these cues are quite sensitive to lighting and imaging conditions. In this article, we propose a novel weight estimator based on a single RGB-D image, which utilizes the visual color cues and depth information. Our main contributions are three-fold. First, we construct the W8-RGBD dataset including RGB-D images of different people with ground truth weight. Second, the novel sideview shape feature and the feature fusion model are proposed to facilitate weight estimation. Additionally, we consider gender as another important factor for human weight estimation. Third, we conduct comprehensive experiments using various regression models and feature fusion models on the new weight dataset, and encouraging results are obtained based on the proposed features and models.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Velardo C, Dugelay J L. Weight estimation from visual body appearance. In Proc. the 4th IEEE International Conference on Biometrics: Theory, Applications and Systems, Sept. 2010, pp.1–6.
Buckley R G, Stehman C R, DosSantos F L et al. Bedside method to estimate actual body weight in the emergency department. The Journal of Emergency Medicine, 2012, 42(1): 100–104.
Bloomfield R, Steel E, MacLennan G, Noble D W. Accuracy of weight and height estimation in an intensive care unit: Implications for clinical practice and research. Critical Care Medicine, 2006, 34(8): 2153–2157.
Weise T, Bouaziz S, Li H, Pauly M. Realtime performance-based facial animation. ACM Transactions on Graphics, 2011, 30(4): Article No.77.
Shotton J, Fitzgibbon A, Cook M et al. Real-time human pose recognition in parts from single depth images. In Proc. IEEE Conference on Computer Vision and Pattern Recognition, June 2011, pp.1297–1304.
Xia L, Chen C C, Aggarwal J K. Human detection using depth information by Kinect. In Proc. the 2011 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, June 2011, pp.15–22.
Sun C, Zhang T, Bao B, Xu C, Mei T. Discriminative exemplar coding for sign language recognition with Kinect. IEEE Transactions on Cybernetics, 2013, 43(5): 1418–1428.
Sun C, Zhang T, Bao B K, Xu C. Latent support vector machine for sign language recognition with Kinect. In Proc. the 20th IEEE International Conference on Image Processing, Sept. 2013, pp.4190–4194.
Liu S, Nguyen T, Feng J et al. Hi, magic closet, tell me what to wear! In Proc. the 20th ACM Multimedia, Oct.29–Nov.2, 2012, pp.1333–1334.
Velardo C, Dugelay J, Paleari M, Ariano P. Building the space scale or how to weight a person with no gravity. In Proc. International Conference on Emerging Signal Processing Applications, Jan. 2012, pp.67–70.
Dalal N, Triggs B. Histograms of oriented gradients for human detection. In Proc. IEEE Computer Society Conference on Computer Vision and Pattern Recognition, June 2005, pp.886–893.
Mikolajczyk K, Schmid C, Zisserman A. Human detection based on a probabilistic assembly of robust part detectors. In Proc. the 8th European Conference on Computer Vision, May 2004, pp.69–82.
Basso F, Munaro M, Michieletto S et al. Fast and robust multi-people tracking from RGB-D data for a mobile robot. Advances in Intelligent Systems and Computing, 2013, 193: 265–276.
Spinello L, Arras K. Leveraging RGB-D data: Adaptive fusion and domain adaptation for object detection. In Proc. IEEE International Conference on Robotics and Automation, May 2012, pp.4469–4474.
Janoch A, Karayev S, Jia Y et al. A category-level 3-D object dataset: Putting the Kinect to work. In Proc. IEEE International Conference on Computer Vision Workshops, Nov. 2011, pp.1168–1174.
Achanta R, Shaji A, Smith K et al. SLIC superpixels compared to state-of-the-art superpixel methods. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2012, 34(11): 2274–2282.
Gonzalez R, Woods R. Digital Image Processing. Addison-Wesley Pub., 1992.
Jabid T, Kabir M H, Chae O. Gender classification using local directional pattern (LDP). In Proc. the 20th International Conference on Pattern Recognition, Aug. 2010, pp.2162–2165.
Viola P, Jones M. Robust real-time face detection. International Journal of Computer Vision, 2004, 57(2): 137–154.
Chang C C, Lin C J. LIBSVM: A library for support vector machines. ACM Transactions on Intelligent Systems and Technology, 2011, 2(3): Article No.27.
Guo G, Mu G, Fu Y, Huang T S. Human age estimation using bio-inspired features. In Proc. IEEE Conference on Computer Vision and Pattern Recognition, June 2009, pp.112–119.
Yang M, Zhu S, Lv F, Yu K. Correspondence driven adaptation for human profile recognition. In Proc. the 24th IEEE Conference on Computer Vision and Pattern Recognition, June 2011, pp.505–512.
Author information
Authors and Affiliations
Corresponding author
Additional information
This work is partially supported by Singapore Ministry of Education under Research Grant No. MOE2012-TIF-2-G-016, and also partially by the National Natural Science Foundation of China under Grant No. 61328205.
Electronic supplementary material
Below is the link to the electronic supplementary material.
ESM 1
(PDF 81 kb)
Rights and permissions
About this article
Cite this article
Nguyen, T.V., Feng, J. & Yan, S. Seeing Human Weight from a Single RGB-D Image. J. Comput. Sci. Technol. 29, 777–784 (2014). https://doi.org/10.1007/s11390-014-1467-0
Received:
Revised:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11390-014-1467-0