Skip to main content
Log in

Seeing Human Weight from a Single RGB-D Image

  • Regular Paper
  • Published:
Journal of Computer Science and Technology Aims and scope Submit manuscript

Abstract

Human weight estimation is useful in a variety of potential applications, e.g., targeted advertisement, entertainment scenarios and forensic science. However, estimating weight only from color cues is particularly challenging since these cues are quite sensitive to lighting and imaging conditions. In this article, we propose a novel weight estimator based on a single RGB-D image, which utilizes the visual color cues and depth information. Our main contributions are three-fold. First, we construct the W8-RGBD dataset including RGB-D images of different people with ground truth weight. Second, the novel sideview shape feature and the feature fusion model are proposed to facilitate weight estimation. Additionally, we consider gender as another important factor for human weight estimation. Third, we conduct comprehensive experiments using various regression models and feature fusion models on the new weight dataset, and encouraging results are obtained based on the proposed features and models.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Velardo C, Dugelay J L. Weight estimation from visual body appearance. In Proc. the 4th IEEE International Conference on Biometrics: Theory, Applications and Systems, Sept. 2010, pp.1–6.

  2. Buckley R G, Stehman C R, DosSantos F L et al. Bedside method to estimate actual body weight in the emergency department. The Journal of Emergency Medicine, 2012, 42(1): 100–104.

    Article  Google Scholar 

  3. Bloomfield R, Steel E, MacLennan G, Noble D W. Accuracy of weight and height estimation in an intensive care unit: Implications for clinical practice and research. Critical Care Medicine, 2006, 34(8): 2153–2157.

    Article  Google Scholar 

  4. Weise T, Bouaziz S, Li H, Pauly M. Realtime performance-based facial animation. ACM Transactions on Graphics, 2011, 30(4): Article No.77.

  5. Shotton J, Fitzgibbon A, Cook M et al. Real-time human pose recognition in parts from single depth images. In Proc. IEEE Conference on Computer Vision and Pattern Recognition, June 2011, pp.1297–1304.

  6. Xia L, Chen C C, Aggarwal J K. Human detection using depth information by Kinect. In Proc. the 2011 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, June 2011, pp.15–22.

  7. Sun C, Zhang T, Bao B, Xu C, Mei T. Discriminative exemplar coding for sign language recognition with Kinect. IEEE Transactions on Cybernetics, 2013, 43(5): 1418–1428.

    Article  Google Scholar 

  8. Sun C, Zhang T, Bao B K, Xu C. Latent support vector machine for sign language recognition with Kinect. In Proc. the 20th IEEE International Conference on Image Processing, Sept. 2013, pp.4190–4194.

  9. Liu S, Nguyen T, Feng J et al. Hi, magic closet, tell me what to wear! In Proc. the 20th ACM Multimedia, Oct.29–Nov.2, 2012, pp.1333–1334.

  10. Velardo C, Dugelay J, Paleari M, Ariano P. Building the space scale or how to weight a person with no gravity. In Proc. International Conference on Emerging Signal Processing Applications, Jan. 2012, pp.67–70.

  11. Dalal N, Triggs B. Histograms of oriented gradients for human detection. In Proc. IEEE Computer Society Conference on Computer Vision and Pattern Recognition, June 2005, pp.886–893.

  12. Mikolajczyk K, Schmid C, Zisserman A. Human detection based on a probabilistic assembly of robust part detectors. In Proc. the 8th European Conference on Computer Vision, May 2004, pp.69–82.

  13. Basso F, Munaro M, Michieletto S et al. Fast and robust multi-people tracking from RGB-D data for a mobile robot. Advances in Intelligent Systems and Computing, 2013, 193: 265–276.

    Article  Google Scholar 

  14. Spinello L, Arras K. Leveraging RGB-D data: Adaptive fusion and domain adaptation for object detection. In Proc. IEEE International Conference on Robotics and Automation, May 2012, pp.4469–4474.

  15. Janoch A, Karayev S, Jia Y et al. A category-level 3-D object dataset: Putting the Kinect to work. In Proc. IEEE International Conference on Computer Vision Workshops, Nov. 2011, pp.1168–1174.

  16. Achanta R, Shaji A, Smith K et al. SLIC superpixels compared to state-of-the-art superpixel methods. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2012, 34(11): 2274–2282.

    Article  Google Scholar 

  17. Gonzalez R, Woods R. Digital Image Processing. Addison-Wesley Pub., 1992.

  18. Jabid T, Kabir M H, Chae O. Gender classification using local directional pattern (LDP). In Proc. the 20th International Conference on Pattern Recognition, Aug. 2010, pp.2162–2165.

  19. Viola P, Jones M. Robust real-time face detection. International Journal of Computer Vision, 2004, 57(2): 137–154.

    Article  Google Scholar 

  20. Chang C C, Lin C J. LIBSVM: A library for support vector machines. ACM Transactions on Intelligent Systems and Technology, 2011, 2(3): Article No.27.

  21. Guo G, Mu G, Fu Y, Huang T S. Human age estimation using bio-inspired features. In Proc. IEEE Conference on Computer Vision and Pattern Recognition, June 2009, pp.112–119.

  22. Yang M, Zhu S, Lv F, Yu K. Correspondence driven adaptation for human profile recognition. In Proc. the 24th IEEE Conference on Computer Vision and Pattern Recognition, June 2011, pp.505–512.

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Tam V. Nguyen.

Additional information

This work is partially supported by Singapore Ministry of Education under Research Grant No. MOE2012-TIF-2-G-016, and also partially by the National Natural Science Foundation of China under Grant No. 61328205.

Electronic supplementary material

Below is the link to the electronic supplementary material.

ESM 1

(PDF 81 kb)

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Nguyen, T.V., Feng, J. & Yan, S. Seeing Human Weight from a Single RGB-D Image. J. Comput. Sci. Technol. 29, 777–784 (2014). https://doi.org/10.1007/s11390-014-1467-0

Download citation

  • Received:

  • Revised:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11390-014-1467-0

Keywords

Navigation