skip to main content
10.1145/2671015.2671021acmconferencesArticle/Chapter ViewAbstractPublication PagesvrstConference Proceedingsconference-collections
research-article

Posture reconstruction using Kinect with a probabilistic model

Published:11 November 2014Publication History

ABSTRACT

Recent work has shown that depth image based 3D posture estimation hardware such as Kinect has made interactive applications more popular. However, it is still challenging to accurately recognize postures from a single depth camera due to the inherently noisy data derived from depth images and self-occluding action performed by the user. While previous research has shown that data-driven methods can be used to reconstruct the correct postures, they usually require a large posture database, which greatly limit the usability for systems with constrained hardware such as game console. To solve this problem, we present a new probabilistic framework to enhance the accuracy of the postures live captured by Kinect. We adopt the Gaussian Process model as a prior to leverage position data obtained from Kinect and marker-based motion capture system. We also incorporate a temporal consistency term into the optimization framework to constrain the velocity variations between successive frames. To ensure that the reconstructed posture resembles the observed input data from Kinect when its tracking result is good, we embed joint reliability into the optimization framework. Experimental results demonstrate that our system can generate high quality postures even under severe self-occlusion situations, which is beneficial for real-time posture based applications such as motion-based gaming and sport training.

Skip Supplemental Material Section

Supplemental Material

References

  1. Baak, A., Muller, M., Bharaj, G., Seidel, H.-P., and Theobalt, C. 2011. A data-driven approach for real-time full body pose reconstruction from a depth camera. In Proceedings of the 2011 International Conference on Computer Vision, ICCV '11, 1092--1099. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Bailey, S. W., and Bodenheimer, B. 2012. A comparison of motion capture data recorded from a vicon system and a microsoft kinect sensor. In Proceedings of the ACM Symposium on Applied Perception, SAP '12, 121--121. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Chai, J., and Hodgins, J. K. 2005. Performance animation from low-dimensional control signals. In ACM SIGGRAPH 2005 Papers, SIGGRAPH '05, 686--696. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Chan, J., Leung, H., Tang, J., and Komura, T. 2011. A virtual reality dance training system using motion capture technology. Learning Technologies, IEEE Transactions on 4, 2 (April), 187--195. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Han, J., Shao, L., Xu, D., and Shotton, J. 2013. Enhanced computer vision with microsoft kinect sensor: A review. Cybernetics, IEEE Transactions on 43, 5 (Oct), 1318--1334.Google ScholarGoogle Scholar
  6. Helten, T., Muller, M., Seidel, H.-P., and Theobalt, C. 2013. Real-time body tracking with one depth camera and inertial sensors. In Computer Vision (ICCV), 2013 IEEE International Conference on, 1105--1112. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Hoyet, L., McDonnell, R., and O'Sullivan, C. 2012. Push it real: Perceiving causality in virtual interactions. ACM Trans. Graph. 31, 4 (July), 90:1--90:9. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Iason Oikonomidis, N. K., and Argyros, A. 2011. Efficient model-based 3d tracking of hand articulations using kinect. In Proceedings of the British Machine Vision Conference, BMVA Press, 101.1--101.11.Google ScholarGoogle Scholar
  9. Izadi, S., Kim, D., Hilliges, O., Molyneaux, D., Newcombe, R., Kohli, P., Shotton, J., Hodges, S., Freeman, D., Davison, A., and Fitzgibbon, A. 2011. Kinectfusion: Real-time 3d reconstruction and interaction using a moving depth camera. In Proceedings of the 24th Annual ACM Symposium on User Interface Software and Technology, UIST '11, 559--568. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Kim, J., Seol, Y., and Lee, J. 2013. Human motion reconstruction from sparse 3d motion sensors using kernel cca-based regression. Computer Animation and Virtual Worlds 24, 6.Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Lawrence, N. 2009. Gaussian process library, http://www.cs.manchester.ac.uk/neill/gp/.Google ScholarGoogle Scholar
  12. Liu, H., Wei, X., Chai, J., Ha, I., and Rhee, T. 2011. Realtime human motion control with a small number of inertial sensors. In Symposium on Interactive 3D Graphics and Games, I3D '11, 133--140. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Microsoft, C. 2013. Kinect for windows sdk programming guide version 1.8.Google ScholarGoogle Scholar
  14. Morgan, T., Jarrell, D., and Vance, J. 2014. Poster: Rapid development of natural user interaction using kinect sensors and vrpn. In 3D User Interfaces (3DUI), 2014 IEEE Symposium on, 163--164.Google ScholarGoogle Scholar
  15. Quiñonero Candela, J., and Rasmussen, C. E. 2005. A unifying view of sparse approximate gaussian process regression. J. Mach. Learn. Res. 6 (Dec.), 1939--1959. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Rasmussen, C. E., and Williams, C. K. I. 2005. Gaussian Processes for Machine Learning (Adaptive Computation and Machine Learning). The MIT Press. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Shen, W., Deng, K., Bai, X., Leyvand, T., Guo, B., and Tu, Z. 2012. Exemplar-based human action pose correction and tagging. In Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on, 1784--1791. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Shotton, J., Fitzgibbon, A., Cook, M., Sharp, T., Finocchio, M., Moore, R., Kipman, A., and Blake, A. 2011. Real-time human pose recognition in parts from single depth images. In Proceedings of the 2011 IEEE Conference on Computer Vision and Pattern Recognition, CVPR '11, 1297--1304. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Shum, H., and Ho, E. S. 2012. Real-time physical modelling of character movements with microsoft kinect. In Proceedings of the 18th ACM symposium on Virtual reality software and technology, VRST '12, 17--24. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Shum, H. P. H., Ho, E. S. L., Jiang, Y., and Takagi, S. 2013. Real-time posture reconstruction for microsoft kinect. IEEE Transactions on Cybernetics 43, 5, 1357--1369.Google ScholarGoogle ScholarCross RefCross Ref
  21. Sidenbladh, H., and Black, M. 2001. Learning image statistics for bayesian tracking. In Computer Vision, 2001. ICCV 2001. Proceedings. Eighth IEEE International Conference on, vol. 2, 709--716 vol.2.Google ScholarGoogle Scholar
  22. Sigalas, M., Pateraki, M., Oikonomidis, I., and Trahanias, P. 2013. Robust model-based 3d torso pose estimation in rgb-d sequences. In Computer Vision Workshops (ICCVW), 2013 IEEE International Conference on, 315--322. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Tashev, I. 2013. Kinect development kit: A toolkit for gesture- and speech-based human-machine interaction {best of the web}. Signal Processing Magazine, IEEE 30, 5 (Sept), 129--131.Google ScholarGoogle Scholar
  24. Taylor, G., Sigal, L., Fleet, D., and Hinton, G. 2010. Dynamical binary latent variable models for 3d human pose tracking. In Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on, 631--638.Google ScholarGoogle Scholar
  25. Vondrak, M., Sigal, L., and Jenkins, O. 2013. Dynamical simulation priors for human motion tracking. Pattern Analysis and Machine Intelligence, IEEE Transactions on 35, 1 (Jan), 52--65. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Wei, X., Zhang, P., and Chai, J. 2012. Accurate realtime fullbody motion capture using a single depth camera. ACM Trans. Graph. 31, 6 (Nov.), 188:1--188:12. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. Yasin, H., Krüger, B., and Weber, A. 2013. Model based full body human motion reconstruction from video data. In Proceedings of the 6th International Conference on Computer Vision/Computer Graphics Collaboration Techniques and Applications, MIRAGE '13, 1:1--1:8. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Posture reconstruction using Kinect with a probabilistic model

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        VRST '14: Proceedings of the 20th ACM Symposium on Virtual Reality Software and Technology
        November 2014
        238 pages
        ISBN:9781450332538
        DOI:10.1145/2671015

        Copyright © 2014 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 11 November 2014

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article

        Acceptance Rates

        Overall Acceptance Rate66of254submissions,26%

        Upcoming Conference

        VRST '24

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader