skip to main content
10.1145/3191442.3191444acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicigpConference Proceedingsconference-collections
research-article

Real-time Human Body Correspondences for Motion Tracking

Published:24 February 2018Publication History

ABSTRACT

Non-rigid human surface tracking systems have many important applications in virtual reality and mixed reality. However, current systems can hardly be applied to interactive scenarios for their performance or accuracy limitations. One of the biggest challenges is efficiently estimating accurate correspondences between the template human model and tracking data. To bridge the gap, we propose a hierarchical feature matching framework for computing accurate dense correspondences between human shapes in real-time. For input human mesh model and depth scan, we train a fully convolutional network to produce dense feature descriptors with local similarity on human surface. Base on that, correspondences are found by performing fast hierarchical matching on segmented human body. Our approach is robust to large motion and deformation, its efficiency is validated in related real-time scenarios, while its accuracy has been proven comparable to state-of-the-art offline methods.

References

  1. Shafaei, A. and Little, J.J., 2016, June. Real-Time Human Motion Capture with Multiple Depth Cameras. In Computer and Robot Vision (CRV), 2016 13th Conference on (pp. 24--31). IEEE.Google ScholarGoogle Scholar
  2. Shotton, J., Sharp, T., Kipman, A., Fitzgibbon, A., Finocchio, M., Blake, A., Cook, M. and Moore, R., 2013. Real-time human pose recognition in parts from single depth images. Communications of the ACM, 56(1), pp. 116--124. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Ye, M. and Yang, R., 2014. Real-time simultaneous pose and shape estimation for articulated objects using a single depth camera. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 2345--2352). Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Barmpoutis, A., 2013. Tensor body: Real-time reconstruction of the human body and avatar synthesis from RGB-D. IEEE transactions on cybernetics, 43(5), pp. 1347--1356.Google ScholarGoogle Scholar
  5. Zhang, L., Sturm, J., Cremers, D. and Lee, D., 2012, October. Real-time human motion tracking using multiple depth cameras. In Intelligent Robots and Systems (IROS), 2012 IEEE/RSJ International Conference on (pp. 2389--2395). IEEE.Google ScholarGoogle Scholar
  6. Chen, Q. and Koltun, V., 2015. Robust nonrigid registration by convex optimization. In Proceedings of the IEEE International Conference on Computer Vision (pp. 2039--2047). Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Newcombe, R.A., Fox, D. and Seitz, S.M., 2015. Dynamicfusion: Reconstruction and tracking of non-rigid scenes in real-time. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 343--352).Google ScholarGoogle Scholar
  8. Taylor, J., Shotton, J., Sharp, T. and Fitzgibbon, A., 2012, June. The vitruvian manifold: Inferring dense correspondences for one-shot human pose estimation. In Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on (pp. 103--110). IEEE. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Van Kaick, O., Zhang, H., Hamarneh, G. and Cohen-Or, D., 2011, September. A survey on shape correspondence. In Computer Graphics Forum (Vol. 30, No. 6, pp. 1681--1707). Blackwell Publishing Ltd.Google ScholarGoogle Scholar
  10. Vlasic, D., Baran, I., Matusik, W. and Popović, J., 2008, August. Articulated mesh animation from multi-view silhouettes. In ACM Transactions on Graphics (TOG) (Vol. 27, No. 3, p. 97). ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Wei, L., Huang, Q., Ceylan, D., Vouga, E. and Li, H., 2016. Dense human body correspondences using convolutional networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 1544--1553).Google ScholarGoogle Scholar
  12. Han, X., Leung, T., Jia, Y., Sukthankar, R. and Berg, A.C., 2015. Matchnet: Unifying feature and metric learning for patch-based matching. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 3279--3286).Google ScholarGoogle Scholar
  13. Long, J.L., Zhang, N. and Darrell, T., 2014. Do convnets learn correspondence?. In Advances in Neural Information Processing Systems (pp. 1601--1609). Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Masci, J., Boscaini, D., Bronstein, M. and Vandergheynst, P., 2015. Geodesic convolutional neural networks on riemannian manifolds. In Proceedings of the IEEE international conference on computer vision workshops (pp. 37--45). Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Ciresan, D., Giusti, A., Gambardella, L.M. and Schmidhuber, J., 2012. Deep neural networks segment neuronal membranes in electron microscopy images. In Advances in neural information processing systems (pp. 2843--2851). Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Farabet, C., Couprie, C., Najman, L. and LeCun, Y., 2013. Learning hierarchical features for scene labeling. IEEE transactions on pattern analysis and machine intelligence, 35(8), pp. 1915--1929. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Sermanet, P., Eigen, D., Zhang, X., Mathieu, M., Fergus, R. and LeCun, Y., 2013. Overfeat: Integrated recognition, localization and detection using convolutional networks. arXiv preprint arXiv:1312.6229.Google ScholarGoogle Scholar
  18. Long, J., Shelhamer, E. and Darrell, T., 2015. Fully convolutional networks for semantic segmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 3431--3440).Google ScholarGoogle Scholar
  19. Anguelov, D., Srinivasan, P., Koller, D., Thrun, S., Rodgers, J. and Davis, J., 2005, July. SCAPE: shape completion and animation of people. In ACM Transactions on Graphics (TOG) (Vol. 24, No. 3, pp. 408--416). ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Bogo, F., Romero, J., Loper, M. and Black, M.J., 2014. FAUST: Dataset and evaluation for 3D mesh registration. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 3794--3801). Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Rodolà, E., Rota Bulo, S., Windheuser, T., Vestner, M. and Cremers, D., 2014. Dense non-rigid shape correspondence using random forests. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 4177--4184). Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Kim, V.G., Lipman, Y. and Funkhouser, T., 2011, August. Blended intrinsic maps. In ACM Transactions on Graphics (TOG) (Vol. 30, No. 4, p. 79). ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Real-time Human Body Correspondences for Motion Tracking

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Other conferences
      ICIGP '18: Proceedings of the 2018 International Conference on Image and Graphics Processing
      February 2018
      183 pages
      ISBN:9781450363679
      DOI:10.1145/3191442

      Copyright © 2018 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 24 February 2018

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
      • Research
      • Refereed limited

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader