A monocular 3D human pose estimation approach for virtual character skeleton retargeting

Yang, Aolei; Liu, Guancheng; Naeem, Wasif; Wu, Dakui; Zhou, Yinghong; Chen, Ling

doi:10.1007/s12652-023-04629-2

Original Research
Published: 18 May 2023

Volume 14, pages 9563–9574, (2023)

Journal of Ambient Intelligence and Humanized Computing

Aolei Yang ORCID: orcid.org/0000-0002-5077-9150¹,
Guancheng Liu¹,
Wasif Naeem²,
Dakui Wu¹,
Yinghong Zhou¹ &
Ling Chen³

Abstract

This paper presents a 3D human pose estimation approach for virtual character skeleton retargeting that requires only monocular visual equipment. First, the 2D human pose is obtained with the OpenPose method from continuous video frames captured by a monocular camera, and the corresponding 3D human pose is estimated by constructing a fused depth-channel pose estimation network. Next, a pose filter network is designed to smooth and optimize the 3D pose estimates using a sliding-window strategy. Human pose skeleton retargeting and optimizer methods, based on bone direction vectors and the re-projection error of joint points, are then proposed to support video motion capture applications and virtual character skeleton retargeting for animation. Finally, the approach is evaluated on the public Human3.6M dataset, where it achieves a mean per-joint position error (MPJPE) of 40.5 mm, lower than that of several benchmark methods.
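
As a rough illustration of two quantities named in the abstract, the sketch below computes a mean per-joint position error and retargets a pose onto a skeleton with different bone lengths by reusing bone direction vectors. It is a minimal sketch under stated assumptions, not the authors' implementation: the five-joint chain `PARENTS`, the function names, and the sample coordinates are all hypothetical, and the re-projection-error optimizer mentioned in the abstract is not modelled.

```python
import math

# Hypothetical 5-joint chain used only for illustration; PARENTS[j] is the
# parent index of joint j and -1 marks the root. Parents precede children.
PARENTS = [-1, 0, 1, 2, 3]


def mpjpe(pred, truth):
    """Mean Euclidean distance between predicted and true joints (input units)."""
    dists = [math.dist(p, t) for p, t in zip(pred, truth)]
    return sum(dists) / len(dists)


def bone_directions(pose, parents=PARENTS):
    """Unit vector from each joint's parent to the joint; None for the root."""
    dirs = []
    for j, p in enumerate(parents):
        if p < 0:
            dirs.append(None)
            continue
        v = [c - pc for c, pc in zip(pose[j], pose[p])]
        norm = math.sqrt(sum(x * x for x in v))
        if norm == 0:
            raise ValueError(f"zero-length bone at joint {j}")
        dirs.append(tuple(x / norm for x in v))
    return dirs


def retarget(pose, target_lengths, root=(0.0, 0.0, 0.0), parents=PARENTS):
    """Rebuild the pose on a skeleton whose bone j has length target_lengths[j].

    Each bone keeps the direction it has in the source pose; only its length
    changes. target_lengths[j] is ignored for the root joint.
    """
    dirs = bone_directions(pose, parents)
    out = [None] * len(parents)
    for j, p in enumerate(parents):
        if p < 0:
            out[j] = tuple(float(c) for c in root)
        else:
            out[j] = tuple(o + target_lengths[j] * d for o, d in zip(out[p], dirs[j]))
    return out


# Made-up source pose with unit-length bones (last bone has length 2).
source = [(0, 0, 0), (0, 1, 0), (0, 2, 0), (1, 2, 0), (1, 2, 2)]
# Made-up target character whose bones are all 2 units long.
lengths = [0, 2, 2, 2, 2]
character = retarget(source, lengths)
```

Because every bone keeps its source direction, `bone_directions(character)` matches `bone_directions(source)` even though the joint positions differ. When `mpjpe` is applied to poses expressed in millimetres, its result is in millimetres as well, which is the unit the abstract reports.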

Data availability

The datasets generated and analysed during the current study are available from the corresponding author on reasonable request.

References

Cao Z, Hidalgo G, Simon T et al (2021) OpenPose: realtime multi-person 2D pose estimation using part affinity fields. IEEE Trans Pattern Anal Mach Intell 43(1):172–186. https://doi.org/10.1109/TPAMI.2019.2929257
Chen Y, Shen C, Chen H et al (2020) Adversarial learning of structure-aware fully convolutional networks for landmark localization. IEEE Trans Pattern Anal Mach Intell 42(7):1654–1669. https://doi.org/10.1109/TPAMI.2019.2901875
Chen YC, Piccirilli M, Piramuthu R et al (2021) Self-attentive 3D human pose and shape estimation from videos. Comput Vis Image Underst 213(2021):103305–103318. https://doi.org/10.1016/j.cviu.2021.103305
Dabral R, Mundhada A, Kusupati U et al (2018) Learning 3D human pose from structure and motion. Eur Conf Comput Vision 11213:679–696. https://doi.org/10.1007/978-3-030-01240-3_41
Gu R, Wang G, Jiang Z et al (2020) Multi-person hierarchical 3D pose estimation in natural videos. IEEE Trans Circuits Syst Video Technol 30(11):4245–4257. https://doi.org/10.1109/TCSVT.2019.2953678
Ionescu C, Papava D, Olaru V et al (2014) Human3.6M: Large scale datasets and predictive methods for 3D human sensing in natural environments. IEEE Trans Pattern Anal Mach Intell 36(7):1325–1339. https://doi.org/10.1109/TPAMI.2013.248
Jain DK, Zareapoor M, Jain R et al (2020) GAN-Poser: an improvised bidirectional GAN model for human motion prediction. Neural Comput Appl 32(18):14579–14591. https://doi.org/10.1007/s00521-020-04941-4
Katircioglu I, Tekin B, Salzmann M et al (2018) Learning latent representations of 3D human pose with deep neural networks. Int J Comput Vision 126(12):1326–1341. https://doi.org/10.1007/s11263-018-1066-6
Kim DY, Chang JY (2021) Attention-based 3D human pose sequence refinement network. Sensors 21(13):4752–4770. https://doi.org/10.3390/s21134572
Li S, Zhang W, Chan A (2017) Maximum-margin structured learning with deep networks for 3D human pose estimation. Int J Comput Vision 122:149–168. https://doi.org/10.1007/s11263-016-0962-x
Li Y, Li K, Jiang S et al (2020) Geometry-driven self-supervised method for 3D human pose estimation. AAAI Conf Artif Intell. https://doi.org/10.1609/aaai.v34i07.6808
Loper M, Mahmood N, Romero J et al (2015) SMPL: a skinned multi-person linear model. ACM Trans Graph 34(6):1–16. https://doi.org/10.1145/2816795.2818013
Martinez J, Hossain R, Romero J et al (2017) A simple yet effective baseline for 3D human pose estimation. IEEE Int Conf Comput Vision. https://doi.org/10.1109/ICCV.2017.288
Mehta D, Sridhar S, Sotnychenko O et al (2017) VNect: real-time 3D human pose estimation with a single RGB camera. ACM Trans Graph 36(4):1–14. https://doi.org/10.1145/3072959.3073596
Moon G, Chang JY, Lee KM (2019) Camera distance-aware top-down approach for 3D multi-person pose estimation from a single RGB image. IEEE Int Conf Comput Vision. https://doi.org/10.1109/ICCV.2019
Naveen P, Sivakumar P (2021) Adaptive morphological and bilateral filtering with ensemble convolutional neural network for pose-invariant face recognition. J Ambient Intell Humaniz Comput 12:10023–10033. https://doi.org/10.1007/s12652-020-02753-x
Rogez G, Weinzaepfel P, Schmid C (2020) LCR-Net++: multi-person 2D and 3D pose detection in natural images. IEEE Trans Pattern Anal Mach Intell 42(5):1146–1161. https://doi.org/10.1109/TPAMI.2019.2892985
Selva J, Johansen AS, Escalera S et al (2023) Video transformers: a survey. IEEE Trans Pattern Anal Mach Intell. https://doi.org/10.1109/TPAMI.2023.3243465
Shi M, Aberman K, Aristidou A et al (2021) MotioNet: 3D human motion reconstruction from monocular video with skeleton consistency. ACM Trans Graph 40(1):1–15. https://doi.org/10.1145/3407659
Verma P, Srivastava R (2020) Three stage deep network for 3D human pose reconstruction by exploiting spatial and temporal data via its 2D pose. J Vis Commun Image Represent 71:102866–102876. https://doi.org/10.1016/j.jvcir.2020.102866
Wang M, Qiu F, Liu W et al (2020) Monocular human pose and shape reconstruction using part differentiable rendering. Comput Graph Forum 39(7):351–362. https://doi.org/10.1111/cgf.14150
Xia H, Xiao M (2020) 3D human pose estimation with generative adversarial networks. IEEE Access 8:206198–206206. https://doi.org/10.1109/ACCESS.2020.3037829
Yang A, Ren H, Fei M, Naeem W (2021b) Multi-person vision tracking approach based on human body localization features. Adv Manuf 9(4):496–508. https://doi.org/10.1007/s40436-021-00363-0
Yang D, Kim D, Lee SH (2021c) LoBSTr: Real-time lower-body pose prediction from sparse upper-body tracking signals. Comput Graph Forum 40(2):265–275. https://doi.org/10.1111/cgf.142631
Yang A, Chen Y, Naeem W, Fei M, Chen L (2021a) Humanoid motion planning of robotic arm based on human arm action feature and reinforcement learning. Mechatronics 78:102630. https://doi.org/10.1016/j.mechatronics.2021.102630
Zheng X, Chen X, Lu X (2020) A joint relationship aware neural network for single-image 3D human pose estimation. IEEE Trans Image Process 29:4747–4758. https://doi.org/10.1109/TIP.2020.2972104
Zhou X, Zhu M, Leonardos S et al (2017) Sparse representation for 3D shape estimation: a convex relaxation approach. IEEE Trans Pattern Anal Mach Intell 39(8):1648–1661. https://doi.org/10.1109/TPAMI.2016.2605097

Acknowledgements

This work was supported by the Natural Science Foundation of Shanghai under Grant 22ZR1424200.

Author information

Authors and Affiliations

School of Mechatronic Engineering and Automation, Shanghai University, Shanghai, 200444, China
Aolei Yang, Guancheng Liu, Dakui Wu & Yinghong Zhou
School of Electronics, Electrical Engineering and Computer Science, Queen’s University Belfast, Belfast, BT7 1NN, Northern Ireland, UK
Wasif Naeem
School of Engineering and Design, Hunan Normal University, Changsha, 410081, China
Ling Chen

Corresponding authors

Correspondence to Aolei Yang or Yinghong Zhou.

Ethics declarations

Conflict of interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary file1 (MP4 1755 KB)

Appendix

See Tables 7 and 8.

Table 7 Notation and description

Table 8 Abbreviation and description

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

About this article

Cite this article

Yang, A., Liu, G., Naeem, W. et al. A monocular 3D human pose estimation approach for virtual character skeleton retargeting. J Ambient Intell Human Comput 14, 9563–9574 (2023). https://doi.org/10.1007/s12652-023-04629-2

Received: 08 June 2022
Accepted: 02 May 2023
Published: 18 May 2023
Issue Date: July 2023
DOI: https://doi.org/10.1007/s12652-023-04629-2
