
Human pose estimation based on lightweight basicblock

  • Special Issue Paper
  • Published:
Machine Vision and Applications

Abstract

Human pose estimation based on deep learning has attracted increasing attention in the past few years and has shown superior performance on various datasets. Many researchers improve model accuracy by increasing the number of network layers. However, as networks deepen, their parameters and computation grow accordingly, which prevents deployment on edge devices and mobile terminals with limited computing power and constrains intelligent terminals in volume, power consumption, and storage. Inspired by lightweight design methods, we propose a human pose estimation model based on a lightweight network to address these problems. The model uses a lightweight basic block, built from depthwise separable convolution and an inverted bottleneck layer, to accelerate network computation and reduce the parameters of the overall network model. Experiments on the COCO dataset and the MPII dataset show that this lightweight basicblock module effectively reduces the parameters and computation of the human pose estimation model.
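The parameter savings from depthwise separable convolution and an inverted bottleneck can be seen from a simple count. The sketch below is illustrative only: the channel width, kernel size, and expansion ratio are assumed values, not the paper's exact block configuration.

```python
# Parameter-count sketch of why a depthwise separable / inverted-bottleneck
# basic block is lighter than a standard convolution. Biases and BatchNorm
# parameters are omitted; the layout follows the common MobileNetV2-style
# design, which may differ in detail from the block used in the paper.

def standard_conv_params(c_in, c_out, k):
    """k x k standard convolution: every output channel sees every input channel."""
    return c_in * c_out * k * k

def depthwise_separable_params(c_in, c_out, k):
    """k x k depthwise convolution (one filter per channel)
    followed by a 1x1 pointwise convolution that mixes channels."""
    return c_in * k * k + c_in * c_out

def inverted_bottleneck_params(c, t, k):
    """Inverted bottleneck: 1x1 expand (c -> t*c),
    k x k depthwise on t*c channels, 1x1 project (t*c -> c)."""
    hidden = t * c
    return c * hidden + hidden * k * k + hidden * c

if __name__ == "__main__":
    c, k, t = 64, 3, 4  # assumed: 64 channels, 3x3 kernel, expansion ratio 4
    print(standard_conv_params(c, c, k))        # 36864
    print(depthwise_separable_params(c, c, k))  # 4672, roughly 7.9x fewer
    print(inverted_bottleneck_params(c, t, k))  # 35072
```

At equal channel width, the depthwise separable block needs roughly k*k times fewer parameters than the standard convolution, which is the kind of reduction the lightweight basic block exploits.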


[Figs. 1–7: preview thumbnails; full figures available in the article PDF]



Funding

This research was funded by National Natural Science Foundation of China (NSFC) under Grant No. 61771299.

Author information


Corresponding author

Correspondence to Rui Wang.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.


About this article


Cite this article

Li, Y., Liu, R., Wang, X. et al. Human pose estimation based on lightweight basicblock. Machine Vision and Applications 34, 3 (2023). https://doi.org/10.1007/s00138-022-01352-4

