research-article

Enhanced Human Pose Estimation with Attention-Augmented HRNet

Authors:
Junjie Zhang

Beijing Normal University-Hong Kong Baptist University United International College, China

Beijing Normal University-Hong Kong Baptist University United International College, China

0009-0005-7741-7263
View Profile

,
Haojie Yang

School of Mathematical Sciences, Shanghai Jiao Tong University, China

School of Mathematical Sciences, Shanghai Jiao Tong University, China

0009-0003-9116-2346
View Profile

,
Yancong Deng

Jacob School of Engineering, University of California, United States

Jacob School of Engineering, University of California, United States

0000-0001-7047-1164
View Profile

IPMV '24: Proceedings of the 2024 6th International Conference on Image Processing and Machine VisionJanuary 2024Pages 88–93https://doi.org/10.1145/3645259.3645274

Published:03 May 2024Publication History

IPMV '24: Proceedings of the 2024 6th International Conference on Image Processing and Machine Vision

Pages 88–93

ABSTRACT

Abstract

Human pose estimation is a pivotal task in computer vision, aiming to predict the spatial locations of key body joints within an image accurately. The challenge arises from the need to understand complex human poses, occlusions, and variations in body configurations, which often perplex traditional pose estimation models. To bolster the accuracy and robustness of human pose estimation models, we introduce an Attention-Augmented HRNet Architecture. This proposed model augments the original HRNet by integrating self-attention mechanisms. These mechanisms capture long-range dependencies among keypoints and concentrate on pivotal body regions more effectively. Experimental results demonstrate that the Attention-Augmented HRNet surpasses the baseline HRNet that lacks attention, attaining state-of-the-art performance on the COCO dataset. Specifically, our model achieves an Average Precision (AP) of 74.5%.

References

Andriluka, M., Pishchulin, L., Gehler, P., & Schiele, B. (2014). “2d human pose estimation: New benchmark and state of the art analysis”. In Proceedings of the IEEE Conference on computer Vision and Pattern Recognition.Google ScholarDigital Library
Sun, K., Xiao, B., Liu, D., & Wang, J. (2019). “Deep high-resolution representation learning for human pose estimation”. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition.Google ScholarCross Ref
MacKenzie, I. Scott. (2012). Human-computer interaction: An empirical research perspective.Google Scholar
Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., ... & Polosukhin, I. (2017). “Attention is all you need”. Advances in neural information processing systems.Google Scholar
Albawi, S., Mohammed, T. A., & Al-Zawi, S. (2017). “Understanding of a convolutional neural network”. International conference on engineering and technology (ICET).Google ScholarCross Ref
Medsker, L. R., & Jain, L. C. (2001). “Recurrent neural networks”. Design and Applications, 5(2): 64-67.Google Scholar
Newell A, Yang K, Deng J. “Stacked hourglass networks for human pose estimation”. Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The Netherlands.Google Scholar
Woo, S., Park, J., Lee, J. Y., & Kweon, I. S. (2018), “Cbam: Convolutional block attention module”, Proceedings of the European conference on computer vision (ECCV), 3-19.Google ScholarDigital Library
Wang, J., Sun, K., Cheng, T., Jiang, B., Deng, C., Zhao, Y., ... & Fu, W. (2020). “Deep High-Resolution Representation Learning for Visual Recognition”. IEEE Transactions on Pattern Analysis and Machine Intelligence, 44(2): 665-678.Google Scholar
Lin T Y, Maire M, Belongie S, “Microsoft coco: Common objects in context”. Computer Vision–ECCV 2014: 13th European Conference, Zurich, Switzerland.Google Scholar

Recommendations

A survey of human pose estimation

Summarization of methods on human pose estimation in recent years.Conclusion of the traditional human pose estimation methods.Illustrated based on a two-stage framework.Comprehensive comparisons are given based on the open source methods. Estimating ...
Read More
Human pose estimation via multi-layer composite models

We introduce a hierarchical part-based approach for human pose estimation in static images. Our model is a multi-layer composite of tree-structured pictorial-structure models, each modeling human pose at a different scale and with a different graphical ...
Read More
Lightweight human pose estimation algorithm based on polarized self-attention
Abstract
In recent years, human pose estimation has been widely used in human-computer interaction, augmented reality, video surveillance, and many other fields, but the task of pose estimation still faces many challenges. To address the large number of ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in

IPMV '24: Proceedings of the 2024 6th International Conference on Image Processing and Machine Vision
January 2024
129 pages
ISBN:9798400708473
DOI:10.1145/3645259

Copyright © 2024 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 3 May 2024
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
Attention Mechanism
HRNet
Human Pose Estimation
Qualifiers
- research-article
- Research
- Refereed limited
Conference
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 0
  Total Citations
  View Citations
- 3
  Total Downloads
- Downloads (Last 12 months)3
- Downloads (Last 6 weeks)3
Other Metrics
View Author Metrics
Cited By
This publication has not been cited yet

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format .

View HTML Format

Enhanced Human Pose Estimation with Attention-Augmented HRNet

IPMV '24: Proceedings of the 2024 6th International Conference on Image Processing and Machine Vision

ABSTRACT

References

Cited By

Recommendations

A survey of human pose estimation

Human pose estimation via multi-layer composite models

Lightweight human pose estimation algorithm based on polarized self-attention

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

HTML Format

Caption

Enhanced Human Pose Estimation with Attention-Augmented HRNet

IPMV '24: Proceedings of the 2024 6th International Conference on Image Processing and Machine Vision

ABSTRACT

References

Cited By

Recommendations

A survey of human pose estimation

Human pose estimation via multi-layer composite models

Lightweight human pose estimation algorithm based on polarized self-attention

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

HTML Format

Share this Publication link

Share on Social Media