skip to main content
10.1145/3594315.3594341acmotherconferencesArticle/Chapter ViewAbstractPublication PagesiccaiConference Proceedingsconference-collections
research-article

Continuity Rotation Representation for Head Pose Estimation without Keypoints

Authors Info & Claims
Published:02 August 2023Publication History

ABSTRACT

In this paper, we present an improved end-to-end head pose estimation method in an unconstrained environment, which transforms the Head Pose Estimation(HPE) problem into a problem of directly predicting continuous 6D rotation matrix parameters belongs 3D Special Orthogonal Group(SO(3)). The method uses RepVGGplus-L2pse as the backbone, followed by one FC layer to output the results, model be trained on 300W-LP. The improved Root Mean Square Error of Geodesic Distance(RSME_GD) is used as the loss function to enhance the accuracy. The experiments on the two public face datasets AFLW-2000 and BIWI show that the results measured by Mean Absolute Error of Vectors (MAEV) are improved by 19.68% and 13.98% respectively compared with the original SOTA method.

References

  1. Zhu, X., Lei, Z., Liu, X., Shi, H. and Li, S. Z. Face alignment across large poses: A 3d solution. City, 2016.Google ScholarGoogle Scholar
  2. Zhu, X., Lei, Z., Yan, J., Yi, D. and Li, S. Z. High-fidelity pose and expression normalization for face recognition in the wild. City, 2015.Google ScholarGoogle Scholar
  3. Fanelli, G., Dantone, M., Gall, J., Fossati, A. and Van Gool, L. Random forests for real time 3d face analysis. International journal of computer vision, 101 (2013), 437-458.Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Cao, Z. W., Chu, Z. C., Liu, D. F., Chen, Y. J. and Ieee A Vector-based Representation to Enhance Head Pose Estimation. Ieee Computer Soc, City, 2021.Google ScholarGoogle Scholar
  5. Tang, H., Dai, M., Du, X., Hung, J.-L. and Li, H. An EEG study on college students’ attention levels in a blended computer science class. Innovations in Education and Teaching International (2023), 1-13.Google ScholarGoogle Scholar
  6. Jahanmahin, R., Masoud, S., Rickli, J. and Djuric, A. Human-robot interactions in manufacturing: A survey of human behavior modeling. Robotics and Computer-Integrated Manufacturing, 78 (2022), 102404.Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Lundgren, A. V. A., Santos, M. A. O. d., Bezerra, B. L. D. and Bastos-Filho, C. J. A. Systematic Review of Computer Vision Semantic Analysis in Socially Assistive Robotics. Ai, 3, 1 (2022), 229-249.Google ScholarGoogle Scholar
  8. Suzuki, R., Karim, A., Xia, T., Hedayati, H. and Marquardt, N. Augmented reality and robotics: A survey and taxonomy for ar-enhanced human-robot interaction and robotic interfaces. City, 2022.Google ScholarGoogle Scholar
  9. Oniki, K., Kikuchi, T. and Ozasa, Y. Training Data Generation Based on Observation Probability Density for Human Pose Refinement. Journal of Image and Graphics, 9, 2 (2021), 50-54.Google ScholarGoogle ScholarCross RefCross Ref
  10. Zhou, Y. and Gregson, J. Whenet: Real-time fine-grained estimation for wide range head pose. arXiv preprint arXiv:2005.10353 (2020).Google ScholarGoogle Scholar
  11. Yang, T. Y., Chen, Y. T., Lin, Y. Y. and Chuang, Y. Y. FSA-Net: Learning Fine-Grained Structure Aggregation for Head Pose Estimation From a Single Image. City, 2019.Google ScholarGoogle Scholar
  12. Zhang, H., Wang, M. M., Liu, Y., Yuan, Y. and Assoc Advancement Artificial, I. FDN: Feature Decoupling Network for Head Pose Estimation. Assoc Advancement Artificial Intelligence, City, 2020.Google ScholarGoogle Scholar
  13. Hsu, H.-W., Wu, T.-Y., Wan, S., Wong, W. H. and Lee, C.-Y. QuatNet: Quaternion-Based Head Pose Estimation With Multiregression Loss. IEEE Transactions on Multimedia, 21, 4 (2019), 1035-1046.Google ScholarGoogle Scholar
  14. Zhou, Y., Barnes, C., Lu, J., Yang, J. and Li, H. On the Continuity of Rotation Representations in Neural Networks. In Proceedings of the 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2019), [insert City of Publication],[insert 2019 of Publication].Google ScholarGoogle ScholarCross RefCross Ref
  15. Xiang, Y., Schmidt, T., Narayanan, V. and Fox, D. Posecnn: A convolutional neural network for 6d object pose estimation in cluttered scenes. arXiv preprint arXiv:1711.00199 (2017).Google ScholarGoogle Scholar
  16. Hempel, T., Abdelrahman, A. A. and Al-Hamadi, A. 6d Rotation Representation For Unconstrained Head Pose Estimation. City, 2022.Google ScholarGoogle Scholar
  17. Wu, C. Y., Xu, Q. G., Neumann, U. and Soc, I. C. Synergy between 3DMM and 3D Landmarks for Accurate 3D Facial Geometry. Ieee Computer Soc, City, 2021.Google ScholarGoogle ScholarCross RefCross Ref
  18. Ruiz, N., Chong, E. and Rehg, J. M. Fine-grained head pose estimation without keypoints. City, 2018.Google ScholarGoogle Scholar
  19. Ding, X. H., Zhang, X. Y., Ma, N. N., Han, J. G., Ding, G. G., Sun, J. and Ieee Comp, S. O. C. RepVGG: Making VGG-style ConvNets Great Again. Ieee Computer Soc, City, 2021.Google ScholarGoogle Scholar
  20. He, K., Zhang, X., Ren, S. and Sun, J. Deep Residual Learning for Image Recognition. City, 2016.Google ScholarGoogle Scholar

Index Terms

  1. Continuity Rotation Representation for Head Pose Estimation without Keypoints

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Other conferences
      ICCAI '23: Proceedings of the 2023 9th International Conference on Computing and Artificial Intelligence
      March 2023
      824 pages
      ISBN:9781450399029
      DOI:10.1145/3594315

      Copyright © 2023 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 2 August 2023

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
      • Research
      • Refereed limited
    • Article Metrics

      • Downloads (Last 12 months)29
      • Downloads (Last 6 weeks)3

      Other Metrics

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format .

    View HTML Format