A Study of General Data Improvement for Large-Angle Head Pose Estimation

Bai, Jue; Peng, Chenglei; Li, Zhaoxu; Du, Sidan; Li, Yang

doi:10.1007/978-3-030-89131-2_18

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 13053)

Included in the following conference series:

International Conference on Computer Analysis of Images and Patterns

Abstract

Predicting the Euler angles of head pose from a single RGB image with an end-to-end CNN has become a popular application in recent years. However, existing methods ignored the rotation-order information contained in the Euler angles and always followed the traditional pitch-yaw-roll order. They also neglected the errors caused by outlier samples with large-angle poses. We analyzed these shortcomings and suggested improvements from the perspective of data distribution. We studied the influence of different rotation orders on the data distribution and showed that choosing an appropriate rotation order for learning head pose can significantly improve both the data distribution and the prediction accuracy. We then proposed a data enhancement method that increases the number of large-angle poses by randomly rotating the 2D images and solving for the corresponding head poses, which improves network performance on large-angle poses. Evaluated on two popular networks and different datasets, our methods proved to be effective and general.
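
The two ideas in the abstract, rotation order and pose-consistent image rotation, can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes that rotating the 2D image in-plane by an angle theta corresponds to left-multiplying the head rotation matrix by a rotation about the camera's optical (z) axis, and it picks two illustrative composition orders. The function names, axis assignments (pitch about x, yaw about y, roll about z) and sign conventions are assumptions made for the example.

```python
import numpy as np

def rot_x(a):
    c, s = np.cos(a), np.sin(a)
    return np.array([[1.0, 0.0, 0.0], [0.0, c, -s], [0.0, s, c]])

def rot_y(a):
    c, s = np.cos(a), np.sin(a)
    return np.array([[c, 0.0, s], [0.0, 1.0, 0.0], [-s, 0.0, c]])

def rot_z(a):
    c, s = np.cos(a), np.sin(a)
    return np.array([[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]])

def compose_zyx(pitch, yaw, roll):
    # Assumed order A: R = Rz(roll) @ Ry(yaw) @ Rx(pitch)
    return rot_z(roll) @ rot_y(yaw) @ rot_x(pitch)

def decompose_zyx(R):
    # Inverse of compose_zyx, valid while |yaw| < 90 degrees
    yaw = -np.arcsin(R[2, 0])
    pitch = np.arctan2(R[2, 1], R[2, 2])
    roll = np.arctan2(R[1, 0], R[0, 0])
    return pitch, yaw, roll

def compose_xyz(pitch, yaw, roll):
    # Assumed order B: R = Rx(pitch) @ Ry(yaw) @ Rz(roll)
    return rot_x(pitch) @ rot_y(yaw) @ rot_z(roll)

def decompose_xyz(R):
    # Inverse of compose_xyz, valid while |yaw| < 90 degrees
    yaw = np.arcsin(R[0, 2])
    pitch = np.arctan2(-R[1, 2], R[2, 2])
    roll = np.arctan2(-R[0, 1], R[0, 0])
    return pitch, yaw, roll

def rotate_in_plane(R, theta):
    # Assumption: an in-plane image rotation by theta acts as a rotation
    # about the camera z axis applied on top of the head rotation R.
    return rot_z(theta) @ R

# Same physical pose (pitch 20, yaw 30, roll 0 degrees), image rotated by 40 degrees
theta = np.radians(40.0)
pose = (np.radians(20.0), np.radians(30.0), 0.0)

angles_a = decompose_zyx(rotate_in_plane(compose_zyx(*pose), theta))
angles_b = decompose_xyz(rotate_in_plane(compose_xyz(*pose), theta))

print("order A (pitch, yaw, roll) in degrees:", np.degrees(angles_a))
print("order B (pitch, yaw, roll) in degrees:", np.degrees(angles_b))
```

Under order A the image rotation only adds to roll, while under order B all three angles generally change, so the new labels have to be solved from the rotated matrix rather than adjusted by hand. The sketch therefore shows why the label distribution a network sees depends on the chosen rotation order.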

Acknowledgement

We acknowledge the computational resources provided by the High-Performance Computing Center of the Collaborative Innovation Center of Advanced Microstructures, Nanjing University, and by the Nanjing Institute of Advanced Artificial Intelligence.

Author information

Authors and Affiliations

Nanjing University, Nanjing, 210046, China
Jue Bai, Chenglei Peng, Zhaoxu Li, Sidan Du & Yang Li

Corresponding authors

Correspondence to Chenglei Peng, Sidan Du or Yang Li.

Editor information

Editors and Affiliations

Cyprus University of Technology, Limassol, Cyprus
Nicolas Tsapatsoulis
University of Cyprus, Nicosia, Cyprus
Andreas Panayides
University of Cyprus, Nicosia, Cyprus
Theo Theocharides
Cyprus University of Technology, Limassol, Cyprus
Andreas Lanitis
University of Cyprus, Nicosia, Cyprus
Constantinos Pattichis
University of Salerno, Salerno, Italy
Mario Vento

About this paper

Cite this paper

Bai, J., Peng, C., Li, Z., Du, S., Li, Y. (2021). A Study of General Data Improvement for Large-Angle Head Pose Estimation. In: Tsapatsoulis, N., Panayides, A., Theocharides, T., Lanitis, A., Pattichis, C., Vento, M. (eds) Computer Analysis of Images and Patterns. CAIP 2021. Lecture Notes in Computer Science, vol 13053. Springer, Cham. https://doi.org/10.1007/978-3-030-89131-2_18

DOI: https://doi.org/10.1007/978-3-030-89131-2_18
Published: 31 October 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-89130-5
Online ISBN: 978-3-030-89131-2
eBook Packages: Computer Science, Computer Science (R0)
