Unseen head pose prediction using dense multivariate label distribution

Sang, Gao-li; Chen, Hu; Huang, Ge; Zhao, Qi-jun

doi:10.1631/FITEE.1500235

Unseen head pose prediction using dense multivariate label distribution

Published: 11 June 2016

Volume 17, pages 516–526, (2016)
Cite this article

Frontiers of Information Technology & Electronic Engineering Aims and scope Submit manuscript

Gao-li Sang^1,2,
Hu Chen¹,
Ge Huang¹ &
…
Qi-jun Zhao¹

100 Accesses
6 Citations
Explore all metrics

Abstract

Accurate head poses are useful for many face-related tasks such as face recognition, gaze estimation, and emotion analysis. Most existing methods estimate head poses that are included in the training data (i.e., previously seen head poses). To predict head poses that are not seen in the training data, some regression-based methods have been proposed. However, they focus on estimating continuous head pose angles, and thus do not systematically evaluate the performance on predicting unseen head poses. In this paper, we use a dense multivariate label distribution (MLD) to represent the pose angle of a face image. By incorporating both seen and unseen pose angles into MLD, the head pose predictor can estimate unseen head poses with an accuracy comparable to that of estimating seen head poses. On the Pointing’04 database, the mean absolute errors of results for yaw and pitch are 4.01° and 2.13°, respectively. In addition, experiments on the CAS-PEAL and CMU Multi-PIE databases show that the proposed dense MLD-based head pose estimation method can obtain the state-of-the-art performance when compared to some existing methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Microsoft COCO: Common Objects in Context

Facial emotion recognition using convolutional neural networks (FERC)

Article 18 February 2020

Ninad Mehendale

ImageNet Large Scale Visual Recognition Challenge

Article 11 April 2015

Olga Russakovsky, Jia Deng, … Li Fei-Fei

References

Aghajanian, J., Prince, S.J.D., 2009. Face pose estimation in uncontrolled environments. Proc. British Machine Vision Conf., p.1–11.
Google Scholar
Berger, A.L., Pietra, V.J.D., Pietra, S.A.D., 1996. A maximum entropy approach to natural language processing. Comput. Ling., 22(1): 39–71.
Article Google Scholar
Bowyer, K.W., Chang, K., Flynn, P., 2006. A survey of approaches and challenges in 3D and multi-modal 3D+2D face recognition. Comput. Vis. Image Understand., 101(1): 1–15. http://dx.doi.org/10.1016/j.cviu.2005.05.005
Article Google Scholar
Brunelli, R., 1997. Estimation of pose and illuminant direction for face processing. Image Vis. Comput., 15(10): 741–748. http://dx.doi.org/10.1016/S0262-8856(97)00024-3
Article Google Scholar
Cai, Y., Yang, M.L., Li, Z.Q., 2015. Robust head pose estimation using a 3D morphable model. Math. Prob. Eng., 2015:678973.1–678973.10. http://dx.doi.org/10.1155/2015/678973
Google Scholar
Do, M.N., 2003. Fast approximation of Kullback-Leibler distance for dependence trees and hidden Markov models. IEEE Signal Process. Lett., 10(4): 115–118. http://dx.doi.org/10.1109/LSP.2003.809034
Article Google Scholar
Fenzi, M., Leal-Taixé, L., Rosenhahn, B., et al., 2013. Class generative models based on feature regression for pose estimation of object categories. Proc. IEEE Conf. on Computer Vision and Pattern Recognition, p.755–762.
Google Scholar
Fitzpatrick, P., 2000. Head Pose Estimation Without Manual Initialization. Report, Massachusetts Institute of Technology, Cambridge.
Book Google Scholar
Gao, W., Cao, B., Shan, S.G., et al., 2008. The CASPEAL large-scale Chinese face database and baseline evaluations. IEEE Trans. Syst. Man Cybern. A, 38(1): 149–161. http://dx.doi.org/10.1109/TSMCA.2007.909557
Article Google Scholar
Geng, X., Xia, Y., 2014. Head pose estimation based on multivariate label distribution. Proc. IEEE Conf. on Computer Vision and Pattern Recognition, p.1837–1842.
Google Scholar
Gourier, N., Hall, D., Crowley, J.L., 2004. Estimating face orientation from robust detection of salient facial features. Proc. Int. Workshop on Visual Observation of Deictic Gestures. Available from http://www-prima. inrialpes.fr/perso/Gourier/Faces/HPDatabase.html.
Google Scholar
Gross, R., Matthews, I., Cohn, J., et al., 2010. Multi-PIE. Image Vis. Comput., 28(5): 807–813. http://dx.doi.org/10.1016/j.imavis.2009.08.002
Article Google Scholar
Haj, M.A., Gonzà lez, J., Davis, L.S., 2012. On partial least squares in head pose estimation: how to simultaneously deal with misalignment. Proc. IEEE Conf. on Computer Vision and Pattern Recognition, p.2602–2609. http://dx.doi.org/10.1109/CVPR.2012.6247979
Google Scholar
Hu, C.L., Gong, L.Y., Wang, T.J., et al., 2014. An effective head pose estimation approach using Lie algebrized Gaussians based face representation. Multim. Tools Appl., 73(3): 1863–1884. http://dx.doi.org/10.1007/s11042-013-1676-5
Article Google Scholar
Huang, D., Storer, M., de la Torre, F., et al., 2011. Supervised local subspace learning for continuous head pose estimation. Proc. IEEE Conf. on Computer Vision and Pattern Recognition, p.2921–2928. http://dx.doi.org/10.1109/CVPR.2011.5995683
Google Scholar
Jain, V., Crowley, J.L., 2013. Head pose estimation using multi-scale Gaussian derivatives. Proc. 18th Scandinavian Conf. on Image Analysis, p.319–328. http://dx.doi.org/10.1007/978-3-642-38886-6_31
Google Scholar
Krüger, V., Sommer, G., 2002. Gabor wavelet networks for efficient head pose estimation. Image Vis. Comput., 20(9-10):665–672. http://dx.doi.org/10.1016/S0262-8856(02)00056-2
Article Google Scholar
Liu, D.C., Nocedal, J., 1989. On the limited memory BFGS method for large scale optimization. Math. Program., 45(1): 503–528. http://dx.doi.org/10.1007/BF01589116
Article MathSciNet Google Scholar
Lu, F., Sugano, Y., Okabe, T., et al., 2012. Head pose-free appearance-based gaze sensing via eye image synthesis. Proc. 21st Int. Conf. on Pattern Recognition, p.1008–1011.
Google Scholar
Lu, F., Okabe, T., Sugano, Y., et al., 2014. Learning gaze biases with head motion for head pose-free gaze estimation. Image Vis. Comput., 32(3): 169–179. http://dx.doi.org/10.1016/j.imavis.2014.01.005
Article Google Scholar
Ma, B.P., Chai, X.J., Wang, T.J., 2013. A novel feature descriptor based on biologically inspired feature for head pose estimation. Neurocomputing, 115:1–10. http://dx.doi.org/10.1016/j.neucom.2012.11.005
Article Google Scholar
Ma, B.P., Li, A.N., Chai, X.J., et al., 2014. CovGa: a novel descriptor based on symmetry of regions for head pose estimation. Neurocomputing, 143:97–108. http://dx.doi.org/10.1016/j.neucom.2014.06.014
Article Google Scholar
Ma, B.P., Huang, R., Qin, L., 2015. VoD: a novel image representation for head yaw estimation. Neurocomputing, 148:455–466. http://dx.doi.org/10.1016/j.neucom.2014.07.019
Article Google Scholar
Ma, X.H., Tan, Y.Q., Zheng, G.M., 2013. A fast classification scheme and its application to face recognition. J. Zhejiang Univ.-Sci. C (Comput. & Electron.), 14(7): 561–572. http://dx.doi.org/10.1631/jzus.CIDE1309
Article Google Scholar
Murphy-Chutorian, E., Trivedi, M.M., 2009. Head pose estimation in computer vision: a survey. IEEE Trans. Patt. Anal. Mach. Intell., 31(4): 607–626. http://dx.doi.org/10.1109/TPAMI.2008.106
Article Google Scholar
Pang, H., Lin, A., Holford, M., et al., 2006. Pathway analysis using random forests classification and regression. Bioinformatics, 22(16): 2028–2036. http://dx.doi.org/10.1093/bioinformatics/btl344
Article Google Scholar
Sim, T., Baker, S., Bsat, M., 2002. The CMU pose, illumination, and expression (PIE) database. Proc. 5th IEEE Int. Conf. on Automatic Face and Gesture Recognition, p.46–51. http://dx.doi.org/10.1109/AFGR.2002.1004130
Google Scholar
Tang, Y.Q., Sun, Z.N., Tan, T.N., 2014. A survey on head pose estimation. Patt. Recogn. Artif. Intell., 27(3): 213–225.
Google Scholar
Wu, J.W., Trivedi, M.M., 2008. A two-stage head pose estimation framework and evaluation. Patt. Recog., 41(3): 1138–1158. http://dx.doi.org/10.1016/j.patcog.2007.07.017
Article Google Scholar
Zhang, Z.P., Luo, P., Loy, C.C., et al., 2014. Facial landmark detection by deep multi-task learning. Proc. 13th European Conf. on Computer Vision, p.94–108. http://dx.doi.org/10.1007/978-3-319-10599-4_7
Google Scholar
Zhu, R.H., Sang, G.L., Cai, Y., et al., 2013. Head pose estimation with improved random regression forests. Proc. 8th Chinese Conf. on Biometric Recognition, p.457–465. http://dx.doi.org/10.1007/978-3-319-02961-0_57
Chapter Google Scholar
Zhu, X.X., Ramanan, D., 2012. Face detection, pose estimation, and landmark localization in the wild. Proc. IEEE Conf. on Computer Vision and Pattern Recognition, p.2879–2886. http://dx.doi.org/10.1109/CVPR.2012.6248014
Google Scholar

Download references

Author information

Authors and Affiliations

State Key Laboratory of Fundamental Science on Synthetic Vision, College of Computer Science, Sichuan University, Chengdu, 610064, China
Gao-li Sang, Hu Chen, Ge Huang & Qi-jun Zhao
College of Mathematics and Information Engineering, Jiaxing University, Jiaxing, 314001, China
Gao-li Sang

Authors

Gao-li Sang
View author publications
You can also search for this author in PubMed Google Scholar
Hu Chen
View author publications
You can also search for this author in PubMed Google Scholar
Ge Huang
View author publications
You can also search for this author in PubMed Google Scholar
Qi-jun Zhao
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Qi-jun Zhao.

Additional information

Project supported by the National Key Scientific Instrument and Equipment Development Project of China (No. 2013YQ49087903) and the National Natural Science Foundation of China (No. 61202160)

ORCID: Gao-li SANG, http://orcid.org/0000-0002-6567-1652

Rights and permissions

Reprints and permissions

About this article

Cite this article

Sang, Gl., Chen, H., Huang, G. et al. Unseen head pose prediction using dense multivariate label distribution. Frontiers Inf Technol Electronic Eng 17, 516–526 (2016). https://doi.org/10.1631/FITEE.1500235

Download citation

Received: 23 July 2015
Revised: 16 February 2016
Published: 11 June 2016
Issue Date: June 2016
DOI: https://doi.org/10.1631/FITEE.1500235

Keywords

CLC number

TP391.4

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Unseen head pose prediction using dense multivariate label distribution

Abstract

Access this article

Similar content being viewed by others

Microsoft COCO: Common Objects in Context

Facial emotion recognition using convolutional neural networks (FERC)

ImageNet Large Scale Visual Recognition Challenge

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Keywords

CLC number

Navigation

Unseen head pose prediction using dense multivariate label distribution

Abstract

Access this article

Similar content being viewed by others

Microsoft COCO: Common Objects in Context

Facial emotion recognition using convolutional neural networks (FERC)

ImageNet Large Scale Visual Recognition Challenge

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

CLC number

Search

Navigation