skip to main content
10.1145/3357254.3357278acmotherconferencesArticle/Chapter ViewAbstractPublication PagesaiprConference Proceedingsconference-collections
research-article

Face pose estimation with ensemble multi-scale representations

Authors Info & Claims
Published:16 August 2019Publication History

ABSTRACT

Face pose estimation plays important roles in broad applications such as visual based surveillance, face authentication, human-computer intelligent interactions, etc. However, face pose estimation is also a challenge issue, especially under complicated real application environments. In this paper, we proposed a novel face pose estimation approach with integrating two multi-scale representations. The first one is multi-scale VGG-Face representations, which using VGG-Face CNN as backbone three middle scale layer outputs are extracted and go through additional transfer learning. The second one is multi-scale Curvelet representations. These two sub multi-scale representations are integrated and then several dense layers processing are added to form the entire ensemble system which is used for the prediction of face pose. The experiment results show that the proposed approach achieved mean absolute errors (MAE) of 0.33° and 0.23° for yaw and pitch angle on CAS-PEAL pose database, and achieved mean absolute errors of 3.88° and 1.98° for yaw and pitch angle on Pointing'04 database.

References

  1. Ng, J., Gong, S. G. 2002, Composite support vector machines for detection of faces across views and pose estimation. Image and Vision Computing, 20(5--6):359--368Google ScholarGoogle Scholar
  2. Ng, J., Gong, S. G. 1999, Multi-view face detection and pose estimation using a composite support vector machine across the view sphere. In Proceedings of International Workshop on Recognition, Analysis, and Tracking of Faces and Gestures in Real-Time Systems, pp 14--21Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Chen, L., Zhang, L., Hu, Y. X., Li, M. J., Zhang, H. J. 2003, Head pose estimation using fisher manifold learning. Proceedings of the IEEE International Workshop on Analysis and Modeling of Faces and Gestures, pp 203--207Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Li, S. Z., Fu, Q., Gu, L., Scholkopf, B., Cheng, Y., Zhang, H. 2001, Kernel machine based learning for multi-view face detection and pose estimation. Proceedings of 8th IEEE International Conference on Computer Vision, 2: 674--679Google ScholarGoogle ScholarCross RefCross Ref
  5. McKenna, S. J., Gong, S. 1998, Real-time face pose estimation. Real-Time Imaging, 4(5): 333--347Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Wu, J. W., Trivedi, M. M. 2008, A two-stage head pose estimation framework and evaluation. Pattern Recognition, 41(3):1138--1158Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Gee, A. H., Cipolla, R. 1994, Determining the gaze of faces in images. Image and Vision Computing, 12(10):639--647Google ScholarGoogle ScholarCross RefCross Ref
  8. Wang, J. G., Sung, E. 2007, EM enhancement of 3D head pose estimated by point at infinity. Image and Vision Computing, 25(12):1864--1874Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Canton-Ferrer, C., Casas, J. R., Pardàs, M. 2008, Head orientation estimation using particle filtering in multiview scenarios. Multimodal Technologies for Perception of Humans, 4625:317--327Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Hinton, G., Salakhutdinov, R. 2006, Reducing the dimensionality of data with neural networks. Science, 313(5786):504--507Google ScholarGoogle ScholarCross RefCross Ref
  11. Hinton, G., Osindero, S. and The, Y. 2006, A fast learning algorithm for deep belief nets. Neural Computation, 18(7):1527--1554Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Bengio, Y. 2009, Learning Deep Architectures for AI. Foundations & Trends in Machine Learning, 2(1): 1--127Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Bengio, Y., Lecun, Y. 2010, Scaling learning algorithms towards AI. Large-Scale Kernel Machines, pp 321--359Google ScholarGoogle Scholar
  14. Lecun, Y., Bengio, Y., Hinton, G. 2015, Deep learning. Nature, 521: 436--444Google ScholarGoogle ScholarCross RefCross Ref
  15. Su, T. M., Cheng, F. Y., Han, Z. C., Ou, Z. Y. 2016, Pose Classification of Human Face Based on Deep Learning and Gradient Information Fusion. Journal of Data Acquisition and Processing, 31(5):941--948 (in Chinese)Google ScholarGoogle Scholar
  16. He, K., Zhang, X., Ren, S., Sun, J.. 2016, Deep Residual Learning for Image Recognition. IEEE Computer Society, pp 770--778.Google ScholarGoogle ScholarCross RefCross Ref
  17. Huang, G., Liu, Z., Maaten, L. V. D., Weinberger, K. Q. 2017, Densely Connected Convolutional Networks. In CVPRGoogle ScholarGoogle Scholar
  18. Ioffe, S., Szegedy, C. 2015, Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift. Proceedings of the 32nd International Conference on Machine Learning, pp 448--456Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Candès, E. J., Donoho, D. L. 2000, Curvelets---a surprisingly effective nonadaptive representationfor objects with edges. In: Rabut C, Cohen A, Schumaker LL (eds) Curves and Surfaces. Vanderbilt University Press, Nashville, pp 105--120Google ScholarGoogle Scholar
  20. Candes, E. J., Guo, F. 2002, New multiscale transforms, minimum total variation synthesis: Applications to edge-preserving image reconstruction. Signal Processing, 82:1519--1543Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Candes, E. J., Demanet, L., Donoho, D. L., Ying, L. X. 2006, Fast discrete curvelet transforms. Multiscale Model Simul, 5:861--899Google ScholarGoogle ScholarCross RefCross Ref
  22. Parkhi, O. M., Vedaldi, A., and Zisserman, A. 2015, Deep Face Recognition. British Machine Vision Conference, pp 41.1--41.12Google ScholarGoogle Scholar
  23. Simonyan, K., Zisserman, A. 2015, Very deep convolutional networks for large-scale image recognition. In International Conference on Learning RepresentationsGoogle ScholarGoogle Scholar
  24. Gao, W., Cao, B., Shan, S. G., Zhou, D. L., Zhang, X. H., Zhao, D. B. 2004, The CAS-PEAL large-scale chinese face database and baseline evaluations. http://www.jdl.ac.cn/peal/files/TechReport4CAS-PEAL-R1.pdfGoogle ScholarGoogle Scholar
  25. http://www.aiar.xjtu.edu.cn/groups/face/Chinese/Homepage.htmGoogle ScholarGoogle Scholar
  26. Gourier, N., Hall, D., Crowley, J. L. 2004, Estimating face orientation from robust detection of salient facial structures. Fg Net Workshop on Visual Observation of Deictic GesturesGoogle ScholarGoogle Scholar
  27. Hu, C., Gong, L., Wang, T., Liu, F., Feng, Q. 2014, An effective head pose estimation approach using Lie Algebrized Gaussians based face representation. Multimedia Tools and Applications, 73(3):1863--1884Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. Geng, X., Xia, Y. 2014, Head Pose Estimation Based on Multivariate Label Distribution. IEEE Conference on Computer Vision and Pattern RecognitionGoogle ScholarGoogle Scholar
  29. Sang, G., Chen, H., Huang, G., Zhao, Q. 2016,. Unseen head pose prediction using dense multivariate label distribution. Frontiers of Information Technology & Electronic Engineering, 17(6):516--526Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Face pose estimation with ensemble multi-scale representations

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Other conferences
      AIPR '19: Proceedings of the 2nd International Conference on Artificial Intelligence and Pattern Recognition
      August 2019
      198 pages
      ISBN:9781450372299
      DOI:10.1145/3357254
      • Conference Chairs:
      • Li Ma,
      • Xu Huang

      Copyright © 2019 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 16 August 2019

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
    • Article Metrics

      • Downloads (Last 12 months)2
      • Downloads (Last 6 weeks)0

      Other Metrics

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader