Skip to main content

Evaluation of Head Pose Estimation for Studio Data

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 4122))

Abstract

This paper introduces our head pose estimation system that localizes nose-tip of the faces and estimate head poses in studio quality pictures. After the nose-tip in the training data are manually labeled, the appearance variation caused by head pose changes is characterized by tensor model. Given images with unknown head pose and nose-tip location, the nose-tip of the face is localized in a coarse-to-fine fashion, and the head pose is estimated simultaneously by the head pose tensor model. The image patches at the localized nose tips are then cropped and sent to two other head pose estimators based on LEA and PCA techniques. We evaluated our system on the Pointing’04 head pose image database. With the nose-tip location known, our head pose estimators can achieve 94~96% head pose classification accuracy(within ±15o). With nose-tip unknown, we achieves 85% nose-tip localization accuracy(within 3 pixels from the ground truth), and 81~84% head pose classification accuracy(within ±15o).

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Wu, J.W., Pedersen, J.M., Putthividhya, D., Norgaard, D., Trivedi, M.M.: A Two-level Pose Estimation Framework Using Majority Voting of Gabor Wavelets and Bunch Graph Analysis. In: Pointing 2004 (2004)

    Google Scholar 

  2. Rainer Stiefelhagen: Estimating Head Pose with Neural Networks-Results on the Pointing04 ICPR Workshop Evaluation Data. In: Pointing 2004 (2004)

    Google Scholar 

  3. Gourier, N., Hall, D., Crowley, J.L.: Estimating Face orientation from Robust Detection of Salient Facial Structures. In: Pointing 2004 (2004)

    Google Scholar 

  4. Burt, P.J., Adelson, E.H.: The Laplacian pyramid as a compact image code. IEEE Trans. Commun. 31(4), 532–540 (1983)

    Article  Google Scholar 

  5. Vasilescu, M.A.O., Terzopoulos, D.: Multilinear Subspace Analysis for Image Ensembles. In: Proc. Computer Vision and Pattern Recognition Conf., vol. 2, pp. 93–99 (2003)

    Google Scholar 

  6. Vasilescu, M.A.O., Terzopoulos, D.: Multilinear Independent Components Analysis. In: Proc. Computer Vision and Pattern Recognition Conf. (2005)

    Google Scholar 

  7. Jones, M.J., Rehg, J.M.: Statistical Color Models with Application to Skin Detection. Int. J. of Computer Vision 46(1), 81–96 (2002)

    Article  MATH  Google Scholar 

  8. Fu, Y., Huang, T.S.: Graph Embedded Analysis for Head Pose Estimation. In: 7th IEEE International Conference Automatic Face and Gesture Recognition (2006)

    Google Scholar 

  9. Turk, M., Pentland, A.: Eigenfaces for recognition. Journal of Cognitive Neuroscience 3(1), 71–86 (1991)

    Article  Google Scholar 

  10. Ostu, N.: A thresholding selection method from gray-level histograms, IEEE Trans. IEEE Trans. Systems, Man & Cybernetics 9(1), 62–66 (1979)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Rainer Stiefelhagen John Garofolo

Rights and permissions

Reprints and permissions

Copyright information

© 2007 Springer Berlin Heidelberg

About this paper

Cite this paper

Tu, J., Fu, Y., Hu, Y., Huang, T. (2007). Evaluation of Head Pose Estimation for Studio Data. In: Stiefelhagen, R., Garofolo, J. (eds) Multimodal Technologies for Perception of Humans. CLEAR 2006. Lecture Notes in Computer Science, vol 4122. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-69568-4_25

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-69568-4_25

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-69567-7

  • Online ISBN: 978-3-540-69568-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics