Skip to main content

Robust Classification of Head Pose from Low Resolution Images

  • Conference paper
  • First Online:
Book cover Computer Analysis of Images and Patterns (CAIP 2019)

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 11678))

Included in the following conference series:

  • 1573 Accesses

Abstract

We propose a method for the coarse classification of head pose from low-resolution images. We devise a mechanism that uses a cascade of three binary Support Vector Machines (SVM) classifiers. We use two sets of appearance features, Similarity Distance Map (SDM) and Gabor Wavelet (GW) as input to the SVM classifiers. For training, we employ a large dataset that combines five publicly available databases. We test our approach with cross-validation using the eight databases and on videos we collected in a lab experiment. We found a significant improvement in the results achieved by the proposed method over existing schemes. In the cross-validation test, we achieved a head pose detection accuracy of 98.60%. Moreover, we obtained a head pose detection accuracy of 93.76% for high-resolution and 89.81% for low-resolution videos collected in the lab under loosely constrained conditions.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 64.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 84.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Langton, S., Honeyman, H., Tessler, E.: The influence of head contour and nose angle on the perception of eye-gaze direction. Percept. Psychophys. 66(5), 752–771 (2004)

    Article  Google Scholar 

  2. Stiefelhagen, R., Finke, M., Yang, J., Waibel, A.: From gaze to focus of attention. In: Huijsmans, D.P., Smeulders, A.W.M. (eds.) VISUAL 1999. LNCS, vol. 1614, pp. 765–772. Springer, Heidelberg (1999). https://doi.org/10.1007/3-540-48762-X_94

    Chapter  Google Scholar 

  3. Stiefelhagen, R., Zhu, J.: Head orientation and gaze direction in meetings. In: CHI 2002 Extended Abstracts on Human Factors in Computing Systems (2002)

    Google Scholar 

  4. Stiefelhagen, R.: Tracking focus of attention in meetings. In: 4th IEEE International Conference on Multimodal Interfaces. IEEE Computer Society (2002)

    Google Scholar 

  5. Orozco, J., Gong, S., Xiang, T.: Head pose classification in crowded scenes. In: British Machine Vision Conference, pp. 120.1–120.11. BMVA Press (2009). https://doi.org/10.5244/C.23.120

  6. Benfold, B., Reid, I.: Colour invariant head pose classification in low resolution video. In: British Machine Vision Conference, pp. 1–10 (2008)

    Google Scholar 

  7. Voit, M., Nickel, K., Stiefelhagen, R.: Multi-view head pose estimation using neural networks. In: The 2nd Canadian Conference on Computer and Robot Vision (CRV 2005), pp. 347–352. IEEE (2005). https://doi.org/10.1109/CRV.2005.55

  8. Voit, M., Nickel, K., Stiefelhagen, R.: A Bayesian approach for multi-view head pose estimation. In: 2006 IEEE International Conference on Multisensor Fusion and Integration for Intelligent Systems, pp. 31–34. IEEE (2006). https://doi.org/10.1109/MFI.2006.265627

  9. Tan, X., Triggs, B.: Fusing Gabor and LBP feature sets for kernel-based face recognition. In: Zhou, S.K., Zhao, W., Tang, X., Gong, S. (eds.) AMFG 2007. LNCS, vol. 4778, pp. 235–249. Springer, Heidelberg (2007). https://doi.org/10.1007/978-3-540-75690-3_18

    Chapter  Google Scholar 

  10. Ba, S., Odobez, J.: A probabilistic framework for joint head tracking and pose estimation. In: 2004 Proceedings of the 17th International Conference on Pattern Recognition, ICPR 2004, vol. 4 (2004)

    Google Scholar 

  11. Smith, K., Ba, S., Odobez, J.: Tracking the visual focus of attention for a varying number of wandering people (2008)

    Article  Google Scholar 

  12. Fanelli, G., Gall, J., Van Gool, L.: Real time head pose estimation with random regression forests. In: Conference on Computer Vision and Pattern Recognition, pp. 617–624. IEEE (2011). https://doi.org/10.1109/CVPR.2011.5995458

  13. Breitenstein, M.D., Kuettel, D., Weise, T., van Gool, L., Pfister, H.: Real-time face pose estimation from single range images. In: 2008 IEEE Conference on Computer Vision and Pattern Recognition, pp. 1–8. IEEE (2008). https://doi.org/10.1109/CVPR.2008.4587807

  14. Amos, B., Ludwiczuk, B., Satyanarayanan, M.: OpenFace: a general-purpose face recognition library with mobile applications. CMU School of Computer Science (2016)

    Google Scholar 

  15. Patacchiola, M., Cangelosi, A.: Head pose estimation in the wild using convolutional neural networks and adaptive gradient methods. Pattern Recogn. 71, 132–143 (2017). https://doi.org/10.1016/j.patcog.2017.06.009

    Article  Google Scholar 

  16. Robertson, N., Reid, I.: Estimating gaze direction from low-resolution faces in video. In: Leonardis, A., Bischof, H., Pinz, A. (eds.) ECCV 2006. LNCS, vol. 3952, pp. 402–415. Springer, Heidelberg (2006). https://doi.org/10.1007/11744047_31

    Chapter  Google Scholar 

  17. Comaniciu, D., Meer, P.: Mean shift analysis and applications. In: 1999 Proceedings of the Seventh IEEE International Conference on Computer Vision (1999)

    Google Scholar 

  18. Debnath, R., Takahide, N., Takahashi, H.: A decision based one-against-one method for multi-class support vector machine. Pattern Anal. Appl. 7, 164–175 (2004). https://doi.org/10.1007/s10044-004-0213-6

    Article  MathSciNet  Google Scholar 

  19. Branch, H.: Imagery library for intelligent detection systems (i-LIDS). In: 2006 Institution of Engineering and Technology Conference on Crime and Security. IET (2006)

    Google Scholar 

  20. Murphy-Chutorian, E., Trivedi, M.M.: Head pose estimation in computer vision: a survey. IEEE Trans. Pattern Anal. Mach. Intell. 31(4) (2009). https://doi.org/10.1109/TPAMI.2008.106

    Article  Google Scholar 

  21. Shen, L., Bai, L.: A review on Gabor wavelets for face recognition. Pattern Anal. Appl. 9, 273–292 (2006). https://doi.org/10.1007/s10044-006-0033-y

    Article  MathSciNet  Google Scholar 

  22. Pizer, S.M., et al.: Adaptive histogram equalization and its variations. Comput. Vis. Graph. Image Process. 39, 355–368 (1987). https://doi.org/10.1016/S0734-189X(87)80186-X

    Article  Google Scholar 

  23. Smith, B., Yin, Q., Feiner, S., Nayar, S.: Gaze locking: passive eye contact detection for human-object interaction. In: Proceedings of the 26th Annual ACM Symposium on User Interface Software and Technology (2013)

    Google Scholar 

  24. Ariz, M., Bengoechea, J.J., Villanueva, A., Cabeza, R.: A novel 2D/3D database with automatic face annotation for head tracking and pose estimation. Comput. Vis. Image Underst. 148, 201–210 (2016). https://doi.org/10.1016/j.cviu.2015.04.009

    Article  Google Scholar 

  25. Asteriadis, S., Soufleros, D., Karpouzis, K.: A natural head pose and eye gaze dataset. In: Proceedings of the International Workshop on Affective-Aware Virtual Agents and Social Robots (2009)

    Google Scholar 

  26. Sim, T., Baker, S., Bsat, M.: The CMU pose, illumination, and expression (PIE) database. In: Automatic Face Gesture (2002)

    Google Scholar 

  27. Gourier, N., Hall, D., Crowley, J.: Estimating face orientation from robust detection of salient facial structures. In: FG Net Workshop on Visual Observation of Deictic Gestures (2004)

    Google Scholar 

  28. Muñoz-Salinas, R., Yeguas-Bolivar, E., Saffiotti, A.: Multi-camera head pose estimation. Mach. Vis. Appl. 23(3), 479–490 (2012)

    Article  Google Scholar 

  29. Samaria, F., Harter, A.: Parameterisation of a stochastic model for human face identification. In: 1994 Proceedings of the Second IEEE Workshop on Applications of Computer Vision. IEEE (1994)

    Google Scholar 

  30. Liao, S., Jain, A., Li, S.: A fast and accurate unconstrained face detector. IEEE Trans. Pattern Anal. 38(2), 211–223 (2016)

    Article  Google Scholar 

  31. Smith, B.A., Yin, Q., Feiner, S.K., Nayar, S.K.: Gaze locking: passive eye contact detection for human-object interaction. In: Proceedings of the 26th Annual ACM Symposium on User Interface Software and Technology, UIST 2013, pp. 271–280. ACM Press, New York (2013). https://doi.org/10.1145/2501988.2501994

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hussein Al Osman .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Khaki, M., Ayoub, I., Javadtalab, A., Osman, H.A. (2019). Robust Classification of Head Pose from Low Resolution Images. In: Vento, M., Percannella, G. (eds) Computer Analysis of Images and Patterns. CAIP 2019. Lecture Notes in Computer Science(), vol 11678. Springer, Cham. https://doi.org/10.1007/978-3-030-29888-3_43

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-29888-3_43

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-29887-6

  • Online ISBN: 978-3-030-29888-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics