Abstract
We propose a method for the coarse classification of head pose from low-resolution images. The method uses a cascade of three binary Support Vector Machine (SVM) classifiers. Two sets of appearance features, Similarity Distance Map (SDM) and Gabor Wavelet (GW), serve as input to the SVM classifiers. For training, we employ a large dataset that combines five publicly available databases. We test our approach with cross-validation using eight databases and on videos we collected in a lab experiment. The proposed method significantly improves on the results of existing schemes. In the cross-validation test, we achieved a head pose detection accuracy of 98.60%. Moreover, on videos collected in the lab under loosely constrained conditions, we obtained a head pose detection accuracy of 93.76% for high-resolution and 89.81% for low-resolution video.
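The decision flow of a three-stage cascade of binary classifiers can be sketched as follows. This is only an illustration under stated assumptions, not the authors' implementation: the abstract does not say which pose classes each stage separates, so the labels `class_a` to `class_d`, the two-dimensional feature vectors, and the hand-set weights are all hypothetical. Each stage stands in for the decision function of an already-trained linear SVM, and the SDM and GW feature extraction is assumed to have happened beforehand.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass
class LinearStage:
    """One binary stage, modelled as a linear decision function w.x + b.

    In a real system the weights and bias would come from a trained SVM;
    here they are hand-set, hypothetical values.
    """
    weights: Sequence[float]
    bias: float
    label: str  # hypothetical pose class emitted when this stage accepts

    def accepts(self, features: Sequence[float]) -> bool:
        score = sum(w * x for w, x in zip(self.weights, features)) + self.bias
        return score >= 0.0


def classify_pose(features: Sequence[float],
                  stages: Sequence[LinearStage],
                  fallback: str) -> str:
    """Run the stages in order; the first stage that accepts decides the class.

    If no stage accepts, the sample falls through to the fallback class.
    """
    for stage in stages:
        if stage.accepts(features):
            return stage.label
    return fallback


# Three binary stages give four possible outcomes (three labels plus fallback).
STAGES = [
    LinearStage(weights=(1.0, 0.0), bias=-0.5, label="class_a"),
    LinearStage(weights=(0.0, 1.0), bias=-0.5, label="class_b"),
    LinearStage(weights=(-1.0, -1.0), bias=0.2, label="class_c"),
]

if __name__ == "__main__":
    for sample in [(0.9, 0.1), (0.1, 0.9), (0.05, 0.05), (0.3, 0.3)]:
        print(sample, classify_pose(sample, STAGES, fallback="class_d"))
```

One property of this arrangement is that later stages only see samples rejected by earlier ones, so each binary classifier can be tuned for a narrower sub-problem than a single multi-class classifier would face.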
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Khaki, M., Ayoub, I., Javadtalab, A., Osman, H.A. (2019). Robust Classification of Head Pose from Low Resolution Images. In: Vento, M., Percannella, G. (eds) Computer Analysis of Images and Patterns. CAIP 2019. Lecture Notes in Computer Science, vol 11678. Springer, Cham. https://doi.org/10.1007/978-3-030-29888-3_43
DOI: https://doi.org/10.1007/978-3-030-29888-3_43
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-29887-6
Online ISBN: 978-3-030-29888-3
eBook Packages: Computer Science, Computer Science (R0)