Abstract
Human gaze tracking plays an important role in the field of Human-Computer Interaction. This paper presents a brief review on appearance-based gaze tracking. Based on the appearance of human eyes, input features can be classified into three categories according to the different ways of extracting human eyes features, namely, complete human eye image, pixel-based feature and 3D reconstruction image. The estimation process from human eye feature to fixation point mainly uses different mapping functions. In this paper, common mapping functions and related algorithms are described in detail: k-nearest neighbor (KNN), random forest (RF) regression, gaussian process (GP) regression, support vector machines (SVM) and artificial neural networks (ANN). This paper evaluates the performance of these gaze tracking algorithms using different mapping functions. Based on the results of the evaluation, potential challenges are summarized and the future directions of gaze estimation are prospected.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Stiefelhagen, R., Yang, J.: Gaze tracking for multimodal human-computer interaction. In: 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing. IEEE, Munich (1997)
Morimoto, C.H., Mimica, M.R.M.: Eye gaze tracking techniques for interactive applications. Comput. Vis. Image Underst. 98(1), 4–24 (2015)
Sawahata, Y., Khosla, R., Komine, K.: Determining comprehension and quality of TV programs using eye-gaze tracking. Pattern Recogn. 41(5), 1610–1626 (2008)
Guan, Q., Tang, F., Zhou, X., Min, H.: A survey of 3D eye model based gaze tracking. J. Comput.-Aided Des. Comput. Graph. 29(9), 1579–1589 (2017)
Pham, C., Thiele, S., Parkinson, J., Li, S.: Alcohol warning label awareness and attention: a multi-method study. Alcohol Alcohol. 53(1), 1–7 (2017)
Pfeiffer, T.: Towards gaze interaction in immersive virtual reality : evaluation of a monocular eye tracking set-up. In: Schumann, M., Kuhlen, T. (eds.) Virtuelle und Erweiterte RealitatFunfter Workshop der GIFachgruppe VRAR, pp. 81–92. Shaker Verlag (2008)
Valenti, R., Gevers, T.: Accurate eye center location and tracking using isophote curvature. In: IEEE Conference on Computer Vision & Pattern Recognition, vol. 1, pp. 1–8. IEEE, Alaska (2008)
Valenti, R., Staiano, J., Sebe, N., Gevers, T.: Webcam-based visual gaze estimation. In: Foggia, P., Sansone, C., Vento, M. (eds.) ICIAP 2009. LNCS, vol. 5716, pp. 662–671. Springer, Heidelberg (2009). https://doi.org/10.1007/978-3-642-04146-4_71
Guestrin, E.D., Eizenman, M.: General theory of remote gaze estimation using the pupil center and corneal reflections. IEEE Trans. Biomed. Eng. 53(6), 1124–1133 (2006)
Zhang, Y., Bulling, A., Gellersen, H.: Discrimination of gaze directions using low-level eye image features. In: Proceedings of the 1st International Workshop on Pervasive Eye Tracking & Mobile Eye-Based Interaction, pp. 9–14. ACM, Beijing (2011)
Wang, Y.: Appearance-based gaze estimation using deep features and random forest regression. Knowl.-Based Syst. 110, 293–301 (2016)
Tan, K., Kriegman, D.J., Ahuja, N.: Appearance-based eye gaze estimation. In: 6th IEEE Workshop on Applications of Computer Vision, p. 191. IEEE Computer Society, Orlando (2002)
Martinez, F., Carbone, A., Pissaloux, E.: Gaze estimation using local features and non-linear regression. In: 19th IEEE International Conference on Image Processing, pp. 1961–1964. IEEE, Orlando (2013)
Guo, Z., Zhou, Z., Liu, Z.: Appearance-based gaze estimation under slight head motion. Multimed. Tools Appl. 76(2), 2203–2222 (2016)
Wood, E.,Tadas, B., Louis, M., Peter, R., Andreas, B.: Learning an appearance-based gaze estimator from one million synthesised images. In: Proceedings of the Ninth Biennial ACM Symposium on Eye Tracking Research and Applications, pp. 131–138. ACM, New York (2016)
Zhang,Y., Bulling, A., Gellersen, H.: Discrimination of gaze directions using low-level eye image features. In: Proceedings of the 1st International Workshop on Pervasive Eye Tracking & Mobile Eye-based Interaction, pp. 9–14. ACM, New York (2011)
Wang, Y., Zhao, T., Ding, X.: Learning a gaze estimator with neighbor selection from large-scale synthetic eye images. Knowl.-Based Syst. 139, 41–49 (2017)
Wang, Y.: Appearance-based gaze estimation using deep features and random forest regression. Knowl. Based Syst. 110, 293–301 (2016)
Sugano, Y., Matsushita, Y., Sato, Y.: Learning-by-Synthesis for Appearance-Based 3D Gaze Estimation. In: 2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1821–1828. IEEE, Columbus (2014)
Kacete, A., Séguier, R., Collobert, M., Royan, J.: Unconstrained gaze estimation using random forest regression voting. In: Lai, S.-H., Lepetit, V., Nishino, K., Sato, Y. (eds.) ACCV 2016. LNCS, vol. 10113, pp. 419–432. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-54187-7_28
Huang, Q., Veeraraghavan, A., Sabharwal, A.: Tabletgaze: dataset and analysis for unconstrained appearance-based gaze estimation in mobile tablets. Mach. Vis. Appl. 28(5–6), 445–461 (2017)
Ounpraseuth, S.T.: Gaussian processes for machine learning. Int. J. Neural Syst. 14(2), 69–106 (2004)
Wojke, N.: Gaze-estimation for consumer-grade cameras using a Gaussian process latent variable model. Pattern Recogn. Image Anal. 26(1), 248–255 (2016)
Williams, O., Blake, A.: Sparse and semi-supervised visual mapping with the S3 GP. In: 2016 IEEE Computer Society Conference on Computer Vision & Pattern Recognition, pp. 230–237. IEEE, New York (2006)
Ferhat, O., Vilarino, F., Sánchez, F.J.: A cheap portable eye-tracker solution for common setups. J. Eye Mov. Res. 7(3), 1–10 (2014)
Sugano, Y., Matsushita, Y.: Appearance-based gaze estimation using visual saliency. IEEE Trans. Pattern Anal. Mach. Intell. 35(2), 329–341 (2013)
Huang, Y., Dong, X., Hao, M.: Eye gaze calibration based on support vector regression machine. In: 9th World Congress on Intelligent Control and Automation, pp. 454–456. IEEE, Taipei (2011)
Zhu, Z., Ji, Q., Bennett, K.P.: Nonlinear eye gaze mapping function estimation via support vector regression. In: 18th International Conference on Pattern Recognition, pp. 1132–1135. IEEE, Hong Kong (2006)
Wu, Y.L., Yeh, C.T., Wei, H.: Gaze direction estimation using support vector machine with active appearance model. Multimed. Tools Appl. 70(3), 2037–2062 (2014)
Chuang, M.-C., Bala, R., Bernal, E.A., Paul, P., Burry, A.: Estimating gaze direction of vehicle drivers using a smartphone camera. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, pp. 165–170. IEEE, Columbus (2014)
Baluja, S., Pomerleau, D.: Non-intrusive gaze tracking using artificial neural networks. Adv. Neural. Inf. Process. Syst. 98(1), 753–760 (1993)
Yu, L., Xu, J., Huang, S.: Eye-gaze tracking system based on particle swarm optimization and BP neural network. In: 12th World Congress on Intelligent Control and Automation, pp. 1269–1273. IEEE, Guilin (2016)
Sewell, W., Komogortsev, O.: Real-time eye gaze tracking with an unmodified commodity webcam employing a neural network. In: Mynatt, E.D., Schoner, D., Fitzpatrick, G., Hudson, S.E., Edwards, W.K., Rodden, T. (eds.) Proceedings of the Extended Abstracts on Human Factors in Computing Systems (CHI EA 2010), pp. 3739–3744. ACM, Atlanta (2010)
Zhang, X., Sugano, Y., Fritz, M.: It’s written all over your face: full-face appearance-based gaze estimation. Comput. Vis. Pattern Recogn. 1(5), 2299–2308 (2016)
Cheng, Y., Lu, F., Zhang, X.: Appearance-based gaze estimation via evaluation-guided asymmetric regression. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) Computer Vision – ECCV 2018. LNCS, vol. 11218, pp. 105–121. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01264-9_7
Zhang, C., Rui, Y., Cai, J.: Efficient eye typing with 9-direction gaze estimation. Multimed. Tools Appl. 77(15), 1–18 (2017)
Palmero, C., Selva, J., Bagheri, M. A., Escalera, S.: Recurrent CNN for 3D gaze estimation using appearance and shape cues. Comput. Vis. Pattern Recogn. 1(3), 1–13 (2018)
Fischer, T., Chang, H.J., Demiris, Y.: RT-GENE: real-time eye gaze estimation in natural environments. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) ECCV 2018. LNCS, vol. 11214, pp. 339–357. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01249-6_21
Yu, Y., Liu, G., Odobez, J.-M.: Deep multitask gaze estimation with a constrained landmark-gaze model. In: Leal-Taixé, L., Roth, S. (eds.) ECCV 2018. LNCS, vol. 11130, pp. 456–474. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-11012-3_35
Acknowledgement
This work was supported by National Natural Science Foundation of China (61876168, U1509207), National Key R&D Program of China (2018YFB1305200), and Zhejiang Provincial Natural Science Foundation of China (LY18F030020).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Jiang, J., Zhou, X., Chan, S., Chen, S. (2019). Appearance-Based Gaze Tracking: A Brief Review. In: Yu, H., Liu, J., Liu, L., Ju, Z., Liu, Y., Zhou, D. (eds) Intelligent Robotics and Applications. ICIRA 2019. Lecture Notes in Computer Science(), vol 11745. Springer, Cham. https://doi.org/10.1007/978-3-030-27529-7_53
Download citation
DOI: https://doi.org/10.1007/978-3-030-27529-7_53
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-27528-0
Online ISBN: 978-3-030-27529-7
eBook Packages: Computer ScienceComputer Science (R0)