Abstract
For real-time evaluation of the position and orientation of the human head using depth image, we propose a novel algorithm, the dynamic random regression forests (DRRF), which enhances the conventional random forests (RF) in four aspects. Firstly, the DRRF employs the boosting strategy for data induction to upgrade the learning quality; secondly, the key parameters are optimized in a dynamic manner in order to train the DRRF classifier efficiently; thirdly, a stem operator is integrated into the conventional tree-shaped classifier to increase the possibility of optimum data split; fourthly, a weighted voting scheme utilizes the learning knowledge to determine the regression result more efficiently and accurately. Comparative experiments verify the advantages of the aforementioned four improvement schemes, and demonstrate the DRRF’s accuracy and robustness against partial occlusion and the variations of head pose, illumination, and facial expression.
Similar content being viewed by others
References
Breiman, L.: Random forests. Mach. Learn. 45(1), 5–32 (2001)
Murphy-Chutorian, E., Trivedi, M.M.: Head pose estimation in computer vision: a survey. IEEE Trans. Pattern Anal. Mach. Intell. 31(4), 607–626 (2009)
Yang, R.G., Zhang, Z.Y.: Model-Based Head Pose Tracking with Stereovision. In: IEEE International Conference on Atuomatic Face and Gesture Recognition, May 2002, pp. 255–260 (2002)
Yao J., Cham W.K.: Efficient Model-Based Linear Head Motion Recovery from Movies. In: IEEE International Conference on Computer Vision and, Pattern Recognition, vol. 2, Jul. 2004, pp. 414–421 (2004)
Whitehill J., Movellan, J.R.: A Discriminative Approach to Frame-by-Frame Head Pose Tracking. In: IEEE International Conference on Automatic Face and Gesture Recognition, Sept. 2008, pp. 1–7 (2008)
Balasubramanian, V.N., Ye, J.P., Panchanathan, S.: Biased Manifold Embedding: A Framework for Person-Independent Head Pose Estimation. In: IEEE International Conference on Computer Vision and Pattern Recognition, Jun. 2007, pp. 1–7 (2007)
Xiao, J., Baker, S., Matthews, I., Kanade, T.: Real-Time Combined 2D+3D Active Appearance Models. In: IEEE International Conference on Computer Vision and Pattern Recognition, vol. 2, Jul. 2004, pp. 535–542 (2004)
Huang, C., Ding, X.Q., Fang, C.: Head Pose Estimation Based on Random Forests for Multiclass Classification. In: IEEE International Conference on Pattern Recognition, Aug. 2010, pp. 934–937 (2010)
Huang, D., Storer, M., De La Torre, F., Bischof, H.: Supervised Local Subspace Learning for Continuous Head Pose Estimation. In: IEEE International Conference on Computer Vision and Pattern Recognition, Jun. 2011, pp. 2921–2928 (2011)
Cai, Q., Gallup, D., Zhang, C., Zhang, Z.Y.: 3D Deformable Face Tracking with a Commodity Depth Camera. In: 11th European Conference on Computer Vision Part III, Sept. 2010, pp. 229–242 (2010)
Weise, T., Bouaziz, S., Li, H., Pauly, M.: Realtime performance-based facial animation. SIGGRAPH 30(4), 77 (2011)
Malassiotis, S., Strintzis, M.G.: Robust real-time 3D head pose estimation from range data. Pattern Recogn. 38(8), 1153–1165 (2005)
Chang, K.I., Bowyer, W., Flynn, P.J.: Multiple nose region matching for 3D face recognition under varying facial expression. IEEE Trans. Pattern Anal. Mach. Intell. 28(10), 1695–1700 (2006)
Sun, Y., Yin, L.J.: Automatic Pose Estimation of 3D Facial Models. In: International Conference on Pattern Recognition, pp. 1–4 (2008)
Breitenstein, M.D., Kuettel, D., Weise, T., Van Gool, L., Pfister, H.: Real-Time Face Pose Estimation from Single Range Images. In: IEEE International Conference on Computer Vision and, Pattern Recognition, Jun. 2008, pp. 1–8 (2008)
Breitenstein, M.D., Jensen, J., Hoilun, C., Moeslund, T.B., Gool, L.: Head Pose Estimation from Passive Stereo Images. In: Scandinavian Conference on Image Analysis, pp. 219–228 (2009)
Bleiweiss A., Werman, M.: Robust Head Pose Estimation by Fusing Time-of-Flight Depth and Color. IEEE International Workshop on Multimedia Signal Processing, pp. 116–121 (2010)
Zabulis, X., Sarmis, T., Argyros, A.A.: 3D Head Pose Estimation from Multiple Distant Views, In: British Machine Vision Conference (2009)
Fanelli, G., Gall, J., Van Gool, L.: Real Time Head Pose Estimation with Random Regression Forests. In: IEEE International Conference on Computer Vision and Pattern Recognition, Jun. 2011, pp. 617–624 (2011)
Fanelli, G., Weise, T., Gall, J., Van Gool, L.: Real Time Head Pose Estimation from Consumer Depth Cameras, 33rd Annual Symposium of the German Association for Pattern Recognition, Sept. 2011, pp. 101–110 (2011)
Bernard, S., Adam, S., Heutte, L.: Dynamic random forests. Pattern Recogn. Lett. 33(12), 1580–1586 (2012)
Jones M., Viola, P.: Fast Multi-View Face Detection, Technical Report TR2003-096, Mitsubishi Electric Research Laboratories (2003)
Geurats, P., Ernst, D., Wehenkel, L.: Extremely randomized trees. Mach. Learn. 63(1), 3–42 (Apr. 2006)
Lepetit, V., Fua, P.: Keypoint recognition using randomized trees. IEEE Trans. Pattern Anal. Mach. Intell. 28(9), 1465–1479 (2006)
Shotton, J., Fitzgibbon, A., Cook, M., Sharp, T., Finocchio, M., Moore, R., Kipman, A., Blake, A.: Real-Time Human Pose Recognition in Parts from Single Depth Images. In: IEEE International Conference on Computer Vision and Pattern Recognition, Jun. 2011, pp. 1297–1304 (2011)
Ozuysal, M., Calonder, M., Lepetit, V., Fua, P.: Fast keypoint recognition using random ferns. IEEE Trans. Pattern Anal. Mach. Intell. 32(3), 448–461 (2010)
Cheng, Y.Z.: Mean shift, mode seaking, and clustering. IEEE Trans. Pattern Anal. Mach. Intell. 17(8), 790–799 (1995)
Padeleris, P., Zabulis, X.,Argyros, A.A. : Head Pose Estimation on Depth Data Based on Particle Swarm Optimization, IEEE International Workshop on Computer Vision and Pattern Recognition, Jun. 2012, pp. 42–49 (2012)
Gall, J., Yao, A., Razavi, N., Van Gool, L., Lempitsky, V.: Hough forests for object detection, tracking, and action recognition, IEEE Trans. Pattern Anal Mach. Intell. 33(11), 2188–2202 (2011)
Criminisi A., Shotton J., Robertson D., Konukoglu, E.: Regression Forests for Efficient Anatomy Detection and Localization in CT Studies. Medical Computer Vision: Recognition Techniques and Applications in Medical Imaging, pp. 106–117 (2010)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Ying, Y., Wang, H. Dynamic random regression forests for real-time head pose estimation. Machine Vision and Applications 24, 1705–1719 (2013). https://doi.org/10.1007/s00138-013-0524-y
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00138-013-0524-y