Skip to main content

Advertisement

Log in

Dynamic random regression forests for real-time head pose estimation

  • Original Paper
  • Published:
Machine Vision and Applications Aims and scope Submit manuscript

Abstract

For real-time evaluation of the position and orientation of the human head using depth image, we propose a novel algorithm, the dynamic random regression forests (DRRF), which enhances the conventional random forests (RF) in four aspects. Firstly, the DRRF employs the boosting strategy for data induction to upgrade the learning quality; secondly, the key parameters are optimized in a dynamic manner in order to train the DRRF classifier efficiently; thirdly, a stem operator is integrated into the conventional tree-shaped classifier to increase the possibility of optimum data split; fourthly, a weighted voting scheme utilizes the learning knowledge to determine the regression result more efficiently and accurately. Comparative experiments verify the advantages of the aforementioned four improvement schemes, and demonstrate the DRRF’s accuracy and robustness against partial occlusion and the variations of head pose, illumination, and facial expression.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14

Similar content being viewed by others

References

  1. Breiman, L.: Random forests. Mach. Learn. 45(1), 5–32 (2001)

    Article  MATH  Google Scholar 

  2. Murphy-Chutorian, E., Trivedi, M.M.: Head pose estimation in computer vision: a survey. IEEE Trans. Pattern Anal. Mach. Intell. 31(4), 607–626 (2009)

    Article  Google Scholar 

  3. Yang, R.G., Zhang, Z.Y.: Model-Based Head Pose Tracking with Stereovision. In: IEEE International Conference on Atuomatic Face and Gesture Recognition, May 2002, pp. 255–260 (2002)

  4. Yao J., Cham W.K.: Efficient Model-Based Linear Head Motion Recovery from Movies. In: IEEE International Conference on Computer Vision and, Pattern Recognition, vol. 2, Jul. 2004, pp. 414–421 (2004)

  5. Whitehill J., Movellan, J.R.: A Discriminative Approach to Frame-by-Frame Head Pose Tracking. In: IEEE International Conference on Automatic Face and Gesture Recognition, Sept. 2008, pp. 1–7 (2008)

  6. Balasubramanian, V.N., Ye, J.P., Panchanathan, S.: Biased Manifold Embedding: A Framework for Person-Independent Head Pose Estimation. In: IEEE International Conference on Computer Vision and Pattern Recognition, Jun. 2007, pp. 1–7 (2007)

  7. Xiao, J., Baker, S., Matthews, I., Kanade, T.: Real-Time Combined 2D+3D Active Appearance Models. In: IEEE International Conference on Computer Vision and Pattern Recognition, vol. 2, Jul. 2004, pp. 535–542 (2004)

  8. Huang, C., Ding, X.Q., Fang, C.: Head Pose Estimation Based on Random Forests for Multiclass Classification. In: IEEE International Conference on Pattern Recognition, Aug. 2010, pp. 934–937 (2010)

  9. Huang, D., Storer, M., De La Torre, F., Bischof, H.: Supervised Local Subspace Learning for Continuous Head Pose Estimation. In: IEEE International Conference on Computer Vision and Pattern Recognition, Jun. 2011, pp. 2921–2928 (2011)

  10. Cai, Q., Gallup, D., Zhang, C., Zhang, Z.Y.: 3D Deformable Face Tracking with a Commodity Depth Camera. In: 11th European Conference on Computer Vision Part III, Sept. 2010, pp. 229–242 (2010)

  11. Weise, T., Bouaziz, S., Li, H., Pauly, M.: Realtime performance-based facial animation. SIGGRAPH 30(4), 77 (2011)

    Google Scholar 

  12. Malassiotis, S., Strintzis, M.G.: Robust real-time 3D head pose estimation from range data. Pattern Recogn. 38(8), 1153–1165 (2005)

    Article  Google Scholar 

  13. Chang, K.I., Bowyer, W., Flynn, P.J.: Multiple nose region matching for 3D face recognition under varying facial expression. IEEE Trans. Pattern Anal. Mach. Intell. 28(10), 1695–1700 (2006)

    Google Scholar 

  14. Sun, Y., Yin, L.J.: Automatic Pose Estimation of 3D Facial Models. In: International Conference on Pattern Recognition, pp. 1–4 (2008)

  15. Breitenstein, M.D., Kuettel, D., Weise, T., Van Gool, L., Pfister, H.: Real-Time Face Pose Estimation from Single Range Images. In: IEEE International Conference on Computer Vision and, Pattern Recognition, Jun. 2008, pp. 1–8 (2008)

  16. Breitenstein, M.D., Jensen, J., Hoilun, C., Moeslund, T.B., Gool, L.: Head Pose Estimation from Passive Stereo Images. In: Scandinavian Conference on Image Analysis, pp. 219–228 (2009)

  17. Bleiweiss A., Werman, M.: Robust Head Pose Estimation by Fusing Time-of-Flight Depth and Color. IEEE International Workshop on Multimedia Signal Processing, pp. 116–121 (2010)

  18. Zabulis, X., Sarmis, T., Argyros, A.A.: 3D Head Pose Estimation from Multiple Distant Views, In: British Machine Vision Conference (2009)

  19. Fanelli, G., Gall, J., Van Gool, L.: Real Time Head Pose Estimation with Random Regression Forests. In: IEEE International Conference on Computer Vision and Pattern Recognition, Jun. 2011, pp. 617–624 (2011)

  20. Fanelli, G., Weise, T., Gall, J., Van Gool, L.: Real Time Head Pose Estimation from Consumer Depth Cameras, 33rd Annual Symposium of the German Association for Pattern Recognition, Sept. 2011, pp. 101–110 (2011)

  21. Bernard, S., Adam, S., Heutte, L.: Dynamic random forests. Pattern Recogn. Lett. 33(12), 1580–1586 (2012)

    Article  Google Scholar 

  22. Jones M., Viola, P.: Fast Multi-View Face Detection, Technical Report TR2003-096, Mitsubishi Electric Research Laboratories (2003)

  23. Geurats, P., Ernst, D., Wehenkel, L.: Extremely randomized trees. Mach. Learn. 63(1), 3–42 (Apr. 2006)

    Google Scholar 

  24. Lepetit, V., Fua, P.: Keypoint recognition using randomized trees. IEEE Trans. Pattern Anal. Mach. Intell. 28(9), 1465–1479 (2006)

    Article  Google Scholar 

  25. Shotton, J., Fitzgibbon, A., Cook, M., Sharp, T., Finocchio, M., Moore, R., Kipman, A., Blake, A.: Real-Time Human Pose Recognition in Parts from Single Depth Images. In: IEEE International Conference on Computer Vision and Pattern Recognition, Jun. 2011, pp. 1297–1304 (2011)

  26. Ozuysal, M., Calonder, M., Lepetit, V., Fua, P.: Fast keypoint recognition using random ferns. IEEE Trans. Pattern Anal. Mach. Intell. 32(3), 448–461 (2010)

    Article  Google Scholar 

  27. Cheng, Y.Z.: Mean shift, mode seaking, and clustering. IEEE Trans. Pattern Anal. Mach. Intell. 17(8), 790–799 (1995)

    Article  Google Scholar 

  28. Padeleris, P., Zabulis, X.,Argyros, A.A. : Head Pose Estimation on Depth Data Based on Particle Swarm Optimization, IEEE International Workshop on Computer Vision and Pattern Recognition, Jun. 2012, pp. 42–49 (2012)

  29. Gall, J., Yao, A., Razavi, N., Van Gool, L., Lempitsky, V.: Hough forests for object detection, tracking, and action recognition, IEEE Trans. Pattern Anal Mach. Intell. 33(11), 2188–2202 (2011)

    Article  Google Scholar 

  30. Criminisi A., Shotton J., Robertson D., Konukoglu, E.: Regression Forests for Efficient Anatomy Detection and Localization in CT Studies. Medical Computer Vision: Recognition Techniques and Applications in Medical Imaging, pp. 106–117 (2010)

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ying Ying.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Ying, Y., Wang, H. Dynamic random regression forests for real-time head pose estimation. Machine Vision and Applications 24, 1705–1719 (2013). https://doi.org/10.1007/s00138-013-0524-y

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00138-013-0524-y

Keywords

Navigation