skip to main content
10.1145/1924559.1924581acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicvgipConference Proceedingsconference-collections
research-article

A robust head pose estimation system for uncalibrated monocular videos

Published:12 December 2010Publication History

ABSTRACT

We present a robust head pose estimation system that is capable of estimating the 3D pose of a human head in video sequences captured using a single camera. The proposed system is able to accurately estimate the 3D pose parameters even without the knowledge of camera parameters. The face is modelled using a parametrized face mask in 3D. SIFT is used to match consecutive image frames. We propose a novel interpolation technique that captures the 3D movement of feature points to estimate the 2D-3D correspondences between the 3D model and the face image. The pose is established using the POSIT algorithm in a RANSAC framework that fits a 3D deformable face model onto the given face image. We evaluate the performance of the proposed scheme on standard test datasets. The mean absolute errors of estimated pitch, yaw and roll are found comparable and in some cases better than the results reported in literature.

References

  1. B. Ma, W. Zhang, S. Shan, X. Chen, and W. Gao, "Robust head pose estimation using LGBP," Pattern Recognition, vol. 2, pp. 512--515, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. T. Gritti, "Toward fully automated face pose estimation," in Proceedings of the International workshop on Interactive multimedia for Consumer Electronics, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. E. Murphy-Chutorian and M. Trivedi, "Head pose estimation in computer vision: A survey," IEEE Transactions on PAMI, pp. 607--626, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. J. Jang and T. Kanade, "Robust 3D head tracking by online feature registration," in 8th IEEE International Conference on Automatic Face and Gesture Recognition, 2008.Google ScholarGoogle Scholar
  5. S. Choi and D. Kim, "Robust head tracking using 3D ellipsoidal head model in particle filter," Pattern Recognition, vol. 41, no. 9, pp. 2901--2915, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. S. Ohayon and E. Rivlin, "Robust 3d head tracking using camera pose estimation," in International Conference on Pattern Recognition, vol. 1, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. R. Ruddarraju, A. Haro, and I. Essa, "Fast multiple camera head pose tracking," Proceedings, Vision Interface, 2003.Google ScholarGoogle Scholar
  8. T. Brox, B. Rosenhahn, J. Gall, and D. Cremers, "Combined region and motion-based 3D tracking of rigid and articulated objects.," IEEE Transactions on PAMI, vol. 32, no. 3, p. 402, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. G. Aggarwal, A. Veeraraghavan, and R. Chellappa, "3d Facial pose tracking in Uncalibrated videos," Pattern Recognition and Machine Intelligence, pp. 515--520, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. D. Lowe, "Distinctive image features from scale-invariant keypoints," International journal of computer vision, vol. 60, no. 2, pp. 91--110, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. D. DeMenthon and L. Davis, "Model-based object pose in 25 lines of code," International Journal of Computer Vision, vol. 15, no. 1, pp. 123--141, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. M. Fischler and R. Bolles, "Random sample consensus: A paradigm for model fitting with applications to image analysis and automated cartography," Communications of the ACM, vol. 24, no. 6, pp. 381--395, 1981. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. J. Ahlberg, "Candide-3-an updated parametrized face," Report No. LiTH-ISY, 2001.Google ScholarGoogle Scholar
  14. C. Barber, D. Dobkin, and H. Huhdanpaa, "The Quickhull algorithm for convex hulls," ACM Transactions on Mathematical Software (TOMS), vol. 22, no. 4, pp. 469--483, 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. M. De Berg, O. Cheong, M. Van Kreveld, and M. Overmars, Computational geometry: Algorithms and applications. Springer, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. H. Edelsbrunner, Geometry and topology for mesh generation. Cambridge Univ. Press, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. C. Bradley, "The Algebra of Geometry: Cartesian, Areal and Projective Co-ordinates," Highperception Ltd., Bath, 2007.Google ScholarGoogle Scholar
  18. F. Dornaika and J. Ahlberg, "Face and facial feature tracking using deformable models," International Journal of Image and Graphics, vol. 4, no. 3, p. 499, 2004.Google ScholarGoogle ScholarCross RefCross Ref
  19. F. Dornaika and J. Ahlberg, "Fitting 3D face models for tracking and active appearance model training," Image and Vision Computing, vol. 24, no. 9, pp. 1010--1024, 2006.Google ScholarGoogle ScholarCross RefCross Ref
  20. I. Matthews and S. Baker, "Active appearance models revisited," International Journal of Computer Vision, vol. 60, no. 2, pp. 135--164, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. R. F. O. Jesorsky, K. Kirchberg, "Audio and Video based Person Authentication - AVBPA," IEEE Transactions on PAMI, 2001.Google ScholarGoogle Scholar
  22. R. Hartley and A. Zisserman, Multiple view geometry in computer vision. Cambridge University Press New York, NY, USA, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. M. La Cascia, S. Sclaroff, and V. Athitsos, "Fast, reliable head tracking under varying illumination: an approach based on registration of texture-mapped 3 D models," IEEE Transactions on PAMI, vol. 22, no. 4, pp. 322--336, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. A. Jepson, D. Fleet, and T. El-Maraghi, "Robust online appearance models for visual tracking," IEEE Transactions on PAMI, pp. 1296--1311, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. J. Xiao, T. Moriyama, T. Kanade, and J. Cohn, "Robust full-motion recovery of head by dynamic templates and re-registration techniques," International Journal of Imaging Systems and Technology, vol. 13, no. 1, pp. 85--94, 2003.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. A robust head pose estimation system for uncalibrated monocular videos

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Other conferences
          ICVGIP '10: Proceedings of the Seventh Indian Conference on Computer Vision, Graphics and Image Processing
          December 2010
          533 pages
          ISBN:9781450300605
          DOI:10.1145/1924559

          Copyright © 2010 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 12 December 2010

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article

          Acceptance Rates

          Overall Acceptance Rate95of286submissions,33%

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader