Synthesis of Novel Views from a Single Face Image

Vetter, Thomas

doi:10.1023/A:1008058932445

Published: June 1998

Volume 28, pages 103–116, (1998)

International Journal of Computer Vision

Thomas Vetter¹

Abstract

Images formed by a human face change with viewpoint. A new technique is described for synthesizing images of faces from new viewpoints, when only a single 2D image is available. A novel 2D image of a face can be computed without explicitly computing the 3D structure of the head. The technique draws on a single generic 3D model of a human head and on prior knowledge of faces based on example images of other faces seen in different poses. The example images are used to “learn” a pose-invariant shape and texture description of a new face. The 3D model is used to solve the correspondence problem between images showing faces in different poses.

The proposed method is relevant to view-independent face recognition tasks as well as to image synthesis problems in areas such as teleconferencing and virtualized reality.
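
The abstract does not spell out how example images of other faces yield a pose-invariant description. One plausible reading, assumed here rather than confirmed by the text above, is a linear-class model: the new face's shape in the known pose is written as a weighted sum of example shapes in that pose, and the same weights are then applied to the examples in the target pose. The sketch below illustrates only that idea on toy 2D point coordinates. The function names, the toy data, and the least-squares fit are illustrative assumptions; texture and the correspondence step via the generic 3D head model are left out.

```python
import math

def solve(a, b):
    """Solve the square system a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        s = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - s) / m[i][i]
    return x

def fit_coefficients(examples, target):
    """Least-squares weights c with sum_i c[i] * examples[i] ~= target (normal equations)."""
    k = len(examples)
    gram = [[sum(p * q for p, q in zip(examples[i], examples[j])) for j in range(k)]
            for i in range(k)]
    rhs = [sum(p * q for p, q in zip(examples[i], target)) for i in range(k)]
    return solve(gram, rhs)

def combine(examples, coeffs):
    """Weighted sum of example vectors."""
    return [sum(c * e[d] for c, e in zip(coeffs, examples))
            for d in range(len(examples[0]))]

def project(head, angle_deg):
    """Orthographic image of 3D points after rotating about the vertical axis."""
    t = math.radians(angle_deg)
    flat = []
    for x, y, z in head:
        flat += [x * math.cos(t) + z * math.sin(t), y]
    return flat

# Toy example heads (hypothetical data): three 3D feature points each.
head_a = [(0.0, 0.0, 1.0), (1.0, 0.0, 0.0), (0.0, 1.0, 0.5)]
head_b = [(0.0, 0.0, 2.0), (2.0, 1.0, 0.0), (1.0, 1.0, 0.0)]

# A "new" head inside the span of the examples. Its 3D points are used only to
# render the input image and the ground truth, never by the synthesis step.
new_head = [tuple(0.25 * p + 0.75 * q for p, q in zip(pa, pb))
            for pa, pb in zip(head_a, head_b)]

frontal_examples = [project(h, 0) for h in (head_a, head_b)]
rotated_examples = [project(h, 30) for h in (head_a, head_b)]

# Fit weights in the known (frontal) pose, then reuse them in the rotated pose.
coeffs = fit_coefficients(frontal_examples, project(new_head, 0))
predicted = combine(rotated_examples, coeffs)
truth = project(new_head, 30)

print("coefficients:", coeffs)
print("max error in rotated view:", max(abs(p - q) for p, q in zip(predicted, truth)))
```

The transfer is exact in this toy case only because the new head was constructed as a linear combination of the examples and the projection is linear. For a real face outside the span of the examples, the synthesized view would be an approximation.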

Author information

Authors and Affiliations

Max-Planck-Institut für Biologische Kybernetik, Spemannstr. 38, 72076 Tübingen, Germany
Thomas Vetter

About this article

Cite this article

Vetter, T. Synthesis of Novel Views from a Single Face Image. International Journal of Computer Vision 28, 103–116 (1998). https://doi.org/10.1023/A:1008058932445

Issue Date: June 1998
DOI: https://doi.org/10.1023/A:1008058932445
