Elsevier

Pattern Recognition

Volume 34, Issue 5, May 2001, Pages 1015-1031
Pattern Recognition

Facial modeling from an uncalibrated face image using a coarse-to-fine genetic algorithm

https://doi.org/10.1016/S0031-3203(00)00044-3Get rights and content

Abstract

This paper presents a genetic algorithm-based optimization approach for facial modeling from an uncalibrated face image using a flexible generic parameterized facial model (FGPFM). The FGPFM can be easily modified using the facial features as parameters of FGPFM to construct an accurate specific 3D facial model from only a photograph of an individual with a yawed face based on the projection transformation. The facial modeling problem is formulated as a parameter optimization problem and the objective function is also given. Moreover, a coarse-to-fine approach based on our intelligent genetic algorithm which can efficiently solve the large parameter optimization problems is used to accelerate the search for an optimal solution. Furthermore, sensitivity analysis and experimental results with texture mapping demonstrate the effectiveness of the proposed method.

Introduction

Face images have received considerable attention, particularly in the fields of computer vision and signal processing communities. For instance, model-based image coding methods have been proposed for future videophone and video conference services. However, the images in these applications are complex and highly variable, even for a specific individual. An important problem is how to create a 3D model of a specific individual. Automatic creation of a 3D facial model of a specific individual plays an important role in many applications, such as model-based coding for narrow-band visual communication [1], [2], [3], [4], view-independent face recognition tasks [5], [6], and image synthesis problems in areas like virtualized reality [7] and synthesis of novel views [8], [9].

3D facial models can be categorized into two classes: those based on the view-independent 3D facial structure and those considering only view-dependent facial models. The view-dependent facial model uses multiview representation in which a set of 2D image-based example facial models are combined into a flexible 3D facial model by a weighted sum of given example facial models [9], [10]. The limitations of view-dependence narrow the scope of 3D facial model-related applications. The approaches used to automatically create 3D facial models with a view-independent 3D facial structure can be applied more extensively [3]. Approaches capable of creating a view-independent 3D facial model of a specific individual can be categorized into two groups: use of an actual 3D face of a specific individual and use of a generic facial model with 2D face images of an individual. The approaches that need an actual 3D face include active vision [2], 3D digitizer [8], and vision-based methods [11].

Approaches belonging to the second group in which the 3D facial model of a specific individual is constructed consist of two steps. Firstly, select a generic model representing the topological structure of a typical face and typical 2D face images of the individual. Secondly, adjust the geometrical shape of the generic model to that of the actual face images through 3D transformation and modification according to the positions of some facial features such as eyes, mouth, nose, and facial contour. Akimoto et al. [1], Tang and Huang [12], and Ip and Yin [7] used two orthogonal face images of an individual's head to acquire the features deemed necessary for fitting a generic model to an individual's head. Pei et al. [4] used the transformation of the generic facial model with an affine mapping for model-based image coding by tracking 3D contour feature points. Aizawa et al. [14] used multiple face images to adjust a flexible three-dimensional facial model for a particular face. Eisert and Girod [15] changed the texture and control points’ position using information from 3D laser scans to adjust the generic 3D model to a specific individual.

Given only a photograph of an individual's face shown in front-viewed or as a randomly yawed face when viewing the face from close range, can one automatically create an accurate 3D facial model of the individual? For this purpose, assessing the effectiveness of the above approaches is relatively difficult since no work has reported on the generalization performance to automatically create 3D facial models that takes the pose of the face and the death information in the fitting process into consideration from only a photograph.

Luo and King [13] developed a facial-feature-extraction algorithm to automate the process of fitting the general facial wire frame model to the actual face image. Furthermore, by introducing face orientation detection during the fitting process, the facial model construction method is robust with respect to its ability to adjust the specific facial 3D model for arbitrary orientations. However, the depth values of the facial model were adjusted in the z-direction by an amount proportional to its change in 2D image plane [13]. A weak perspective imaging process is assumed valid when depth changes on the face are small compared with the long distance between the face and the camera. This is generally a good approximation, except when viewing the face from close range with a short focal length lens, which results in significant perspective distortion in the image [35]. To adjust the 3D control points of a generic facial model to fit the face image accurately, the pose of the yawed face must be determined and depth information of facial features must be taken into account simultaneously, especially when viewing the face from close image. How to accurately modify the generic facial model to fit the specific face image from an uncalibrated face image using the perspective imaging process is investigated in the work.

Two fundamental problems which must fully cooperate with each other are the model modification method and the establishment of the generic facial model, described as follows.

In the light of the above two problems, this paper presents a GA-based optimization approach for facial modeling from an uncalibrated face image using a flexible generic parameterized facial model (FGPFM). The microstructural information can be expressed using the structural FGPFM with representative facial features that can be accurately found in the image. The reconstruction procedure can be regarded as a block function of FGPFM, and the input parameters are the 3D face-centered coordinates of control points. Once the control points are given, the desired 3D facial model is determined based on the topological and geometric descriptions of FGPFM. How to reconstruct the 3D facial model is transformed into a problem of how to acquire the accurate 3D control points.

Since the solution space is large and complex considering the large number of control points in 3D space, the proposed coarse-to-fine approach based on our intelligent genetic algorithm IGA [31] is used to efficiently solve the optimization problem. IGA is an efficient general-purposed algorithm capable of solving large parameter optimization problem. Coarse-to-fine approach can efficiently adjust control points in 3D space. The fitness function takes into account the evidence from the face image and human perception. The proposed coarse-to-fine IGA can effectively construct an optimal facial model. Merits of the proposed method are summarized as follow. (1) FGPFM is presented so that the good parameters, the control points, of FGPFM can yield the good facial model for a specific person. (2) An analytic solution for the pose determination of human faces (PDF) [32] from a monocular image is applied to obtain the initial 3D control points and make coarse-to-fine IGA more efficient. (3) The reconstruction problem is formulated as a parameter optimization problem based on the ability of FGPFM and PDF. Furthermore, the coarse-to-fine IGA is also used to speed up the search for an optimal solution which is a set of optimal control points. The facial model construction method can obtain more accurate facial models with respect to its uses of the perspective fitting process and the coarse-to-fine IGA for adjusting the 3D control points of FGPFM, compared with that of the existing ones, e.g. Ref. [13].

The rest of this paper is organized as follows. Section 2 describes how to establish a flexible generic parameterized facial model. Section 3 formulates the facial modeling problem as an optimization problem and also outlines the reconstruction procedure. Section 4 summaries the sensitivity analysis and shows the experimental results with texture mapping. Conclusions are finally made in Section 5.

Section snippets

Establishment of FGPFM

The FGPFM consists of a topological structure and geometric knowledge of human faces. The topological description consists of a set of well-designed triangular polygons with a multi-layered elastic structure in which the microstructural information can be expressed without complicated facial feature. All the geometric values are obtained from a set of training facial models using statistical approaches and genetic algorithms.

Facial modeling as an optimization problem

As widely recognized, accurate 3D control points based on FGPFM can lead to an accurate 3D facial model of a specific individual. Herein, the reconstruction problem is formulated as a parameter optimization problem as follows.FindasetofcontrolpointsV1,suchthatF(V1)=Min.F(V1),where V1 is the set of control points of the FGPFM. The two major problems are:

(a)How to construct the fitness function F(V1)?
(b)How to search for the optimal solution V1?
A coarse-to-fine IGA is presented to cope with

Experimental results

In this section, five experiments using FGPFM, synthetic face images and actual face images are analyzed to demonstrate the feasibility of the proposed method. In the first experiment, a known facial model is used to verify the superiority of IGA and the effectiveness of the coarse-to-fine approach. The second experiment demonstrates the effectiveness of the proposed fitting function and the high performance of the coarse-to-fine IGA. In the third and fourth experiments, an application applies

Conclusions

This study has presented a novel genetic algorithm-based optimization approach for facial modeling from an uncalibrated monocular face image using flexible generic parameterized facial model. The proposed method, has the following features. (1) FGPFM is presented so that the good parameters, the control points, of FGPFM can yield the good facial model for a specific person. (2) An analytic solution for the pose determination of human faces (PDF) from a monocular image is applied to obtain the

About the Author—SHINN-YING HO was born in Taiwan, ROC, on March 25, 1962. He received the B.S., M.S., and Ph.D. degrees in computer science and information engineering from National Chiao Tung University, Hsinchu, Taiwan, in 1984, 1986, and 1992, respectively. He is currently an associate professor in the Department of Information Engineering at Feng Chia University, Taichung, Taiwan. His research interests include image processing, pattern recognition, virtual reality applications of computer

References (37)

  • K. Aizawa et al.

    Model-based analysis synthesis image coding (MBASIC) system for a person's face

    Signal Process.: Image Commun.

    (1989)
  • A. Gee et al.

    Determining the gaze of faces in images

    Image Vision Comput.

    (1994)
  • G. Chow et al.

    Towards a system for automatic facial feature detection

    Pattern Recognition

    (1993)
  • Takaaki Akimoto et al.

    Automatic creation of 3D facial models

    IEEE Comput. Graphic Appl. 13

    (1993)
  • P. Eisert et al.

    Analyzing facial expressions for virtual conferencing

    IEEE Comput. Graphic Appl.

    (1998)
  • A. Lanitis et al.

    Automatic interpretation and coding of face images using flexible models

    IEEE Trans. Pattern Anal. Mach. Intell.

    (1997)
  • Soo-Chang Pei

    Global motion estimation in model-based image coding by tracking three-dimensional contour feature point

    IEEE Trans. Circutits Systems Video Technol.

    (1998)
  • S.Y. Lee et al.

    Recognition of human front faces using knowledge-based feature extraction and neuro-fuzzy algorithm

    Pattern Recognition

    (1994)
  • Z. Mao, A.J. Naftel, Improved area-based stereo matching using an image segmentation approach for 3-D facial Imaging,...
  • H.S. Ip, Li-Jun Yin, Arbitrary facial views generation from two orthogonal facial images, IEEE International...
  • D. Terzopoulos et al.

    Analysis and synthesis of facial image sequences using physical and anatomical models

    IEEE Trans. Pattern Anal. Mach. Intell.

    (1998)
  • T. Vetter

    Synthesis of novel views from a single face image

    Int. J. Comput. Vision

    (1998)
  • M.J. Tones et al.

    Multidimensional morphable models: a framework for representing and matching object classes

    Int. J. Comput. Vision

    (1998)
  • T.S. Huang et al.

    Motion and structure from orthographic projections

    IEEE Trans. Pattern Anal. Mach. Intell.

    (1989)
  • Li-an Tang, and Thomas S. Huang, Automatic construction of 3D human face models based on 2D images, IEEE Proceedings of...
  • Suhuai Luo, R.W. King, Automatic human modeling in model-base facial image coding, Proceedings of Conference on...
  • Peter Eisert et al.

    Analyzing facial expressions for virtual conferencing

    IEEE Comput. Graphic Appl.

    (1998)
  • A. Guarda, C. Le Gal, A. Lux, Evolving visual features and detectors, Computer graphics, image processing, and vision,...
  • Cited by (30)

    • Adaptive multi-resolution fitting and its application to realistic head modeling

      2004, Proceedings - Geometric Modeling and Processing 2004
    • Facial modeling from an uncalibrated face image using flexible generic parameterized facial models

      2001, IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
    • Robust human detection and localization in security applications

      2017, Concurrency and Computation: Practice and Experience
    • 3D face reconstruction using images from cameras with varying parameters

      2017, International Journal of Automation and Computing
    View all citing articles on Scopus

    About the Author—SHINN-YING HO was born in Taiwan, ROC, on March 25, 1962. He received the B.S., M.S., and Ph.D. degrees in computer science and information engineering from National Chiao Tung University, Hsinchu, Taiwan, in 1984, 1986, and 1992, respectively. He is currently an associate professor in the Department of Information Engineering at Feng Chia University, Taichung, Taiwan. His research interests include image processing, pattern recognition, virtual reality applications of computer vision, genetic algorithms, large parameter optimization problems, and system optimization.

    About the Author—HUI-LING HUANG was born in Taiwan, ROC, on April 4, 1968. She received the B.S. degree in Mathematics from Fu Jen C.U., Csin Chuang, Taiwan, in 1992, and the M.S. degree in information engineering form Feng Chia University, Taichung, Taiwan, in 1998. She is presently working toward the Ph.D. degree in the Institute of Information Engineering at Feng Chia University, Taichung, Taiwan. Her research interests include image processing, pattern recognition, virtual reality applications of computer vision, and large parameter optimization problems.

    This work was supported by the National Science Council, Taiwan, ROC, Grant No. NSC88-2213-E-035-012.

    View full text