Facial modeling from an uncalibrated face image using a coarse-to-fine genetic algorithm

doi:10.1016/S0031-3203(00)00044-3

Pattern Recognition

Volume 34, Issue 5, May 2001, Pages 1015-1031

https://doi.org/10.1016/S0031-3203(00)00044-3 Get rights and content

Abstract

This paper presents a genetic algorithm-based optimization approach for facial modeling from an uncalibrated face image using a flexible generic parameterized facial model (FGPFM). The FGPFM can be easily modified using the facial features as parameters of FGPFM to construct an accurate specific 3D facial model from only a photograph of an individual with a yawed face based on the projection transformation. The facial modeling problem is formulated as a parameter optimization problem and the objective function is also given. Moreover, a coarse-to-fine approach based on our intelligent genetic algorithm which can efficiently solve the large parameter optimization problems is used to accelerate the search for an optimal solution. Furthermore, sensitivity analysis and experimental results with texture mapping demonstrate the effectiveness of the proposed method.

Introduction

Face images have received considerable attention, particularly in the fields of computer vision and signal processing communities. For instance, model-based image coding methods have been proposed for future videophone and video conference services. However, the images in these applications are complex and highly variable, even for a specific individual. An important problem is how to create a 3D model of a specific individual. Automatic creation of a 3D facial model of a specific individual plays an important role in many applications, such as model-based coding for narrow-band visual communication [1], [2], [3], [4], view-independent face recognition tasks [5], [6], and image synthesis problems in areas like virtualized reality [7] and synthesis of novel views [8], [9].

3D facial models can be categorized into two classes: those based on the view-independent 3D facial structure and those considering only view-dependent facial models. The view-dependent facial model uses multiview representation in which a set of 2D image-based example facial models are combined into a flexible 3D facial model by a weighted sum of given example facial models [9], [10]. The limitations of view-dependence narrow the scope of 3D facial model-related applications. The approaches used to automatically create 3D facial models with a view-independent 3D facial structure can be applied more extensively [3]. Approaches capable of creating a view-independent 3D facial model of a specific individual can be categorized into two groups: use of an actual 3D face of a specific individual and use of a generic facial model with 2D face images of an individual. The approaches that need an actual 3D face include active vision [2], 3D digitizer [8], and vision-based methods [11].

Approaches belonging to the second group in which the 3D facial model of a specific individual is constructed consist of two steps. Firstly, select a generic model representing the topological structure of a typical face and typical 2D face images of the individual. Secondly, adjust the geometrical shape of the generic model to that of the actual face images through 3D transformation and modification according to the positions of some facial features such as eyes, mouth, nose, and facial contour. Akimoto et al. [1], Tang and Huang [12], and Ip and Yin [7] used two orthogonal face images of an individual's head to acquire the features deemed necessary for fitting a generic model to an individual's head. Pei et al. [4] used the transformation of the generic facial model with an affine mapping for model-based image coding by tracking 3D contour feature points. Aizawa et al. [14] used multiple face images to adjust a flexible three-dimensional facial model for a particular face. Eisert and Girod [15] changed the texture and control points’ position using information from 3D laser scans to adjust the generic 3D model to a specific individual.

Given only a photograph of an individual's face shown in front-viewed or as a randomly yawed face when viewing the face from close range, can one automatically create an accurate 3D facial model of the individual? For this purpose, assessing the effectiveness of the above approaches is relatively difficult since no work has reported on the generalization performance to automatically create 3D facial models that takes the pose of the face and the death information in the fitting process into consideration from only a photograph.

Luo and King [13] developed a facial-feature-extraction algorithm to automate the process of fitting the general facial wire frame model to the actual face image. Furthermore, by introducing face orientation detection during the fitting process, the facial model construction method is robust with respect to its ability to adjust the specific facial 3D model for arbitrary orientations. However, the depth values of the facial model were adjusted in the z-direction by an amount proportional to its change in 2D image plane [13]. A weak perspective imaging process is assumed valid when depth changes on the face are small compared with the long distance between the face and the camera. This is generally a good approximation, except when viewing the face from close range with a short focal length lens, which results in significant perspective distortion in the image [35]. To adjust the 3D control points of a generic facial model to fit the face image accurately, the pose of the yawed face must be determined and depth information of facial features must be taken into account simultaneously, especially when viewing the face from close image. How to accurately modify the generic facial model to fit the specific face image from an uncalibrated face image using the perspective imaging process is investigated in the work.

Two fundamental problems which must fully cooperate with each other are the model modification method and the establishment of the generic facial model, described as follows.

In the light of the above two problems, this paper presents a GA-based optimization approach for facial modeling from an uncalibrated face image using a flexible generic parameterized facial model (FGPFM). The microstructural information can be expressed using the structural FGPFM with representative facial features that can be accurately found in the image. The reconstruction procedure can be regarded as a block function of FGPFM, and the input parameters are the 3D face-centered coordinates of control points. Once the control points are given, the desired 3D facial model is determined based on the topological and geometric descriptions of FGPFM. How to reconstruct the 3D facial model is transformed into a problem of how to acquire the accurate 3D control points.

Since the solution space is large and complex considering the large number of control points in 3D space, the proposed coarse-to-fine approach based on our intelligent genetic algorithm IGA [31] is used to efficiently solve the optimization problem. IGA is an efficient general-purposed algorithm capable of solving large parameter optimization problem. Coarse-to-fine approach can efficiently adjust control points in 3D space. The fitness function takes into account the evidence from the face image and human perception. The proposed coarse-to-fine IGA can effectively construct an optimal facial model. Merits of the proposed method are summarized as follow. (1) FGPFM is presented so that the good parameters, the control points, of FGPFM can yield the good facial model for a specific person. (2) An analytic solution for the pose determination of human faces (PDF) [32] from a monocular image is applied to obtain the initial 3D control points and make coarse-to-fine IGA more efficient. (3) The reconstruction problem is formulated as a parameter optimization problem based on the ability of FGPFM and PDF. Furthermore, the coarse-to-fine IGA is also used to speed up the search for an optimal solution which is a set of optimal control points. The facial model construction method can obtain more accurate facial models with respect to its uses of the perspective fitting process and the coarse-to-fine IGA for adjusting the 3D control points of FGPFM, compared with that of the existing ones, e.g. Ref. [13].

The rest of this paper is organized as follows. Section 2 describes how to establish a flexible generic parameterized facial model. Section 3 formulates the facial modeling problem as an optimization problem and also outlines the reconstruction procedure. Section 4 summaries the sensitivity analysis and shows the experimental results with texture mapping. Conclusions are finally made in Section 5.

Section snippets

Establishment of FGPFM

The FGPFM consists of a topological structure and geometric knowledge of human faces. The topological description consists of a set of well-designed triangular polygons with a multi-layered elastic structure in which the microstructural information can be expressed without complicated facial feature. All the geometric values are obtained from a set of training facial models using statistical approaches and genetic algorithms.

Facial modeling as an optimization problem

As widely recognized, accurate 3D control points based on FGPFM can lead to an accurate 3D facial model of a specific individual. Herein, the reconstruction problem is formulated as a parameter optimization problem as follows. $Find a set of control points V^{∗}_{1}, such that F(V^{∗}_{1})= Min . F(V_{1}),$ where V₁ is the set of control points of the FGPFM. The two major problems are:

(a)	How to construct the fitness function $F(V_{1})$ ?
(b)	How to search for the optimal solution $V^{∗}_{1}$ ?

A coarse-to-fine IGA is presented to cope with

Experimental results

In this section, five experiments using FGPFM, synthetic face images and actual face images are analyzed to demonstrate the feasibility of the proposed method. In the first experiment, a known facial model is used to verify the superiority of IGA and the effectiveness of the coarse-to-fine approach. The second experiment demonstrates the effectiveness of the proposed fitting function and the high performance of the coarse-to-fine IGA. In the third and fourth experiments, an application applies

Conclusions

This study has presented a novel genetic algorithm-based optimization approach for facial modeling from an uncalibrated monocular face image using flexible generic parameterized facial model. The proposed method, has the following features. (1) FGPFM is presented so that the good parameters, the control points, of FGPFM can yield the good facial model for a specific person. (2) An analytic solution for the pose determination of human faces (PDF) from a monocular image is applied to obtain the

About the Author—SHINN-YING HO was born in Taiwan, ROC, on March 25, 1962. He received the B.S., M.S., and Ph.D. degrees in computer science and information engineering from National Chiao Tung University, Hsinchu, Taiwan, in 1984, 1986, and 1992, respectively. He is currently an associate professor in the Department of Information Engineering at Feng Chia University, Taichung, Taiwan. His research interests include image processing, pattern recognition, virtual reality applications of computer

References (37)

K. Aizawa et al.
Model-based analysis synthesis image coding (MBASIC) system for a person's face
Signal Process.: Image Commun.
(1989)
A. Gee et al.
Determining the gaze of faces in images
Image Vision Comput.
(1994)
G. Chow et al.
Towards a system for automatic facial feature detection
Pattern Recognition
(1993)
Takaaki Akimoto et al.
Automatic creation of 3D facial models
IEEE Comput. Graphic Appl. 13
(1993)
P. Eisert et al.
Analyzing facial expressions for virtual conferencing
IEEE Comput. Graphic Appl.
(1998)
A. Lanitis et al.
Automatic interpretation and coding of face images using flexible models
IEEE Trans. Pattern Anal. Mach. Intell.
(1997)
Soo-Chang Pei
Global motion estimation in model-based image coding by tracking three-dimensional contour feature point
IEEE Trans. Circutits Systems Video Technol.
(1998)
S.Y. Lee et al.
Recognition of human front faces using knowledge-based feature extraction and neuro-fuzzy algorithm
Pattern Recognition
(1994)
Z. Mao, A.J. Naftel, Improved area-based stereo matching using an image segmentation approach for 3-D facial Imaging,...
H.S. Ip, Li-Jun Yin, Arbitrary facial views generation from two orthogonal facial images, IEEE International...

D. Terzopoulos et al.

Analysis and synthesis of facial image sequences using physical and anatomical models

IEEE Trans. Pattern Anal. Mach. Intell.

(1998)

T. Vetter

Synthesis of novel views from a single face image

Int. J. Comput. Vision

(1998)

M.J. Tones et al.

Multidimensional morphable models: a framework for representing and matching object classes

Int. J. Comput. Vision

(1998)

T.S. Huang et al.

Motion and structure from orthographic projections

IEEE Trans. Pattern Anal. Mach. Intell.

(1989)

Li-an Tang, and Thomas S. Huang, Automatic construction of 3D human face models based on 2D images, IEEE Proceedings of...

Suhuai Luo, R.W. King, Automatic human modeling in model-base facial image coding, Proceedings of Conference on...

Peter Eisert et al.

Analyzing facial expressions for virtual conferencing

IEEE Comput. Graphic Appl.

(1998)

A. Guarda, C. Le Gal, A. Lux, Evolving visual features and detectors, Computer graphics, image processing, and vision,...

Cited by (30)

Adaptive multi-resolution fitting and its application to realistic head modeling
2004, Proceedings - Geometric Modeling and Processing 2004
Plantago asiatica, Scrophularia ningpoensis and Ilex pubescens are among the traditional Chinese medicines which are more frequently prescribed for treating depression-like ailments in the past and present traditional Chinese medical practice. The present work was therefore conducted to evaluate the presumable antidepressant effects of the extracts derived from the three remedies in mice using the learned helplessness model being used for screening for antidepressant compounds in modern medicinal researches. As a result, the petroleum extracts of Plantago asiatica and Ilex pubescens as well as the EtOAc extract of Scrophularia ningpoensis and the petroleum-soluble fraction of the acidic hydrolysate of the water extract of Ilex pubescens (after petroleum extraction) decreased significantly the number of escape failures relative to the control. The finding rationalized the clinical prescription of the herbs for the treatment of depression, and shined a clue for the characterization of the antidepressant phytochemical(s).
Mesh optimization for surface approximation using an efficient coarse-to-fine evolutionary algorithm
2003, Pattern Recognition
The investigated mesh optimization problem C(N,n) for surface approximation, which is NP-hard, is to minimize the global error between a digital surface and its approximating mesh surface by efficiently locating a limited number n of grid points which are a subset of the original N sample points. This paper proposes an efficient coarse-to-fine evolutionary algorithm (CTFEA) with a novel orthogonal array crossover (OAX) for solving the mesh optimization problem. OAX adaptively divides the meshes of parents into a number of parts using a tuning parameter for applying a coarse-to-fine technique. Meshes of children are formed from an intelligent combination of the good parts from their parents rather than the conventional random combination. The better one of two parts in two parents is chosen by evaluating the contribution of the individual parts to the fitness function based on orthogonal experimental design. The coarse-to-fine technique of CTFEA can advantageously solve large mesh optimization problems. Furthermore, CTFEA using an additional inheritance technique can further efficiently locate the grid points in the mesh surface. It is shown empirically that CTFEA outperforms the existing evolutionary algorithm in terms of both approximation quality and convergence speed, especially in solving large mesh optimization problems.
Facial modeling from an uncalibrated face image using flexible generic parameterized facial models
2001, IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Toward surface defect detection in electronics manufacturing by an accurate and lightweight YOLO-style object detector
2023, Scientific Reports
Robust human detection and localization in security applications
2017, Concurrency and Computation: Practice and Experience
3D face reconstruction using images from cameras with varying parameters
2017, International Journal of Automation and Computing

View all citing articles on Scopus

About the Author—HUI-LING HUANG was born in Taiwan, ROC, on April 4, 1968. She received the B.S. degree in Mathematics from Fu Jen C.U., Csin Chuang, Taiwan, in 1992, and the M.S. degree in information engineering form Feng Chia University, Taichung, Taiwan, in 1998. She is presently working toward the Ph.D. degree in the Institute of Information Engineering at Feng Chia University, Taichung, Taiwan. Her research interests include image processing, pattern recognition, virtual reality applications of computer vision, and large parameter optimization problems.

^☆: This work was supported by the National Science Council, Taiwan, ROC, Grant No. NSC88-2213-E-035-012.

View full text

Facial modeling from an uncalibrated face image using a coarse-to-fine genetic algorithm☆

Abstract

Introduction

Section snippets

Establishment of FGPFM

Facial modeling as an optimization problem

Experimental results

Conclusions

Signal Process.: Image Commun.

Image Vision Comput.

Pattern Recognition

Automatic creation of 3D facial models

IEEE Comput. Graphic Appl. 13

Analyzing facial expressions for virtual conferencing

IEEE Comput. Graphic Appl.

Automatic interpretation and coding of face images using flexible models

IEEE Trans. Pattern Anal. Mach. Intell.

Global motion estimation in model-based image coding by tracking three-dimensional contour feature point

IEEE Trans. Circutits Systems Video Technol.

Recognition of human front faces using knowledge-based feature extraction and neuro-fuzzy algorithm

Pattern Recognition

Analysis and synthesis of facial image sequences using physical and anatomical models

IEEE Trans. Pattern Anal. Mach. Intell.

Synthesis of novel views from a single face image

Int. J. Comput. Vision

Multidimensional morphable models: a framework for representing and matching object classes

Int. J. Comput. Vision

Motion and structure from orthographic projections

IEEE Trans. Pattern Anal. Mach. Intell.

Analyzing facial expressions for virtual conferencing

IEEE Comput. Graphic Appl.