Elsevier

Pattern Recognition

Volume 40, Issue 5, May 2007, Pages 1605-1620
Pattern Recognition

Face recognition under arbitrary illumination using illuminated exemplars

https://doi.org/10.1016/j.patcog.2006.09.016Get rights and content

Abstract

Recently, the importance of face recognition has been increasingly emphasized since popular CCD cameras are distributed to various applications. However, facial images are dramatically changed by lighting variations, so that facial appearance changes caused serious performance degradation in face recognition. Many researchers have tried to overcome these illumination problems using diverse approaches, which have required a multiple registered images per person or the prior knowledge of lighting conditions. In this paper, we propose a new method for face recognition under arbitrary lighting conditions, given only a single registered image and training data under unknown illuminations. Our proposed method is based on the illuminated exemplars which are synthesized from photometric stereo images of training data. The linear combination of illuminated exemplars can represent the new face and the weighted coefficients of those illuminated exemplars are used as identity signature. We make experiments for verifying our approach and compare it with two traditional approaches. As a result, higher recognition rates are reported in these experiments using the illumination subset of Max-Planck Institute face database and Korean face database.

Introduction

Lighting is the most significant factor affecting the appearance of an object. Changes in a person's appearance induced by illumination are larger than differences in appearance of individuals. Person identification from a facial image across lighting variations is still the most challenging problem in face recognition. Fig. 1 shows the 10 illuminated images of the same individual under varying lighting conditions from Yale database [1]. In the past few years, many methods have been proposed to solve this problem with improvements in recognition. Early works in illumination-invariant face recognition focused on image representations that are mostly insensitive to changes under various lighting [2]. Various image representations are compared by measuring distances on a controlled face database. Edge map, second derivatives and 2D Gabor filters are examples of the image representations used. However, these kinds of approaches have some drawbacks. First, the different representations of image can be only extracted once they overcome some degree of illumination variations. Second, features for the person's identity are weakened whereas the illumination-invariant features are extracted.

A photometric stereo method is an approach based on the low dimensionality of the image space. The images of one object with a Lambertian surface taken from a fixed viewpoint and varying illuminations lie in a linear subspace. We can classify the new probe image by checking if it lies in the linear subspace of the registered gallery images. These gallery images are composed of at least three images of the same person under different illuminations. The main restriction in these approaches is that multiple registered images of the same person are required. Since it recognizes the new image by checking that it is spanned in a linear subspace of the multiple gallery images, it cannot handle the new images of a different person which is not included in the gallery set.

To solve the necessity of multiple gallery images, the bilinear analysis approach is proposed [3]. It applies singular value decomposition (SVD) to a variety of vision problems including identity and lighting on a collection of objects in the same class. For bilinear analysis, photometric stereo images of different people under the same set of illuminations are always required. A method based on quotient image was introduced; this method uses the basic idea of bilinear analysis [4]. Under the assumption of a Lambertian surface without shadow, quotient image is created, deduced from the ratio between a probe image and the linear combination of photometric stereo images. The quotient image should be invariant to varying illumination conditions of the input image. This method can synthesize an image under unknown lighting condition and recognize a new face by computing its quotient image. However, the assumption of a fixed shape is not valid for real faces and the quality of the quotient image is influenced by training data. A poor quality in the quotient image results when the training data is unknown or not controlled. The generalized photometric stereo method in the bilinear analysis approach has been proposed with a rank constraint on the product of albedo and surface normal [5]. The observation matrix can be factorized by a rank constraint. The main limitation of these bilinear analysis methods is that prior knowledge of the images like the lighting direction of training data is required.

Unlike the methods described above, Blanz and Vetter use 3D morphable models of a human head [6]. The 3D model is created using a database collected by Cyberware laser scanner. Both geometry and texture are linearly spanned by the training ensemble. This approach enables them to handle illumination, pose and expression variations. But it requires the external 3D model and high computational cost. For illumination-robust face recognition, we have to solve the following problem: Given a single image of a face under the arbitrary illumination, how can the same faces under different illuminations be recognized?

In this paper, we propose a new approach for solving this problem based on illuminated exemplars. The illuminated exemplars are synthesized from photometric stereo images of each person from training data and the new probe image can be represented by a linear combination of these illuminated exemplars. The weighted coefficients are estimated in this representation and can be used as the illumination-invariant identity signature. For face recognition, our proposed method has several distinct advantages over the previously proposed methods. First, the information regarding the lighting condition of training data is not required. We can synthesize the illuminated exemplars under other illumination which is not included in the training data. Second, we can perform recognition with only one gallery image by using linear analysis of illuminated exemplars in the same class. Third, the coefficients of illuminated exemplars are used as the identity signature for face recognition across variation in lighting, which results in high recognition rates.

This paper is organized as follows. Section 2 provides the results of a literature search for past studies related to this work. In Section 3, we describe how the illuminated exemplars are synthesized, how they are analyzed and what can be used as a signature identity for face recognition, and then we propose the face recognition method using the illuminated exemplars. Section 4 gives our experimental results on Max-Planck Institute face database (MPI DB) and Korean face database (KFDB). Finally, we make a conclusion regarding our approach and suggest some areas of further research in Section 5.

Section snippets

Background

We begin with a brief review of the photometric stereo method with Lambertian lighting model and bilinear analysis of illuminated training images. We will explain what the Lambertian reflectance is and how it can be used in the photometric stereo method for face recognition [7]. We will also explain recognition methods using the bilinear analysis of the training data [3], [4], [5].

Linear analysis of illuminated exemplars

We propose an illumination-invariant face recognition method based on synthesizing illuminated exemplars. We synthesize the illuminated exemplars from photometric stereo images and then we can represent a new probe image by linear combination of illuminated exemplars and the weighted coefficients offer an illumination-invariant identity signature. By computing the correlations between the set of coefficients for gallery image and that for probe image, we can determine the identity of new probe

Experiments

We have conducted a number of experiments with our approach using the MPI DB [6] and KFDB [10]. We performed experiments for face recognition using the illumination subset of the whole database. In these experiments, we compared the proposed method with eigenface/WO3 [11], [12] method and bilinear analysis method [5].

Eigenface/WO3: Eigenface is a method commonly used in face recognition. We have implemented the eigenface approach by training the eigenvector from the same training data. To solve

Conclusions and further research

We have addressed a new approach for illumination-invariant face recognition. The idea here is to synthesize illuminated exemplars using photometric stereo images and apply them to represent the new input image under the arbitrary illumination. This method only requires one input image and one registered image per person for recognition. The weighted coefficients are used as the signature identity, so that a new image can be represented as a linear combination of a small number of illuminated

Acknowledgments

This research was supported by 2005 Seoul R& BD Program. And we would like to thank the Max-planck Institute for providing the MPIDB.

About the Author—SANG-WOONG LEE received his BS degree in Electronics and Computer Engineering from Korea University, Seoul, Republic of Korea, in 1996 and his MS and PhD degrees in Computer Science and Engineering from Korea University, Seoul, Republic of Korea, in 2001 and 2006, respectively. Currently, he is a visiting researcher in Carnegie Mellon University. His research interests include computer vision, robotics, pattern recognition and related applications.

References (14)

  • A.S. Georghiades et al.

    From few to many: illumination Cone Models for face recognition under variable lighting and pose

    IEEE Trans. Pattern Anal. Mach. Intell.

    (2001)
  • Y. Adini et al.

    Face recognition: the problem of compensating for changes in illumination direction

    IEEE Trans. Pattern Anal. Mach. Intell.

    (1997)
  • W.T. Freeman, J.B. Tenenbaum, Learning bilinear models for two-factor problems in vision, in: Proceedings of the IEEE...
  • A. Shashua et al.

    The quotient image: class based re-rendering and recognition with varying illuminations

    IEEE Trans. Pattern Anal. Mach. Intell.

    (2001)
  • K. Zhou, R. Chellappa, Rank constrained recognition under unknown illuminations, in: Proceedings of the IEEE...
  • V. Blanz et al.

    Face recognition based on fitting a 3D morphable model

    IEEE Trans. Pattern Anal. Mach. Intell.

    (2003)
  • R. Basri, D. Jacobs, Photometric stereo with general, unknown lighting, in: Proceedings of the IEEE Conference on...
There are more references available in the full text version of this article.

Cited by (22)

  • Deep face recognition for dim images

    2022, Pattern Recognition
    Citation Excerpt :

    Although data augmentation may help solve the problem, it is practically infeasible to collect a dataset containing sufficiently well distributed face images under all possible illumination conditions. Since the effect of illumination variation on face recognition has to do with the three-dimensional structure of human faces, another possible solution is to construct a general 3D human face model to fit different illumination conditions and to normalize the brightness and contrast of the face image [39–48]. However, this approach is computationally inefficient since it requires the knowledge of 3D structure including albedos and physical contour.

  • Orthogonal discriminant vector for face recognition across pose

    2012, Pattern Recognition
    Citation Excerpt :

    Appearance-based method represents a w-by-h face image by a vector in the wh-dimensional image space. It is concluded that the dimensionality of the face space is too high to allow robust face recognition [11], due to the variations of pose [12], occlusion [4], and illumination [13]. A number of algorithms are proposed [37–42] to recognize face images from different poses.

  • Kernel discriminant transformation for image set-based face recognition

    2011, Pattern Recognition
    Citation Excerpt :

    In addition, multiscale facial structure [16] and local contrast enhancement [17] is also explored to recognize faces under varying illumination. Notwithstanding the contributions of [10,11,13–15,18], a single testing input provides insufficient information to guarantee a reliable recognition performance. Better performance could be obtained from sets of testing images since multiple images provide more appearance variances of the input data.

  • Face recognition based on 2D images under illumination and pose variations

    2011, Pattern Recognition Letters
    Citation Excerpt :

    Basri and Jacobs (2003) represented lighting by using spherical harmonics and described the effects of Lambertian reflectance as an analogy to convolution. Lee et al. (2007) represented a face image under arbitrary illumination using a linear combination of illuminated exemplars which were synthesized from photometric stereo images of training data. For pose variation, generic 3D shape models were used by considering a uniform face shape as a tool for transforming image pixels (Liu and Chen, 2005; Zhang et al., 2006).

  • Face recognition using string grammar fuzzy K-nearest neighbor

    2016, 2016 8th International Conference on Knowledge and Smart Technology, KST 2016
View all citing articles on Scopus

About the Author—SANG-WOONG LEE received his BS degree in Electronics and Computer Engineering from Korea University, Seoul, Republic of Korea, in 1996 and his MS and PhD degrees in Computer Science and Engineering from Korea University, Seoul, Republic of Korea, in 2001 and 2006, respectively. Currently, he is a visiting researcher in Carnegie Mellon University. His research interests include computer vision, robotics, pattern recognition and related applications.

About the Author—SONG-HYANG MOON received her BS degree in Computer Science from Sookmyung Women's University, Seoul, Republic of Korea, in 2003 and her MS degree in Computer Science and Engineering from Korea University, Seoul, Republic of Korea, in 2005. Currently, she is a researcher in LG electronics, Inc. Her research interests include face recognition, computer vision and visual communication.

About the Author—SEONG WHAN LEE received his BS degree in Computer Science and Statistics from Seoul National University, Seoul, Republic of Korea, in 1984, and his MS and PhD degrees in Computer Science from KAIST in 1986 and 1989, respectively. From February 1989 to February 1995, he was an assistant professor in the Department of Computer Science at Chungbuk National University, Cheongju, Republic of Korea. In March 1995, he joined the faculty of the Department of Computer Science and Engineering at Korea University, Seoul, Republic of Korea, as an associate professor, and he is now a full professor. He was the winner of the Annual Best Paper Award of the Korea Information Science Society in 1986. He obtained the First Outstanding Young Researcher Award at the Second International Conference on Document Analysis and Recognition in 1993, and the First Distinguished Research Professor Award from Chungbuk National University in 1994. He also obtained the Outstanding Research Award from the Korea Information Science Society in 1996. He has been the founding coeditor-in-chief of the International Journal on Document Analysis and Recognition and the associate editor of the Pattern Recognition Journal, the International Journal of Pattern Recognition and Artificial Intelligence and the International Journal of Computer Processing of Oriental Languages since 1997. He was the program cochair of the Sixth International Workshop on Frontiers in Handwriting Recognition, the Second International Conference on Multimodal Interface, the 17th International Conference on the Computer Processing of Oriental Languages, the Fifth International Conference on Document Analysis and Recognition, and the Seventh International Conference on Neural Information Processing. He was the workshop cochair of the Third International Workshop on Document Analysis Systems and the First IEEE International Workshop on Biologically Motivated Computer Vision. He served on the program committees of several well-known international conferences. He is a fellow of IAPR, a senior member of the IEEE Computer Society and a life member of the Korea Information Science Society. His research interests include pattern recognition, computer vision and neural networks. He has published more than 200 publications in these areas in international journals and conference proceedings, and has authored five books.

A preliminary version of this paper has been presented in 2005 International Conference on Audio- and Video-Based Biometric Person Authentication which was held in New York, USA, July 20–22, 2005.

View full text