Local appearance based face recognition method using block based steerable pyramid transform

doi:10.1016/j.sigpro.2010.06.005

Signal Processing

Volume 91, Issue 1, January 2011, Pages 38-50

https://doi.org/10.1016/j.sigpro.2010.06.005 Get rights and content

Abstract

In this paper, an efficient local appearance feature extraction method based on Steerable Pyramid (S-P) wavelet transform is proposed for face recognition. Local information is extracted by computing the statistics of each sub-block obtained by dividing S-P sub-bands. The obtained local features of each sub-band are combined at the feature and decision level to enhance face recognition performance. The purpose of this paper is to explore the usefulness of S-P as feature extraction method for face recognition. The proposed approach is compared with some related feature extraction methods such as principal component analysis (PCA), as well as linear discriminant analysis LDA and boosted LDA. Different multi-resolution transforms, wavelet (DWT), gabor, curvelet and contourlet, are also compared against the block-based S-P method. Experimental results on ORL, Yale, Essex and FERET face databases convince us that the proposed method provides a better representation of the class information, and obtains much higher recognition accuracies in real-world situations including changes in pose, expression and illumination.

Introduction

Among face recognition methods, the most popular are holistic appearance-based approaches such as PCA [13], LDA [46], ICA [4] and review of these methods has been presented in [21]. Subspace analysis based methods has been proposed in [47], [48], [49] in order to give an effective feature extraction in high dimensional space. These methods outperform holistic methods in recognition accuracy. Recently, there are more and more attempts to develop face recognition systems based on local features. The approach of analyzing faces locally is believed to outperform the holistic appearance-based approaches, where a local change affects only the corresponding part of the representation and does not modify the representation vector as a whole [32]. These approaches exhibit good performance and robustness in controlled environments but still do not perform well in many real-world situations, due to variations in pose, lighting and expression. In order to address this issue, many researchers propose to deploy a pre-processing step in order to capture more discriminant features for use in the recognition step, such as local binary patterns (LBP) and local discrete cosine transform [33], [34]. In [41] the authors propose to utilize the total variation (TV) mode to factorize an image in order to overcome the illumination limitation and to preserve the edge information of images. However, the TV model is able to only process images in certain scale. In addition, the TV model is an iterative type approach, thus the computational expense is very high. More recently, interest has grown in using multi-resolution methods where multiple evidences (sub-bands) from the same face are obtained allowing extraction of less sensitive features to intrinsic deformations due to expression or due to extrinsic factors, like illumination. These transforms have more edge-preserving ability than TV model in low frequency illumination fields [40]. Multi-resolution methods have been successfully used in many challenging pattern recognition applications including character recognition [2] and face recognition [1], [3], [26].

Among multi-resolution methods for face recognition, the most popular are discrete wavelet transform (DWT) [22], [23], [24], [25], [50] and Gabor wavelets [12]. These methods have proved to be very successful to capture more discriminant features of face images leading to higher performance and robustness against various challenging conditions. However, wavelet-based features are not suitable for face recognition in uncontrolled environments since images do not always exhibit isotropic scaling (horizontal, vertical and diagonal). In [40], authors treated a particular problem of illumination invariance. Since illumination is represented by a convolution, it can be avoided in log-domain by wavelet denoising technique. Their method claimed to be robust in different illumination conditions leading to the best results in Yale B face database. Others have shown that Gabor filters can attain good results in many face recognition applications. However, the use of Gabor filters dramatically increase the computational cost of the face recognition method, requiring that each kernel is convolved with the input image [12]. Contourlet [9], curvelet transforms [11], and steerable pyramid are another multi-resolution transforms similar to the two-dimensional DWT, but with interesting translation and rotation-invariance properties [5]. The steerable pyramid (S-P) is a linear multi-scale, multi-orientation image decomposition which has been developed to overcome the wavelet limitations. Though steerable pyramids provide more scale and orientation than wavelets. The curvelet transform captures curves instead of points as in S-P transform in the continuous domain. Contourlets are an extension of curvelets, which can be approximated in the discrete domain. Contourlets, however, are defined and derived in the discrete domain from the beginning. They both allow for directionality and anisotropy [42].

Applications of Contourlet to face recognition have been investigated in work presented by Boukabou et al. [3]. Authors propose to employ contourlet with PCA in order to extract discriminant features and to obtain higher recognition rates. They have evaluated the proposed method on two different databases (Yale and FERET Database) and stated that the contourlet transform outperforms the original PCA method. More experiments have to be performed on large database and many comparisons against well established existing techniques must be done to assess this result. Mandal et al. [1] propose curvelet based face recognition system by fusing results from multiple SVM classifiers trained with curvelets coefficients from images having different gray scale resolutions (2, 4 and 8 bits). However, this algorithm is computationally expensive since it requires taking the curvelet transform of the original image and its quantized representations. In [26], [27], curvelet transform is introduced in conjunction with different dimensionality reduction tools. These techniques appear to be robust to the changes in facial expression as they show good results for the Essex and the ORL database, but still do not perform well in YALE database that contains images with great variations in illumination and facial expression.

According to previous review on local based approaches, which have proven to be robust to most face recognition challenges comparing to global based approaches, we have proposed a face recognition method based on local presentation of curvelet transform [28]. Curvelet transform is applied to the face image and each of the resulting sub-bands is partitioned into a set of equally sized blocks in a non-overlapping way. Then the statistical measures (mean, variance and entropy) of the energy distribution of the curvelet coefficients for each block in each sub-band at each decomposition level is used to construct the feature vector. Then, we used curvelet transform as an improvement tool of LDA in [45]. The proposed methods have been evaluated on Yale, ORL and FERET databases and have shown better recognition accuracy in comparison to holistic approaches. However, local-based approaches cannot be applied to this technique because of the small sub-band size.

To solve all these mentioned problems, the steerable pyramid can be employed to produce any number of orientation bands. In addition, it conserves the same image resolution in the first scale level which is more adequate for local appearance based approaches. Several studies have investigated the discriminating power of steerable pyramid-based features in various applications including: image denoising [10], textures classification [12], image processing [30], [5] and face hallucination [29]. In [44], the S-P method has been proposed in conjunction with LBP of each sub-band in order to extract a local information of face images. This work gave promised results to investigate extensively the use of S-P transform both in global and local appearance, and feature/score fusion which is the subject of the present paper.

In this paper, we present a face recognition approach based on steerable pyramid decomposition. Following our previous study [43], the main contribution of this paper is to fully investigate the usefulness of steerable pyramids transform in a face recognition framework. Each face image is described by a subset of band filtered images containing steerable pyramid coefficients which characterize the face textures. We divide the S-P sub-bands into small sub-blocks, from which we extract compact and meaningful feature vectors using mean, variance and entropy. We conceive an experiment framework specifically to investigate the improvement in robustness against illumination and facial expression changes. We discuss the important problem of fusing the local observations that utilizes multiple sub-bands. Then, we investigate fusion schemes both at the feature and decision levels. Finally, we show how an efficient and reliable probabilistic metric can be used in order to classify the face feature vectors into person classes. Experimental results are presented using images from the FERET, ORL, ESSEX, YALE and YALE B databases. The efficiency of our approach is firstly analyzed by comparing the results with those obtained using multi-resolution methods such as wavelet, Gabor, contourlet and curvelet. Secondly it is compared to the best given results in the literature and has shown better recognition performance.

The remainder of the paper is organized as follows. In Section 2, face feature extraction based steerable pyramid transform is introduced. Block based S-P face identification proposed method is given in Section 3. In Section 4, three fusion schemes used in the study are explained, namely, data fusion, feature fusion, and decision fusion. Experimental results are presented and discussed in Section 5. Finally, in Section 6, conclusions and future recommendations are given.

Section snippets

Face feature extraction based steerable pyramid transform

A face image of a person contains similarity (approximation) information of the face as well as discriminatory (detail) information with respect to faces of all other persons. The discriminatory information is due to structural variations of the face which are acquired as intensity variations at different locations of the face. The location and degree of intensity variations in a face for an individual are unique features which discriminate one person from the rest of the population. Steerable

Steerable pyramid feature extraction methods

The S-P transforms can be used for feature extraction in two different ways:

S-P features and subspace analysis

This section presents work that utilizes S-P features and subspace analysis for face identification. Once features are extracted, subspace analysis could be applied for further class separability enhancement and feature dimension reduction.

Fig. 3 shows a flow chart demonstrating the use of S-P features and subspace analysis for face recognition. Initially a set of eight S-P filters are used to extract appropriate features, which are then induced to PCA or LDA. The S-P features extracted from a

Steerable pyramid fusion schemes

The main idea behind using the S-P analysis is firstly, to obtain multiple evidences (sub-band) from the same face, and search for those sub-bands that are less sensitive to intrinsic deformations due to expression or due to extrinsic factors, like illumination. Secondly, fuse the local observations that utilizes multiple sub-bands. Despite the fact that at first sight, these sub-bands can appear somewhat redundant and may contain less information, their prudent combination can prove often to

Experimental results

Four separate experiments are conducted to test the advantage of the S-P face recognition scheme. In the first experiment, we employ traditional transforms (PCA, LDA) to enhance and extract discriminative features in all sub-bands. While in the second experiment the sub-bands that are potentially insensitive to changes in expression and variations in illumination are searched. Whereas in the third experiment, the fusion of the best performing sub-bands is investigated. Finally a comparative

Conclusions and future works

The main contribution of this paper is to investigate a new approach using steerable pyramid coefficients to address the problem of human face recognition from still images. For each face image, S-P is performed to compute different sub-bands from which some statistical measures are extracted using a block-based technique. This is the first time steerable pyramid transform is being explored in face recognition application. In the case of ORL, YALE and ESSEX we have almost obtain the best

References (47)

T. Mandal et al.
Curvelet based face recognition via dimension reduction
Signal Processing
(2009)
T. Zhang et al.
Multiscale facial structure representation for face recognition under varying illumination
Pattern Recognition
(2009)
T. Mandal, A. Majumdar, Q.M.J. Wu, Face recognition by curvelet based feature extraction, in: ICIAR 2007, Lecture Notes...
A. Majumdar
Bangla basic character recognition using digital curvelet transform
Journal of Pattern Recognition Research JPRR
(2007)
W.R. Boukabou, A. Bouridane, Contourlet-based feature extraction with PCA for face recognition, in: NASA/ESA Conference...
M.S. Bartlett et al.
Independent component representations for face recognition
E.P. Simoncelli, A rotation-invariant pattern signature, in: Third IEEE International Conference on Image Processing,...
A.N. Belbachir, P.M. Goebel, The contourlet transform for image compression, in: Physics in Signal and Image...
A. Li, X. Li, S. Wang, H. Li, A multiscale and multidirectional image denoising algorithm based on contourlet...
E.J. Candes, D.L. Donoho, Curvelets: a surprisingly effective nonadaptive representation for objects with edges,...

S. Li et al.

Comparison and fusion of multiresolution features for texture classification

Pattern Recognition Letters

(2002)

M. Turk et al.

Eigenfaces for recognition

Journal of Cognitive Neuroscience

(1991)

W.T. Freeman et al.

The design and use of steerable filters

IEEE Transactions on Pattern Analysis and Machine Intelligence

(1991)

P.J. Phillips et al.

The FERET evaluation methodology for face recognition algorithms

IEEE Transactions on Pattern Analysis and Machine Intelligence

(2000)

G.C. Feng et al.

Human face recognition using pca on wavelet subband

Journal of Electronic Imaging

(2000)

J.T. Chien et al.

Discriminant waveletfaces and nearest feature classifiers for face recognition

IEEE Transactions on Pattern Analysis and Machine Intelligence

(2002)

M. Zhao, P. Li, Z. Liu, Face recognition based on wavelet transform weighted modular pca, in: Proceedings Congress in...

B.-L. Zhang et al.

Face recognition by applying wavelet subband representation and kernel associative memory

IEEE Transaction on Neural networks

(2004)

W. Zhao et al.

Face recognition: a literature survey

ACM Computing Survey

(2003)

E. Stollnitz et al.

Wavelets for computer graphics: a primer part 1

IEEE Computer Graphics and Application

(1995)

C. Mulcahy

Image compression using the Haar wavelet transform

Spelman Science & Mathematics Journal

(1997)

R.A. DeVore et al.

Image compression through wavelet transform coding

IEEE Transactions on Information Theory

(1992)

C. Garcia, G. Zikos, G. Tziritas, A wavelet-based framework for face recognition, in: Proceedings of the Workshop on...

Cited by (47)

Local appearance-based face recognition using adaptive directional wavelet transform
2019, Journal of King Saud University - Computer and Information Sciences
Citation Excerpt :
Additionally, the LDA not only provides the dimensionality reduction but also provides the discriminant features from LL subband. All these MRA face recognition methods (GFC, curvelet, ridgelet, contourlet, and LGBP) require multiple orientations and scales to describe the face feature vectors effectively (El Aroussi et al., 2011). They have a weakness in terms of high computational costs due to consideration of the large size of feature vectors for achieving a better performance.
The latest research has shown that adaptive directional wavelet transform can constitute edges and textures in images efficiently due to the adaptive directional selectivity. This paper is primarily focused on the application of adaptive directional wavelet transform in conjunction with linear discriminant analysis (LDA) for capturing the discriminant directional multiresolution facial features. The intention of this paper is to explore the efficacy of adaptive directional wavelet transform in facial feature extraction and to offer a stepping stone for further research in this direction. The proposed approach is compared with existing subspace and local descriptor feature extraction methods. A performance comparison is also demonstrated with existing non-adaptive multiresolution analysis methods such as discrete wavelet transform (DWT), Gabor wavelet transform (GWT), curvelets, ridgelets, contourlets, and local Gabor binary pattern. Evaluation of the proposed approach on famous databases such as ORL, Essex Grimace, Yale, and Sterling face convinces the effectiveness of the adaptive directional wavelet transform based subspace features.
Mixed neighborhood topology cross decoded patterns for image-based face recognition
2018, Expert Systems with Applications
Citation Excerpt :
The holistic face representation approaches extract features from whole face images. A wide variety of holistic methods for face recognition have been proposed in the literature, including approaches based on independent component analysis (ICA) method (Secchi, Vantini, & Zanini, 2016), Zernike moments method (Bereta, Pedrycz, & Reformat, 2013; Singh, Mittal, & Walia, 2011), global Gabor-Zernike feature descriptor (Fathi, Alirezazadeh, & Abdali-Mohammadi, 2016), eigenface method or principal components analysis (PCA) method (Cavalcanti, Ren, & Pereira, 2013; Wen, He, & Shi, 2012); different transforms such as wavelet sub-bands (Huang, Li, Shang, Wang, & Zhang, 2015a; Huang, Li, Wang, & Zhang, 2015b), Gabor filters (Abhishree, Latha, Manikantan, & Ramachandran, 2015; Yu, He, & Cao, 2010), optimal matrix factorization (Guan, Tao, Luo, & Yuan, 2012) and steerable pyramid transform (El Aroussi, El Hassouni, Ghouzali, Rziza, & Aboutajdine, 2011); Fisherface method which uses linear discriminant analysis (LDA) technique (Lu, Jin, & Zou, 2012a; Lu, Zou, & Wang, 2012b; Wen et al., 2012), etc. Despite some merits like low processing time for both feature extraction and computing similarity, holistic features come with the following limitations and disadvantages: i) they usually ignore local details (Yang, Wang, & Zhang, 2016a), ii) they may be oversensitive to pixel’s location and consequently fail to identify important visual characteristics (Datta, Joshi, Li, & Wang, 2008; Halawani, Teynor, Setia, Brunner, & Burkhardt, 2006) and iii) their performance degrades significantly under pose and illumination variations (Patel, Maheshwari, & Raman, 2016; Tolba, El-Baz, & El-Harby, 2006).
Face recognition becomes an important task performed routinely in our daily lives. This application is encouraged by the wide availability of powerful and low-cost desktop and embedded computing systems, while the need comes from the integration in too much real world systems including biometric authentication, surveillance, human-computer interaction, and multimedia management. Moreover, face recognition technology is now adopted in new intelligent systems and devices like smart-phones, which impose some constraints related to the complexity and execution time of the recognition process. This fact brings new challenges and gives much more area to extend the ongoing researches. This research field experienced the development of many methods and architectures aiming at producing face recognition systems which are efficient in terms of precision, robustness and computation time. In the same context, this article proposes a new feature descriptor referred to as Mixed Neighborhood Topology Cross Decoded Patterns (MNTCDP) as an effective face descriptor, The proposed handcrafted descriptor fulfills the needs of current face recognition applications and can be integrated in different platforms, requiring simple, robust and computationally low algorithms. Instead of heuristic code constructions, MNTCDP is built using new neighborhood topology and new pattern encoding scheme, which have high ability to extract discriminative and stable face representation. The adopted face recognition system consists of three stages: (1) face detection and alignment to normalize the input images to a common form if needed; (2) feature extraction using the proposed MNTCDP descriptor and (3) face recognition through a supervised image classification task using the simple K-Nearest Neighbors classifier. Simulated experiments on ORL, YALE, Extended Yale B, FERET and AR datasets acquired under different illumination conditions or facial expressions show that the proposed MNTCDP descriptor presents high performance ability in classifying face images. MNTCDP demonstrates superior performance than a large number of recent state-of-the-art LBP variants and deep learning methods, as well as recent most promising works of the literature.
A modified technique for face recognition under degraded conditions
2018, Journal of Visual Communication and Image Representation
In this paper an improved face recognition algorithm under degrading conditions is proposed. The proposed algorithm uses a combination of preprocessing techniques coupled with discriminative feature extractors to obtain the best distinctive features for classification. Preprocessing approach is the fusion of multi-scale Weber and enhanced complex wavelet transform. Combination of multiple feature extraction based on Gabor filters, block-based local phase quantization (LPQ) coupled with principal component analysis (PCA) proved to be very effective to improve correct rate of recognition. We have also used two known classifiers, extreme learning machine (ELM), and sparse classifier (SC), and fused their outputs to obtain best recognition rate. Experimental results show improved performance of proposed algorithm under poor illumination, partial occlusion and low-quality images in uncontrolled conditions. Our best recognition results using second version of face recognition grand challenge (FRGC 2.0.4) which is the most challenging database, indicated more than 28% improvement over previous works.
Steerable pyramid transform and local binary pattern based robust face recognition for e-health secured login
2016, Computers and Electrical Engineering
Citation Excerpt :
SPT was used in several applications of image processing, for example, image denoising [14], forgery detection [15], and texture classification [7]. It has also been investigated in the face recognition system [16]; however, it was not fully explored there. The contributions of this work are (i) the development of an SPT-LBP based face recognition system, (ii) a thorough investigation of different subbands of the SPT towards the recognition of face, and (iii) a selection of subbands that achieve optimum results.
This paper proposes a face recognition system based on a steerable pyramid transform (SPT) and local binary pattern (LBP) for e-Health secured login. In an e-Health framework, patients are sometimes unable to identify themselves by traditional login modalities such as username and password. Automatic face recognition can replace the conventional login modalities if the recognition system is robust. In the proposed system, SPT can decompose a face image into several subbands of different scales and orientations, and LBP can encode the subbands in binary texture pattern. Therefore, SPT-LBP scheme represents a face image in a robust way that includes multiple information sources from different scales and orientations. The proposed system is evaluated on the facial recognition technology (FERET) database. According to the results, the proposed system achieves 99.28% recognition in fb set, 80.17% in dup I set, and 79.54% in dup II set.
A new Global-Gabor-Zernike feature descriptor and its application to face recognition
2016, Journal of Visual Communication and Image Representation
Citation Excerpt :
In the global methods, features are extracted from all over the facial image and there is less sensitivity to noise [3]. Some of the most important global feature extraction techniques are: the eigenface method or principal components analysis (PCA) method which is applied to the face as a whole [4,5], the Fisherface method which uses linear discriminant analysis (LDA) technique [1,5,6], the methods based on different transforms such as Gabor filters [7,8], wavelet sub-bands [9,10], steerable pyramid transform [11] and optimal matrix factorization [12], the Zernike moments method [3,13], and the independent component analysis (ICA) method [14]. The PCA, LDA, and ICA methods use statistical techniques to obtain the distribution function of data and also to reduce the dimension of feature vector.
Face recognition is an important subject in computer vision and authentication systems. Feature extraction is one of the main steps in the face recognition systems, which greatly affects recognition accuracy. In the most of the existing methods, only local features in the facial area are extracted and employed in recognizing the person’s face. In this article, at first a novel multi-scale and rotation invariant global feature descriptor is introduced by applying the Zernike moment on the outputs of Gabor filters. Then the proposed global feature along with an efficient local feature, the histogram of oriented gradient (HOG), is employed to propose a new face recognition system. The proposed system was tested on three famous face recognition databases, namely ORL, Yale and AR and face recognition rates of 98%, 97.8% and 97.1% were obtained respectively. These rates are higher than other state-of-the-art methods.
Developing a late fusion of multi facial components for facial recognition with a voting method and global weights
2023, International Journal of Computational Vision and Robotics

View all citing articles on Scopus

View full text

Local appearance based face recognition method using block based steerable pyramid transform

Abstract

Introduction

Section snippets

Face feature extraction based steerable pyramid transform

Steerable pyramid feature extraction methods

S-P features and subspace analysis

Steerable pyramid fusion schemes

Experimental results

Conclusions and future works

Signal Processing

Pattern Recognition

Bangla basic character recognition using digital curvelet transform

Journal of Pattern Recognition Research JPRR

Independent component representations for face recognition

Comparison and fusion of multiresolution features for texture classification

Pattern Recognition Letters

Eigenfaces for recognition

Journal of Cognitive Neuroscience

The design and use of steerable filters

IEEE Transactions on Pattern Analysis and Machine Intelligence

The FERET evaluation methodology for face recognition algorithms

IEEE Transactions on Pattern Analysis and Machine Intelligence

Human face recognition using pca on wavelet subband

Journal of Electronic Imaging

Discriminant waveletfaces and nearest feature classifiers for face recognition

IEEE Transactions on Pattern Analysis and Machine Intelligence

Face recognition by applying wavelet subband representation and kernel associative memory

IEEE Transaction on Neural networks

Face recognition: a literature survey

ACM Computing Survey

Wavelets for computer graphics: a primer part 1

IEEE Computer Graphics and Application

Image compression using the Haar wavelet transform

Spelman Science & Mathematics Journal

Image compression through wavelet transform coding

IEEE Transactions on Information Theory