Locality Regularization Embedding for face verification
Introduction
Face verification has been researched for the past few decades; however, how to design a reliable dimensionality reduction technique remains an open problem. Several issues arise from a pattern recognition point of view. For instance, facial data often resides in a high-dimensional space while only limited training samples are available, so using the raw data directly leads to poor performance due to the curse of dimensionality [1]. Furthermore, high-dimensional data usually contains redundant information and noise. Hence, a dimensionality reduction technique is required to break the curse while extracting useful features.
In general, there are two important concerns in designing a dimensionality reduction technique for face verification: (1) How to effectively exploit the limited available training samples? (2) How to seek the most discriminative facial feature representation? Conventionally, the most popular techniques are Principal Component Analysis (PCA) [2] and Linear Discriminant Analysis (LDA) [3]. PCA projects face samples onto the linear directions of maximal variance. Unlike PCA, which is unsupervised, LDA is a supervised method that utilizes the available class-specific information: it seeks a linear projection that is optimal for data discrimination. Both linear subspace techniques have demonstrated fairly good performance under strictly controlled environments.
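The contrast between the two criteria can be made concrete on synthetic data. The sketch below (illustrative only; the data and parameters are assumptions, not from the paper) builds two classes whose largest variance lies along one axis while the class separation lies along another: PCA picks the high-variance direction, whereas the two-class Fisher/LDA direction $S_w^{-1}(m_1 - m_2)$ picks the discriminative one.

```python
import numpy as np

rng = np.random.default_rng(0)

# Two synthetic classes in 2-D: large shared variance along x,
# but the class separation lies along y.
n = 200
X1 = rng.normal([0.0, -1.0], [3.0, 0.3], size=(n, 2))
X2 = rng.normal([0.0, 1.0], [3.0, 0.3], size=(n, 2))
X = np.vstack([X1, X2])

# PCA: leading eigenvector of the total covariance (maximal variance).
Xc = X - X.mean(axis=0)
evals, evecs = np.linalg.eigh(Xc.T @ Xc / len(X))
w_pca = evecs[:, -1]                     # direction of largest variance

# Two-class Fisher/LDA direction: w ∝ Sw^{-1} (m1 - m2).
m1, m2 = X1.mean(axis=0), X2.mean(axis=0)
Sw = np.cov(X1.T) * (n - 1) + np.cov(X2.T) * (n - 1)  # within-class scatter
w_lda = np.linalg.solve(Sw, m1 - m2)
w_lda /= np.linalg.norm(w_lda)

# PCA follows the high-variance x axis; LDA follows the discriminative y axis.
print(abs(w_pca[0]) > abs(w_pca[1]))   # True
print(abs(w_lda[1]) > abs(w_lda[0]))   # True
```

The example makes the point of concern (2) above: a direction of maximal variance is not necessarily a direction of maximal discrimination.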
The Graph Embedding Framework (GEF) was proposed to unify several well-known dimensionality reduction techniques, and it provides insight for designing new dimensionality reduction methods [4]. Instances include Neighbourhood Preserving Embedding (NPE) [5], Locality Preserving Projection (LPP) [6] and Marginal Fisher Analysis (MFA) [4]. In general, GEF seeks an embedded low-dimensional manifold based on data similarities encoded in an affinity graph [4].
In this section, a brief account of GEF is given. In GEF, each data point is represented as a vertex of a graph. Conforming to a manifold-preserving criterion, graph embedding transforms each vertex into a low-dimensional representation that best preserves the similarities between vertex pairs [7]. The similarity is quantified by a similarity matrix of a locality graph that depicts certain geometrical properties of the data set.
Let $X = \{x_1, x_2, \ldots, x_n\}$, with $x_i \in \mathbb{R}^D$, be a set of $n$ $D$-dimensional data points, and let $G = \{X, W\}$ be a weighted graph with vertex set $X$ and weight matrix $W \in \mathbb{R}^{n \times n}$. Each element $W_{ij}$ in $W$ signifies the similarity of the vertex pair $(x_i, x_j)$ [4]. $W$ can be formulated based on different similarity criteria, such as prior class information in supervised learning algorithms [4], local neighborhood coefficients [5] and Gaussian similarity [6]. In brief, different definitions of the graph $G$ correspond to different graph embedding algorithms.
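Two of the cited similarity criteria can be sketched directly. The snippet below (an illustrative sketch; the helper names and the toy data are assumptions) builds a Gaussian heat-kernel affinity on a $k$-nearest-neighbour graph in the style of LPP [6], and a binary same-class affinity in the style of supervised graphs [4]:

```python
import numpy as np

def gaussian_affinity(X, sigma=1.0, k=3):
    """Gaussian (heat-kernel) weights on a k-nearest-neighbour graph, as in LPP."""
    n = len(X)
    d2 = ((X[:, None, :] - X[None, :, :]) ** 2).sum(-1)   # pairwise squared distances
    W = np.zeros((n, n))
    for i in range(n):
        nbrs = np.argsort(d2[i])[1:k + 1]                 # skip self (distance 0)
        W[i, nbrs] = np.exp(-d2[i, nbrs] / (2 * sigma ** 2))
    return np.maximum(W, W.T)                             # symmetrize

def supervised_affinity(labels):
    """Binary weights from class labels: W_ij = 1 iff same class, 0 otherwise."""
    labels = np.asarray(labels)
    W = (labels[:, None] == labels[None, :]).astype(float)
    np.fill_diagonal(W, 0.0)
    return W

# Four 1-D points forming two tight pairs.
X = np.array([[0.0], [0.1], [5.0], [5.1]])
print(gaussian_affinity(X, sigma=1.0, k=1).round(3))
print(supervised_affinity([0, 0, 1, 1]))
```

Swapping one affinity for the other, while keeping everything else fixed, is exactly how the framework instantiates different algorithms.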
For simplicity, the one-dimensional case is considered, and the low-dimensional representations of the vertices are collected in a vector $y = [y_1, y_2, \ldots, y_n]^T$, where $y_i$ is the low-dimensional representation of vertex $x_i$. The target of the mapping is to keep similar vertices as close as possible to each other via a locality preserving criterion, given as
$$y^* = \arg\min_{y^T B y = c} \sum_{i \neq j} (y_i - y_j)^2 W_{ij},$$
where $c$ is a constant and $B$ is a constraint matrix for avoiding a trivial solution.
With some simple algebraic manipulations, we obtain
$$\frac{1}{2} \sum_{i \neq j} (y_i - y_j)^2 W_{ij} = y^T L y,$$
where $L = D - W$ is the Laplacian matrix and $D$ is a diagonal matrix defined as $D_{ii} = \sum_j W_{ij}$.
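The identity above can be verified numerically on a random symmetric affinity matrix, a minimal sanity check rather than anything specific to the paper:

```python
import numpy as np

rng = np.random.default_rng(1)
n = 6
W = rng.random((n, n))
W = (W + W.T) / 2                # symmetric affinity
np.fill_diagonal(W, 0.0)

D = np.diag(W.sum(axis=1))       # degree matrix, D_ii = sum_j W_ij
L = D - W                        # graph Laplacian
y = rng.standard_normal(n)

# (1/2) * sum_ij (y_i - y_j)^2 W_ij  ==  y^T L y
lhs = 0.5 * sum((y[i] - y[j]) ** 2 * W[i, j]
                for i in range(n) for j in range(n))
rhs = y @ L @ y
print(np.isclose(lhs, rhs))      # True
```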
Hence, the above minimization problem can be reformulated to
$$y^* = \arg\min_{y^T B y = c} y^T L y.$$
In general, $B$ is a diagonal matrix for scale normalization, typically $B = D$, and $c$ is set to 1 to eliminate an arbitrary scaling factor. Since $y^T L y = y^T D y - y^T W y$, the above optimization problem is equal to
$$y^* = \arg\max_{y^T D y = 1} y^T W y.$$
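Both formulations reduce to a generalized eigenvalue problem, and their equivalence can be checked directly with `scipy.linalg.eigh` (a numerical sketch on random data, not code from the paper):

```python
import numpy as np
from scipy.linalg import eigh

rng = np.random.default_rng(2)
n = 8
W = rng.random((n, n))
W = (W + W.T) / 2
np.fill_diagonal(W, 0.0)
D = np.diag(W.sum(axis=1))
L = D - W

# min y^T L y  s.t.  y^T D y = 1  ->  smallest generalized eigenpair of (L, D).
evals_l, evecs_l = eigh(L, D)
y_min = evecs_l[:, 0]

# max y^T W y  s.t.  y^T D y = 1  ->  largest generalized eigenpair of (W, D).
evals_w, evecs_w = eigh(W, D)
y_max = evecs_w[:, -1]

# L y = lam D y  <=>  W y = (1 - lam) D y, so both give the same
# embedding vector up to sign.
print(np.allclose(np.abs(y_min), np.abs(y_max)))   # True
```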
Graph embedding is chiefly concerned with truthful data representation. A reliable data representation is undoubtedly important; however, discriminative features play a more crucial role in pattern recognition. The discriminating capability of these embeddings can be explicitly boosted through discriminant criteria such as the Fisher criterion [3] and the Maximum Margin criterion [8]. This enhanced treatment is known as discriminant graph embedding.
As mentioned previously, a higher degree of locality preservation leads to better class discrimination. Hence, various regularization methods [9], [10], [11], [12], [13] have been proposed to address this requirement.
Summaries of some well-known graph embedding techniques, including criterion function, characteristics and limitations, are tabulated in Table 1, Table 2, Table 3 which correspond to unsupervised, supervised and regularized techniques respectively.
Unsupervised techniques learn a good representation from unlabeled training samples. Since there is no proper guidance, the performance of unsupervised techniques is not as pleasing as that of their supervised and regularized counterparts. Supervised techniques, on the other hand, leverage foreknowledge of class labels for learning; hence, these techniques are generally superior as far as the recognition task is concerned. As another alternative, regularized techniques are meant to relieve ill-posed problems caused by numerical instability or improper model fitting and parameter estimation [9], [12], [13], [15], [16], [17].
The Graph Embedding Framework (GEF) attempts to maximally preserve data locality after embedding, so that the embedded samples remain in proximity. GEF could achieve this ultimate goal provided that population information were available, which is unrealistic. In practice, data locality is estimated from finite and noisy training samples. The estimate can be severely biased if the training samples poorly reflect the population, which in turn degrades the projection function and leads to deterioration in recognition performance.
Jiang [16], [17] performed a thorough analysis of how to reliably restore the population statistics using regularization. The author also proposed several solutions, such as regulating the eigenvalues of the covariance matrix with a piece-wise weighting function, a probabilistic subspace learning approach, and eigenfeature regularization and model fitting. The works in [10], [11], [12], [13], [15] are instances of these solutions.
Inspired by [17], a graph embedding regularization technique is proposed in this paper. The technique, dubbed Locality Regularization Embedding (LRE), adopts a local Laplacian matrix to restore data locality. Even though both LRE and RLPDE [15] use a local Laplacian matrix for data locality regularization, LRE is the more general method in that it considers various local Laplacian matrices, whereas RLPDE is one specific instantiation of LRE that applies a particular local Laplacian matrix.
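The regularization idea inherited from [16], [17] can be illustrated in a simplified form. The sketch below is not the authors' LRE; it is a generic eigenvalue-flooring scheme (function name and parameters are assumptions) showing why eigenspectrum regularization matters in the small-sample setting: with fewer samples than dimensions, a scatter-type matrix is rank deficient, and clipping its unreliable small eigenvalues restores invertibility.

```python
import numpy as np

def regularize_eigenspectrum(S, floor_ratio=1e-3):
    """Illustrative eigenspectrum regularization (not the paper's exact LRE):
    eigenvalues of a scatter/Laplacian-type matrix below a floor are replaced,
    stabilizing subsequent inversion under limited training samples."""
    evals, evecs = np.linalg.eigh(S)
    floor = floor_ratio * evals.max()
    evals_reg = np.maximum(evals, floor)     # clip unreliable small eigenvalues
    return evecs @ np.diag(evals_reg) @ evecs.T

# Rank-deficient scatter: 5 samples in 10 dimensions.
rng = np.random.default_rng(3)
X = rng.standard_normal((5, 10))
S = X.T @ X / 5
S_reg = regularize_eigenspectrum(S)

print(np.linalg.matrix_rank(S) < 10)         # True: singular before
print(np.linalg.matrix_rank(S_reg) == 10)    # True: full rank after
```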
The main contributions of this work include (1) efficient feature extraction methods that employ regularization for better locality preserving, leading to better discrimination, and (2) a theoretical analysis on the effectiveness of LRE on data discrimination.
The robustness of the proposed techniques is examined thoroughly with five publicly available face databases: CMU Pose, Illumination, and Expression (CMU PIE) [18], Facial Recognition Technology (FERET) [19], ORL Database of Faces (ORL) [20], Yale Face Database B (YaleB) [21] and Face Recognition Grand Challenge (FRGC) [22]. Armed with the Nemenyi post-hoc statistical significance test, the effectiveness of the proposed techniques in face verification is attested.
Section snippets
The locality preserving in graph embedding
In graph embedding, data are distributed on an underlying manifold $\mathcal{M}$. Suppose that there is a map $f: \mathcal{M} \to \mathbb{R}$. The gradient of $f$ from the manifold space to $\mathbb{R}$ is denoted as $\nabla f$. For small $\delta x$ [7],
$$|f(x + \delta x) - f(x)| \approx \|\nabla f(x)\| \, \|\delta x\|.$$
From Eq. (6), it is noticed that data points near $x$ are mapped to data points near $f(x)$ if $\|\nabla f(x)\|$ is small. Belkin and Niyogi [23] defined
$$\int_{\mathcal{M}} \|\nabla f(x)\|^2 \, dx$$
as a metric that measures locality preservation on average over $\mathcal{M}$. The above equation can be
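The approximation above can be checked numerically on two hand-picked linear maps (an illustrative sketch; the functions and the perturbation are assumptions, not from the paper): the map with the smaller gradient norm moves nearby points by less, i.e., it preserves locality better.

```python
import numpy as np

# |f(x+dx) - f(x)| ~ ||grad f(x)|| * ||dx|| for small dx: a map with a small
# gradient norm keeps nearby inputs nearby after mapping.
f_smooth = lambda x: 0.1 * x[0] + 0.1 * x[1]    # ||grad f|| ~ 0.141
f_steep = lambda x: 10.0 * x[0] - 5.0 * x[1]    # ||grad f|| ~ 11.18

x = np.array([1.0, 2.0])
dx = 1e-3 * np.array([0.6, 0.8])                # small perturbation, ||dx|| = 1e-3

gap_smooth = abs(f_smooth(x + dx) - f_smooth(x))
gap_steep = abs(f_steep(x + dx) - f_steep(x))
print(gap_smooth < gap_steep)                   # True: smaller gradient, better locality
```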
Databases
The experiments are conducted by using five publicly available face datasets, namely CMU Pose, Illumination, and Expression (CMU PIE), Facial Recognition Technology (FERET), ORL Database of Faces (ORL), Yale Face Database B (YaleB) and Face Recognition Grand Challenge (FRGC).
CMU PIE database was acquired by using 13 synchronized cameras and an array of flashes as light sources. These camera flashes were placed in specific positions relative to the subject. They were triggered with a very short
Conclusion
Graph embedding techniques attempt to produce a projection with high data locality for better recognition performance. However, under the scenario of limited training samples, the estimate of the population (true) data locality can be severely biased. The biased estimate triggers an overfitting problem, resulting in poor generalization. A locality regularization of graph embedding is studied. Manipulation of a local Laplacian matrix is performed to approach the true data locality for better data manifold
Conflict of interest
None declared.
Acknowledgment
This research was supported by UM-MMU Collaboration and Fundamental Research Grant Scheme – FRGS (#MMUE/140020) and Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Science, ICT and Future Planning (2013006574).
Pang Ying Han received her B.E. degree in Electronic Engineering in 2002, M.E. degree in 2005 and Ph.D. degree in 2013 from the Multimedia University, Malaysia. Her research interests include face recognition, manifold learning, dimensionality reduction, image processing and pattern recognition.
References (31)
- et al., Regularized locality preserving discriminant embedding for face recognition, Neurocomputing (2012)
- et al., Neighbourhood preserving discriminant embedding in face recognition, J. Vis. Commun. Image Represent. (2009)
- et al., An online AUC formulation for binary classification, Pattern Recognit. (2012)
- Introduction to Statistical Pattern Recognition (1990)
- et al., Eigenfaces for recognition, J. Cogn. Neurosci. (1991)
- et al., Eigenfaces vs. Fisherfaces: recognition using class specific linear projection, IEEE Trans. Pattern Anal. Mach. Intell. (1997)
- et al., Graph embedding and extensions: a general framework for dimensionality reduction, IEEE Trans. Pattern Anal. Mach. Intell. (2007)
- X. He, D. Cai, S. Yan, H.J. Zhang, Neighborhood preserving embedding, in: Proceedings of the Tenth IEEE International...
- et al., Face recognition using Laplacianfaces, IEEE Trans. Pattern Anal. Mach. Intell. (2005)
- D. Cai, X. He, Y. Hu, J. Han, H. Thomas, Learning a spatially smooth subspace for face recognition, in: Proceedings of...
- Efficient and robust feature extraction by Maximum Margin Criterion, IEEE Trans. Neural Netw.
- Face recognition by regularized discriminant analysis, IEEE Trans. Syst. Man Cybern.
- Eigenfeature regularization and extraction in face recognition, IEEE Trans. Pattern Anal. Mach. Intell.
- Regularized locality preserving projections and its extensions for face recognition, IEEE Trans. Syst. Man Cybern.
Andrew Beng Jin Teoh obtained his BEng (Electronic) in 1999 and Ph.D. degree in 2003 from the National University of Malaysia. He is currently an associate professor in the Electrical and Electronic Engineering Department, College of Engineering, Yonsei University, South Korea. His research interests are pattern recognition, machine learning and information security. He has published more than 220 refereed international journal and conference articles, and several book chapters. He has been a reviewer for more than 30 journals and conferences, and has served on international conference committees worldwide.
Hiew Fu San received his B.E. degree in Computer Engineering in 2002 and M.E. degree in 2008 from the Multimedia University, Malaysia. His research interests include pattern recognition and remote sensing.