A novel local extrema based gravitational search algorithm and its application in face recognition using one training image per class

https://doi.org/10.1016/j.engappai.2014.05.002Get rights and content

Abstract

In this present paper a new methodology has been presented involving a stochastic optimization based approach to solve the face recognition problem with only one training image per class. Singular value decomposition (SVD) is used to decompose the single training image into two component images in order to compute the within class scatter matrix. The stochastic optimization approach is implemented employing gravitational search algorithm (GSA) which searches for an optimal transform matrix instead of using the traditional solution of general eigenvalue problem as is carried out in Fisher linear discriminant analysis (FLDA). The present paper also proposes two novel variants of GSA, namely the 2-D version of GSA, in order to cater for the 2-D image data, and the other one is a 2-D randomized local extrema based GSA (RLEGSA), which employs a stochastic local neighborhood based search instead of global search, as in basic GSA. Finally, a novel concept of performing an automated selection of projection vectors is incorporated in the 2-D RLEGSA to propose an improved variant, called the Modified RLEGSA (MRLEGSA). Experimental results, based on benchmark Yale A and ORL databases, show that the proposed methods outperform several existing schemes.

Introduction

In recent years face recognition has become a widely researched topic since it has numerous real world applications like authentication, identification, advanced human computer interaction and many other emerging fields of research. Face recognition spans the subjects of pattern recognition, image processing, computer vision, machine learning, etc. With the growing importance of biometric recognition systems (Jain and Prabhakar, 2004), due to low susceptibility to security loss, face recognition based biometrics has gained much popularity in recent times.

Many approaches to face recognition have been proposed over the last two decades (Zhao et al., 2003, Jafri and Arabnia, 2009, Chakrabarty et al., 2013) most of which are based on supervised learning. Hence they follow a common sequence of steps. There is a feature extraction step in which a set of discriminating features are extracted from a set of training images (Brunelli and Poggio, 1993). Then if the set of features extracted is dimensionally large, there may be a feature selection/reduction procedure where a reduced set of highly discriminating features are selected employing a suitable algorithm which may attempt to optimize a suitable cost function. For the face recognition problem, since images of different persons are, after all, human faces they have some common characteristics which indicates that some features in a large set of extracted features will not have enough discriminating power. This makes feature selection/reduction an important step as a large feature set might not necessarily result in a higher recognition rate (Tu et al., 2007). This step is followed by the classification step where the final conclusion regarding recognition or authentication is actually performed. Several variations of methods proposed in each of these steps generates new approaches in solving the problem.

A general drawback of the supervised learning method is that for a good classification accuracy rate, the number of training samples needs to be sufficiently large (depending on the number of test images and the number of classes). In those particular methods where inter class and intra class distances are used, the methods do not work at all when there is a single training image of each subject because in this case, the intra class distances are not defined (Gao et al., 2008). This drawback is prominent in several approaches which include the popular methodology of Fisher linear discriminant analysis (FLDA). A few methods have been proposed in recent years to solve this problem of FLDA based face recognition where there is only one training image per person e.g. generalized inverse method (Tian et al., 1988), perturbation based method (Hong and Yang 1991), direct FLDA method (Yu and Yang 2001), null space method (Lu et al., 2003), 2-D FLDA method (Ye et al., 2004), singular value decomposition (SVD) based method (Gao et al., 2008) etc.

Our present research concentrates on those more challenging face recognition problems which suffer from small sample size (SSS) problem, typically in those situations where there is only a single training sample available per class/person. The research on face recognition problem using single training sample per person is well known as a very challenging problem and it has gained prominence in recent times (Gao et al., 2008, Tan et al., 2006, Zhu et al., 2012). This situation arises in many real-world scenarios such as utilization of smart cards, airport check-in and check-out, special situations of law enforcement, critical surveillance scenarios and checking for access control etc. (Tan et al., 2006, Zhu et al., 2012). Our present work is inspired by the method proposed by Gao et al. (2008) in which the single training image of a particular class is decomposed into two component images using SVD and then the intra class distance can be conveniently determined using these two resulting images. However, the SVD based FLDA approach in Gao et al. (2008) uses the general eigen value theory to solve the cost function. In this paper we propose a novel method of solving the FLDA cost function using an intelligent iterative stochastic optimization algorithm which can simultaneously solve the feature selection/reduction phase along with the feature extraction phase, thus effectively merging the operations required in two steps. The iterative stochastic optimization problem is solved using a recently proposed method, called gravitational search algorithm (GSA). GSA is a powerful iterative optimization algorithm based on Newton׳s laws of gravity and motion (Rashedi et al., 2009, Pal et al., 2013). Several interesting applications have recently been proposed using GSA in the domains of e.g. image processing (Sun and Zhang, 2013) and data clustering (Hatamlou et al., 2012, Hatamlou et al., 2011). Three modifications of the GSA have been proposed in this paper with the objective of solving our problem. The first variation proposes a 2-D GSA to adopt the GSA in processing of 2-D images. The second variation introduces a novel random local extrema based GSA (RLEGSA). To the best of our knowledge and belief, although some local best methods have been proposed earlier for a similar swarm intelligence based method called particle swarm optimization (PSO) (Suganthan, 1999, Das Sharma et al., 2012), this is the first such variation developed in the genre of GSA. The third variation incorporates the automated selection of projection vectors within the GSA based cost function optimization framework.

The rest of this paper is organized as follows. Section 2 provides a description of the SVD and FLDA based feature extraction schemes for the single training image per person scenario. Section 3 describes an overview of the traditional gravitational search algorithm and detailed descriptions of the novel variants proposed in this work. Section 4 presents the experiments and simulation results. Section 5 concludes the paper.

Section snippets

SVD and FLDA based feature extraction scheme

Let us consider there are C classes with each having a single image Ikm×n (k=1,,C). If mn, then let Ukm×m and Vkn×n be the eigenvector matrices of Ik IkT and IkTIk respectively. Let uik and vik be the ith column of Uk and Vk respectively. Let σik be the ith singular value of Ik such that σik is in descending order of magnitude as i increases i.e. σ1kσ2kσi1kσikσi+1kσnk. Then the image can be described as (Gao et al., 2008, Golub and Loan, 1983)Ik=i=1nσikuik(vik)T

Hence each image

Gravitational search algorithm (GSA)

GSA is a relatively recently proposed optimization algorithm based on the Newtonian law of gravity (Rashedi et al., 2009). Let us consider an isolated system of p objects called agents or particles. This can be considered a universe consisting of only these p agents which obey (1) Newton׳s Law of Motion and (2) Newton׳s Law of Gravity. Let the position of the ith agent (i=1, 2, …, p) in n dimensional space at time t be given byXi(t)={xi1(t),xi2(t),,xin(t)}

Also we define three types of masses

Experimental results

Extensive experimentations have been carried out to determine the effectiveness of the proposed algorithms. At first the 2-D version of the traditional GSA proposed in Section 3.2 and the local extrema based variation (RLEGSA) proposed in Section 3.3 have been separately evaluated on two well known benchmark face databases, namely Yale A (Yale University) and ORL (ORL database) databases and the recognition accuracies have been compared with four existing comparable methods, as shown in Table 1

Conclusions

In this work, a new approach to solve SVD based face recognition problems involving single training image per person is proposed using stochastic optimization approaches. The problem is solved using GSA, a contemporary algorithm recently proposed under the category of heuristic optimization methods, which attempts to determine an optimal transform matrix W such that a cost functional J(W) is minimized. In this context two new variants of GSA, called the 2-D version of GSA, and a 2-D randomized

Acknowledgments

This work was supported by University Grants Commission (UGC), India, under University with Potential for Excellence (UPE) – Phase II Scheme awarded to Jadavpur University, Kolkata, India.

References (29)

  • R. Brunelli et al.

    Face recognition: features versus templates

    IEEE Trans. Pattern Anal. Mach. Intell.

    (1993)
  • S. Chen et al.

    Making FLDA applicable to face recognition with one sample per person

    Pattern Recognit.

    (2004)
  • Code for Gao et al., 2008....
  • K. Das Sharma et al.

    A random spatial lbest PSO-based hybrid strategy for designing adaptive fuzzy controllers for a class of nonlinear systems

    IEEE Trans. Instrum. Meas.

    (2012)
  • Cited by (25)

    • Improving exploration and exploitation via a Hyperbolic Gravitational Search Algorithm

      2020, Knowledge-Based Systems
      Citation Excerpt :

      In order to find automatically the number of clusters in a given dataset, GSA has been used [13]. GSA variants for face recognition have been introduced in [14]. The main variants of this approach are the automated selection of projection vectors within GSA and a stochastic local neighborhood based search.

    • A comprehensive survey on gravitational search algorithm

      2018, Swarm and Evolutionary Computation
      Citation Excerpt :

      Heuristic algorithms are useful tools to refine image processing procedures by selecting salient features, and tuning the algorithm parameters. Some applications of PR are face detection [152–154], medical images [154], image segmentation [155], noise filtering [156], image retrieval [67,71], and adaptive image enhancement [157]. Chakraborti et al. [152] selected features for face recognition problems using modified version of GSA in which the employed features are local binary pattern, modified census transform, and local gradient pattern.

    View all citing articles on Scopus
    View full text