Recognizing occluded faces by exploiting psychophysically inspired similarity maps

https://doi.org/10.1016/j.patrec.2012.05.003Get rights and content

Abstract

The presence of occlusions in facial images is inevitable in unconstrained scenarios. However recognizing occluded faces remains a partially solved problem in computer vision. In this contribution we propose a novel Bayesian technique inspired by psychophysical mechanisms relevant to face recognition to address the facial occlusion problem. For some individuals certain facial regions, e.g. features comprising of some of the upper face, might be more discriminative than the rest of the features in the face. For others, it might be the features over the mid face and some of the lower face that are important. The proposed approach in this paper, will allow for such a psychophysical analysis to be factored into the recognition process. We have discovered and modeled similarity mappings that exist in facial domains by means of Bayesian Networks. The model can efficiently learn and exploit these mappings from the facial domain and hence capable of tackling uncertainties caused by occlusions. The proposed technique shows improved recognition rates over state of the art techniques.

Highlights

► Psychophysical principles. ► Intuitive occlusion model. ► Visualization of abstract phenomena via Bayesian Networks. ► Improved results over state of the art.

Introduction

Enhancing computer vision systems with psychophysical mechanisms enable them to take intelligent decisions on a par with the human mind, especially when uncertainty elements such as occlusions are encountered. A key objective of machine vision researchers is to build automated face recognition systems that can compete and eventually surpass, human performance. Many successful face recognition and verification models (Raducanu and Vitria, 2008, Schwaninger et al., 2004, Smeraldi and Bigun, 2002, Turk and Pentland, 1991) have derived their fundamental ideas from principles of cognitive psychology and neuroscience. Substantial research is ongoing to make face recognition systems robust to typical operational environments where uncertainties such as occlusion, illumination and other variations are common.

In this contribution we propose a novel “Psychophysically Inspired SImilarity MApping (PISIMA)” model to attack this potential problem. We differ from other approaches in the sense that, we intend to identify some key cognitive psychology principles from the literature which are vital to face recognition and transform them to a computational model with the aid of an intuitive Bayesian framework. This contribution is an enhanced version of the preliminary work carried out by Ibrahim et al. (2011). Basically PISIMA intends to map similarities between two main object entities. The first entity is a set of sparse components of the probe face image. The second entity comprises of a bulk set of known face images that are stored in the face database which are known as gallery samples. Few faces are recalled for each of the sparse components as a consequence of this similarity mapping based reasoning. Further this reasoning reveals inherent conditional independence properties between the subsamples of a face and the recalled face images which are influenced by the subsamples. Such conditional independent notions could be scientifically represented by Bayesian Networks (BNs) as mentioned by Nilsson (1998). Formally a BN is a Directed Acyclic Graph (DAG) where the nodes represent variables and the arcs encode conditional independencies between the variables. BNs serve as fundamental tools in tackling uncertainty problems as they characterize intuitive notions of human reasoning. In other words, PISIMA employs BNs to establish, learn and exploit intrinsic similarity mappings that are inherent in the face domain.

We will briefly present the framework of the proposed PISIMA with the aid of the flow-chart shown in Fig. 1. Firstly, face images in the database have been enhanced with standard preprocessing techniques. We have adapted the techniques proposed by Bartlett and Movellan (2002) to normalize the face images. This image enhancement enables the face images to be independent of variations such as scaling, translation, rotation and so on. Then a feature space (low dimensional face space) is constructed from the gallery (training set) of face images available in the face database using Principal Component Analysis (PCA). PISIMA further learns conditional probability potentials which provide information about how well a face can be influenced given that a particular face component or a combination of face components has been observed. This learning process is done offline from the gallery face images, that is when computing resources are free.

A given probe face is enhanced using similar preprocessing techniques which were applied to the gallery samples and subject to horizontal segmentation. Then the PCA features of facial entities, acquired by combining probe face components over the gallery face images, are extracted and projected over the feature space using an inheritance mechanism which will be described in Section 3.2. Further probable subjects are shortlisted by means of similarity mapping based processing. For a given probe, a BN is generated whose child node variables represent the belief states of short-listed subjects and the parent nodes represent the belief states of corresponding components which influenced them. Finally faces are recognized using a face score formula which will be derived in Section 3.4.

Section snippets

Synthesis of relevant ideas apart from computer vision

Apart from the pixel domain we have investigated some key principles relevant to face recognition from other related fields. Vision is a subfield of cognitive science which involves psychological inferences in the higher nervous system (Frey and Jojic, 2005, Lin et al., 2009). Psychologists are of the view that psychologically feasible computational models exhibit clear and strong relationships between behavior and properties of the domains which they intend to represent (Fific, 2005). A very

Building the Bayesian Network from facial components

In this section we will represent the facial domain in terms of simple components and gradually build the Bayesian Network from those components.

Experimental results and discussions

We have implemented the PISIMA model using MATLAB on Intel Pentium IV Core 2 Duo 2.39 Ghz CPU with 2 GB of RAM. We have made use of the routines offered by Delac et al., 2006, Murphy, 2007 to build the PCA subspace and BN respectively. We have evaluated the performance of PISIMA model using a series of experiments on standard Face Datasets (FDs). We have used the AR FD (Martinez and Benavente, 1998) which consists of over 3200 color images of 126 subjects. Images feature frontal view faces with

Conclusion

We have discovered that faces exhibit interesting similarity mappings via Bayesian Networks. The proposed PISIMA model encapsulates key psychophysical principles fundamental to reasoning objects under uncertainty, by means of statistical machine learning techniques. The proposed framework intuitively exploit these intrinsic mappings to recognize faces when they are prone to major occlusions coupled with other variations. Further the graphical nature of PISIMA aids to visualize the abstract

Acknowledgments

The authors thank those who have contributed in developing the AR dataset and for making them publicly available. We thank Dr. Murphy and his team, University of British Columbia, for providing the Bayes Net Toolbox. This research is supported by the James Watt scholarship awarded by Heriot-Watt University, UK. We also thank Kolej University TATI, Malaysia and Universiti Sains Malayisa for their support. We thank the editors and anonymous reviewers for providing useful suggestions.

References (55)

  • K. Delac et al.

    Independent comparative study of pca, ica, and lda on the feret data set

    Internat. J. Imaging Systems Technol.

    (2006)
  • R.O. Duda et al.

    Pattern Classification

    (2001)
  • H.D. Ellis et al.

    Identification of familiar and unfamiliar faces from internal and external features: Some implications for theories of face recognition

    Perception

    (1979)
  • Fific, A., 2005. Emerging holistic properties at face value: Assessing characteristics of face perception, Ph.D....
  • I.H. Fraser et al.

    Reaction time measures of feature saliency in schematic faces

    Perception

    (1990)
  • B.J. Frey et al.

    A comparison of algorithms for inference and learning in probabilistic graphical models

    IEEE Trans. Pattern Anal. Machine Intell.

    (2005)
  • V. Goffaux et al.

    Face inversion disproportionately impairs the perception of vertical but not horizontal relations between features

    J. Exp. Psychol. Human Percept. Perform.

    (2007)
  • R.C. Gonzalez et al.

    Digital Image Processing

    (2002)
  • P.W. Hallinan et al.

    Two and Three Dimensional Patterns of the Face

    (1999)
  • B. Heisele et al.

    A component-based framework for face detection and identification

    Internat. J. Comput. Vision

    (2007)
  • V. Ibrahim et al.

    Psychophysically inspired Bayesian occlusion model to recognize occluded faces

  • F.V. Jensen et al.

    Bayesian Networks and Decision Graphs

    (2007)
  • J. Kim et al.

    Effective representation using ica for face recognition robust to local distortion and partial occlusion

    IEEE Trans. Pattern Anal. Machine Intell.

    (2005)
  • H. Laia et al.

    Reliable face recognition using adaptive and robust correlation filters

    Computer Vision and Image Understanding

    (2008)
  • J. Lin et al.

    Robust face recognition using posterior union model based neural networks

    IET Comput. Vision

    (2009)
  • Martinez, A.M., Benavente, R., 1998. The AR Face Database. CVC Technical Report No. 24, Technical...
  • A.M. Martinez

    Recognizing imprecisely localized, partially occluded and expression variant faces from a single sample per class

    IEEE Trans. Pattern Anal. Machine Intell.

    (2002)
  • Cited by (16)

    • Fast and robust occluded face detection in ATM surveillance

      2018, Pattern Recognition Letters
      Citation Excerpt :

      A more comprehensive survey of existing works is presented in Section 2. Among various existing approaches [1,5,11,30,37–40], detecting users’ facial images are widely used in practice, where the acquired facial images are crucial for verifying occlusion. Some recent methods are proposed related to face recognition and classification [8–10,26].

    • Optimized symmetric partial facegraphs for face recognition in adverse conditions

      2018, Information Sciences
      Citation Excerpt :

      The AR Face database has been used to evaluate the OSPF performance under different operational conditions. The performance of the proposed OSPF has been compared with ten standard approaches including Line Edge Map (LEM) [12], Ensemble String Matching (ESM) [8], Adaptively Weighted Patch Pseudo Zernike Moment Array (AWPPZMA) [20], partitioned Sparse Representation-based Classification (p-SRC), [43],Adaptively Weighted Sub-Gabor Array(AWSGA) [19] , Psychophysically Inspired Similarity MApping (PISMA) [40] and Harmony Search Oriented EBGM (HSO-EBGM) [24], Adaptive Noise Dictionary_Sparse Regression based Classifier (AND_SRC) [9], Local Structure based Multi-Phase Collaborative Representation Classification (LS_MPCRC) [28] and SIFT based Kepenekci approach (SIFT) [27]. In this section, the proposed OSPF approach has been assessed under relatively controlled conditions using both the AR and the FRGC ver2.0 databases.

    • Recognizing faces prone to occlusions and common variations using optimal face subgraphs

      2016, Applied Mathematics and Computation
      Citation Excerpt :

      Face recognitions systems have an essential role in biometric-oriented video surveillance systems which have been progressively incorporated in operational environments where the problem of encountering occlusions cannot be avoided [38]. Several approaches have been proposed to overcome the shortcomings of face recognition challenges: expressions, illumination conditions and facial occlusions [16,28,30,38,41,42,44]. In general, occluded face recognition approaches are classified into two categories: holistic and component-based.

    • Partial Face Identification using Local Feature Extraction Algorithm on Different Classifiers

      2023, 2023 International Conference on Computer, Electronics and Electrical Engineering and their Applications, IC2E3 2023
    • Recognition of partially occluded faces using regularized ICA

      2021, Inverse Problems in Science and Engineering
    View all citing articles on Scopus
    View full text