Elsevier

Information Fusion

Volume 48, August 2019, Pages 84-94
Information Fusion

Learning deep compact similarity metric for kinship verification from face images

https://doi.org/10.1016/j.inffus.2018.07.011Get rights and content

Highlights

  • A new DNN is proposed to facilitate fusion of deep embeddings for parent-child data.

  • A deep metric learning algorithm is derived to learn a compact kin similarity metric.

  • Evaluations show the efficacy of our kinship metric with high verification accuracy.

Abstract

Recent advances in kinship verification have shown that learning an appropriate kinship similarity metric on human faces plays a critical role in this problem. However, most of existing distance metric learning (DML) based solutions rely on linearity assumption of the kinship metric model, and the domain knowledge of large cross-generation discrepancy (e.g., large age span and gender difference between parent and child images) has not been considered in metric learning, leading to degraded performance for genetic similarity measure on human faces. To address these limitations, we propose in this work a new kinship metric learning (KML) method with a coupled deep neural network (DNN) model. KML explicitly models the cross-generation discrepancy inherent on parent-child pairs, and learns a coupled deep similarity metric such that the image pairs with kinship relation are pulled close, while those without kinship relation (but with high appearance similarity) are pushed as far away as possible. Moreover, by imposing the intra-connection diversity and inter-connection consistency over the coupled DNN, we introduce the property of hierarchical compactness into the coupled network to facilitate deep metric learning with limited amount of kinship training data. Empirically, we evaluate our algorithm on several kinship benchmarks against the state-of-the-art DML alternatives, and the results demonstrate the superiority of our method.

Introduction

Recent evidence in psychology has indicated that face appearance is a reliable and critical cue for measure of the genetic similarity between the parent and their children [1], [2], [3]. Motivated by this, researchers from biometrics and computer vision societies have developed some computational models for kinship verification via face images [4], [5], [6], [7]. The objective of this verification problem is to determine whether there exists a kin relationship between a given pair of face images. Potential applications based on such verification technique ranges from social media mining to children adoptions and missing children searching.

While encouraging results have been demonstrated over the past a few years, kinship verification using face images still remains open. On one hand, face images are often captured in wild conditions, and varying illumination, poses and expressions in such scenarios make the verification problem quite challenging. On the other hand, kinship verification aims to investigate the kin relationship between two different visual entities (e.g., father and daughter), and thus the inherent appearance gap of intra-class in kinship verification is generally far larger than that in traditional face recognition [8], [9], [10], [11], [12], [13], [14], [15], [16], [17], [18], [19], [20], [21], [22].

Recent advances in kinship verification have indicated that learning an appropriate similarity metric on human faces plays a critical role in kinship verification. Distance metric learning (DML) methods [23], [24], [25], [26] have been investigated in kinship verification [7], [27], [28] for the purpose of achieving an optimal distance metric rather than a pre-specified one for more robust kin-faces matching. Despite the success of DML-based approaches, existing solutions to kinship verification still suffer from two critical limitations:

(1) They are often proposed to learn a linear distance metric for input space, which is less powerful to capture the nonlinear manifold where the genetic traits inherent on human face lie. Moreover, in existing DML-based solutions the parent-child face images share a common linear transformation for visual matching, and hence the domain prior of large distribution gap between the parent and child has not been taken into account, leading to inaccurate measure of inherent kin similarity on human faces.

(2) While learning a nonlinear distance metric based on the deep neural networks (DNNs) [20], [29] is a straightforward solution to this problem, supervised metric learning with DNN typically requires a large number of labeled training samples, which is extremely expensive to collected in practical kinship verification due to the privacy concerns and involved time and human costs.

To address these issues, we propose in this paper a new kinship metric learning (KML) method for kinship verification from face images with a well-designed DNN architecture. The main contributions of this work are summarized as follows:

(1) We design a coupled DNN, named KinNet, for kinship verification from face images. KinNet explicitly models the cross-generation discrepancy inherent on parent-child pairs, and facilitates deep metric learning with limited amount of labeled kinship data. Particularly, by imposing the diversity regularization and cross-generation consistency regularization on the coupled connections, we introduce the property of hierarchical compactness into the coupled network to improve generalization performance of the kinship metric model.

(2) We develop a new deep metric learning algorithm with the proposed KinNet architecture to learn a deep compact cross-generation similarity metric. The learned similarity metric possesses some desirable properties that help address the limitations of most existing DML-based solutions to kinship verification.

From the information fusion point of view, the parent-child faces input to KML can be regarded as the two-view kin data for kinship verification, and hence KML can be considered as a multi-view metric learning in the deep learning framework. Essentially, our KML implicitly learns to fuse a pair of deep embeddings for robust similarity measure of the parent-child pairs.

On the other hand, by latent variable modeling, an ensemble of latent factors in weight matrices of the KinNet are enforced to be as diverse from one another as possible, such that the learned deep embeddings are compact enough to reduce information redundancy in metric learning. From the ensemble learning point of view, our KML implicitly learns to fuse a set of diverse latent factors in deep metric learning.

(3) We empirically evaluate our method on several benchmark datasets, and the results show that our proposed KML significantly boosts the current state-of-the-art level of kinship verification.

The remainder of this paper is organized as follows. We first briefly review the related work in Section 2 , and Section 3 details the kinship metric learning method with the proposed KinNet architecture. Experimental settings, results and discussions are presented in Section 4 , and Section 5 concludes the paper.

Section snippets

Related work

In this section, some related topics are briefly reviewed: (1) kinship verification, and (2) deep metric learning.

Roughly speaking, existing methods for kinship verification are either feature-based [4], [6], [30], [31], [32], [33], [34] or distance metric-based [5], [7], [27], [28], [35], [36], [37], [38]. Feature-based methods extract discriminative feature from face images by hand-crafted image descriptors [4], [6], [30] or feature learning [32], [33], [34] to represent genetic traits on

Our approach

In this section, we first introduce the proposed KinNet architecture, and then elaborate our KML method with KinNet for kinship verification. Finally, we present the optimization algorithm to solve the KML problem.

The motivation figure of our proposed KML method is shown in Fig. 1. Suppose there is a quadruplet (xp,xc,x^p,x^c) in the original metric space, where (xp, xc) are a pair of parent-child faces with kin relationship, and x^c and x^p are their nearest samples in the child and parent

Experiments

To evaluate the effectiveness of our proposed kinship verification method, we conduct experiments on four widely used datasets: KinFaceW-I1, KinFaceW-II2, Cornell KinFace3, and UB KinFace4. Fig. 4 presents some sample kin pairs from the KinFaceW-II dataset. We elaborate the datasets, experimental settings, results and analysis in

Conclusion

We have presented in this paper a kinship metric learning method to address kinship verification using facial images. We have shown that, despite the differences in image statistics and tasks between the datasets for face recognition and kinship verification, the transferred deep face representation leads to significantly improved accuracy in kinship verification. Also, by learning a coupled and deep compact similarity metric with the KinNet architecture tailored for kinship verification

Acknowledgment

This work is partially supported by the National Natural Science Foundation of China under grants 61373090 and 61601310.

References (66)

  • R.G. Cinbis et al.

    Unsupervised metric learning for face identification in TV video

    2011 International Conference on Computer Vision

    (2011)
  • M. Guillaumin et al.

    Is that you? Metric learning approaches for face identification

    2009 IEEE 12th International Conference on Computer Vision

    (2009)
  • M. Köstinger et al.

    Large scale metric learning from equivalence constraints

    Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on

    (2012)
  • Y. Taigman et al.

    Deepface: closing the gap to human-level performance in face verification

    Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition

    (2014)
  • A. Mignon et al.

    Pcca: a new approach for distance learning from sparse pairwise constraints

    Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on

    (2012)
  • H.V. Nguyen et al.

    Cosine similarity metric learning for face verification

    Asian Conference on Computer Vision

    (2010)
  • Z. Cui et al.

    Fusing robust face region descriptors via multiple metric learning for face recognition in the wild

    Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition

    (2013)
  • O.M. Parkhi et al.

    Deep face recognition

    British Machine Vision Conference

    (2015)
  • W. Deng et al.

    Transform-invariant PCA: a unified approach to fully automatic face alignment, representation, and recognition

    IEEE Trans. Pattern Anal. Mach. Intell.

    (2014)
  • X. Cai et al.

    Deep nonlinear metric learning with independent subspace analysis for face verification

    Proceedings of the 20th ACM international conference on Multimedia

    (2012)
  • J. Lu et al.

    Discriminative multimanifold analysis for face recognition from a single training sample per person

    IEEE Trans. Pattern Anal. Mach. Intell.

    (2013)
  • J. Lu et al.

    Learning compact binary face descriptor for face recognition

    IEEE Trans. Pattern Anal. Mach. Intell.

    (2015)
  • J. Hu et al.

    Discriminative deep metric learning for face verification in the wild

    Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition

    (2014)
  • J. Hu et al.

    Deep transfer metric learning

    Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition

    (2015)
  • X. Zhou et al.

    Multiple face tracking and recognition with identity-specific localized metric learning

    Pattern Recognit.

    (2018)
  • E.P. Xing et al.

    Distance metric learning with application to clustering with side-information

    Advances in Neural Information Processing Systems

    (2003)
  • K.Q. Weinberger et al.

    Distance metric learning for large margin nearest neighbor classification

    Advances in Neural Information Processing Systems

    (2005)
  • J.V. Davis et al.

    Information-theoretic metric learning

    Proceedings of the 24th International Conference On Machine Learning

    (2007)
  • J. Hu et al.

    Local large-margin multi-metric learning for face and kinship verification

    IEEE Trans. Circuits Syst. Video Technol.

    (2017)
  • H. Yan et al.

    Discriminative multimetric learning for kinship verification

    IEEE Trans. Inf. Forensics Secur.

    (2014)
  • Y. LeCun et al.

    Backpropagation applied to handwritten zip code recognition

    Neural Comput.

    (1989)
  • G. Guo et al.

    Kinship measurement on salient facial features

    IEEE Trans. Instrum. Meas.

    (2012)
  • X. Zhou et al.

    Gabor-based gradient orientation pyramid for kinship verification under uncontrolled environments

    Proceedings of the 20th ACM International Conference on Multimedia

    (2012)
  • Cited by (70)

    • Knowledge-based tensor subspace analysis system for kinship verification

      2022, Neural Networks
      Citation Excerpt :

      However, there is a risk that the alternative paradigm is too large to be truly comprehensible to humans. Many of previous studies (Chen et al., 2020; Dornaika et al., 2019; Laiadi et al., 2019b; Liang et al., 2019; Zhou et al., 2019b) aims to project deep features from the unknown subspaces (black-box subspaces) to the known and more discriminative subspaces (e.g., microaggregation and shallow decision trees (Blanco-Justicia, Domingo-Ferrer, Martínez, & Sánchez, 2020)). Through this transformation, they benefit from the knowledge-based domain and their proposed frameworks guarantee the transparency, efficiency and robustness of the proposed models based on deep features.

    • Robust discriminative feature subspace analysis for kinship verification

      2021, Information Sciences
      Citation Excerpt :

      Robinson et al. [36] presented Families in the Wild (FIW) as the largest kinship dataset and performed verification using pre-tained CNN models. Other relevant deep learning approaches are toward-young cross-generation model using sparse discriminative metric Loss (SDM-Loss) [37], DCTNet [38], supervised mixed norm autoEncoder (SMNAE) [39], kinship metric learning (KML) with a coupled deep neural network (DNN) model [40], kernelized bi-directional PCA (K-BDPCA) [41], and adversarial convolutional network (AdvKin) [42]. Wang et al. [37] employed SDM-Loss in a towards-young cross-generation model to extract deep features for effective kinship verification.

    View all citing articles on Scopus
    View full text