Learning deep compact similarity metric for kinship verification from face images

doi:10.1016/j.inffus.2018.07.011

Information Fusion

Volume 48, August 2019, Pages 84-94

https://doi.org/10.1016/j.inffus.2018.07.011 Get rights and content

Highlights

•
A new DNN is proposed to facilitate fusion of deep embeddings for parent-child data.
•
A deep metric learning algorithm is derived to learn a compact kin similarity metric.
•
Evaluations show the efficacy of our kinship metric with high verification accuracy.

Abstract

Recent advances in kinship verification have shown that learning an appropriate kinship similarity metric on human faces plays a critical role in this problem. However, most of existing distance metric learning (DML) based solutions rely on linearity assumption of the kinship metric model, and the domain knowledge of large cross-generation discrepancy (e.g., large age span and gender difference between parent and child images) has not been considered in metric learning, leading to degraded performance for genetic similarity measure on human faces. To address these limitations, we propose in this work a new kinship metric learning (KML) method with a coupled deep neural network (DNN) model. KML explicitly models the cross-generation discrepancy inherent on parent-child pairs, and learns a coupled deep similarity metric such that the image pairs with kinship relation are pulled close, while those without kinship relation (but with high appearance similarity) are pushed as far away as possible. Moreover, by imposing the intra-connection diversity and inter-connection consistency over the coupled DNN, we introduce the property of hierarchical compactness into the coupled network to facilitate deep metric learning with limited amount of kinship training data. Empirically, we evaluate our algorithm on several kinship benchmarks against the state-of-the-art DML alternatives, and the results demonstrate the superiority of our method.

Introduction

Recent evidence in psychology has indicated that face appearance is a reliable and critical cue for measure of the genetic similarity between the parent and their children [1], [2], [3]. Motivated by this, researchers from biometrics and computer vision societies have developed some computational models for kinship verification via face images [4], [5], [6], [7]. The objective of this verification problem is to determine whether there exists a kin relationship between a given pair of face images. Potential applications based on such verification technique ranges from social media mining to children adoptions and missing children searching.

While encouraging results have been demonstrated over the past a few years, kinship verification using face images still remains open. On one hand, face images are often captured in wild conditions, and varying illumination, poses and expressions in such scenarios make the verification problem quite challenging. On the other hand, kinship verification aims to investigate the kin relationship between two different visual entities (e.g., father and daughter), and thus the inherent appearance gap of intra-class in kinship verification is generally far larger than that in traditional face recognition [8], [9], [10], [11], [12], [13], [14], [15], [16], [17], [18], [19], [20], [21], [22].

Recent advances in kinship verification have indicated that learning an appropriate similarity metric on human faces plays a critical role in kinship verification. Distance metric learning (DML) methods [23], [24], [25], [26] have been investigated in kinship verification [7], [27], [28] for the purpose of achieving an optimal distance metric rather than a pre-specified one for more robust kin-faces matching. Despite the success of DML-based approaches, existing solutions to kinship verification still suffer from two critical limitations:

(1) They are often proposed to learn a linear distance metric for input space, which is less powerful to capture the nonlinear manifold where the genetic traits inherent on human face lie. Moreover, in existing DML-based solutions the parent-child face images share a common linear transformation for visual matching, and hence the domain prior of large distribution gap between the parent and child has not been taken into account, leading to inaccurate measure of inherent kin similarity on human faces.

(2) While learning a nonlinear distance metric based on the deep neural networks (DNNs) [20], [29] is a straightforward solution to this problem, supervised metric learning with DNN typically requires a large number of labeled training samples, which is extremely expensive to collected in practical kinship verification due to the privacy concerns and involved time and human costs.

To address these issues, we propose in this paper a new kinship metric learning (KML) method for kinship verification from face images with a well-designed DNN architecture. The main contributions of this work are summarized as follows:

(1) We design a coupled DNN, named KinNet, for kinship verification from face images. KinNet explicitly models the cross-generation discrepancy inherent on parent-child pairs, and facilitates deep metric learning with limited amount of labeled kinship data. Particularly, by imposing the diversity regularization and cross-generation consistency regularization on the coupled connections, we introduce the property of hierarchical compactness into the coupled network to improve generalization performance of the kinship metric model.

(2) We develop a new deep metric learning algorithm with the proposed KinNet architecture to learn a deep compact cross-generation similarity metric. The learned similarity metric possesses some desirable properties that help address the limitations of most existing DML-based solutions to kinship verification.

From the information fusion point of view, the parent-child faces input to KML can be regarded as the two-view kin data for kinship verification, and hence KML can be considered as a multi-view metric learning in the deep learning framework. Essentially, our KML implicitly learns to fuse a pair of deep embeddings for robust similarity measure of the parent-child pairs.

On the other hand, by latent variable modeling, an ensemble of latent factors in weight matrices of the KinNet are enforced to be as diverse from one another as possible, such that the learned deep embeddings are compact enough to reduce information redundancy in metric learning. From the ensemble learning point of view, our KML implicitly learns to fuse a set of diverse latent factors in deep metric learning.

(3) We empirically evaluate our method on several benchmark datasets, and the results show that our proposed KML significantly boosts the current state-of-the-art level of kinship verification.

The remainder of this paper is organized as follows. We first briefly review the related work in Section 2 , and Section 3 details the kinship metric learning method with the proposed KinNet architecture. Experimental settings, results and discussions are presented in Section 4 , and Section 5 concludes the paper.

Section snippets

Related work

In this section, some related topics are briefly reviewed: (1) kinship verification, and (2) deep metric learning.

Roughly speaking, existing methods for kinship verification are either feature-based [4], [6], [30], [31], [32], [33], [34] or distance metric-based [5], [7], [27], [28], [35], [36], [37], [38]. Feature-based methods extract discriminative feature from face images by hand-crafted image descriptors [4], [6], [30] or feature learning [32], [33], [34] to represent genetic traits on

Our approach

In this section, we first introduce the proposed KinNet architecture, and then elaborate our KML method with KinNet for kinship verification. Finally, we present the optimization algorithm to solve the KML problem.

The motivation figure of our proposed KML method is shown in Fig. 1. Suppose there is a quadruplet $(x_{p}, x_{c}, {\hat{x}}_{p}, {\hat{x}}_{c})$ in the original metric space, where (x_p, x_c) are a pair of parent-child faces with kin relationship, and ${\hat{x}}_{c}$ and ${\hat{x}}_{p}$ are their nearest samples in the child and parent

Experiments

To evaluate the effectiveness of our proposed kinship verification method, we conduct experiments on four widely used datasets: KinFaceW-I¹, KinFaceW-II², Cornell KinFace³, and UB KinFace⁴. Fig. 4 presents some sample kin pairs from the KinFaceW-II dataset. We elaborate the datasets, experimental settings, results and analysis in

Conclusion

We have presented in this paper a kinship metric learning method to address kinship verification using facial images. We have shown that, despite the differences in image statistics and tasks between the datasets for face recognition and kinship verification, the transferred deep face representation leads to significantly improved accuracy in kinship verification. Also, by learning a coupled and deep compact similarity metric with the KinNet architecture tailored for kinship verification

Acknowledgment

This work is partially supported by the National Natural Science Foundation of China under grants 61373090 and 61601310.

References (66)

J. Goldberger et al.
Neighbourhood components analysis
Advances in Neural Information Processing Systems
(2004)
S. Wang et al.
Kinship verification on families in the wild with marginalized denoising metric learning
Automatic Face & Gesture Recognition (FG 2017), 2017 12th IEEE International Conference on
(2017)
A. Krizhevsky et al.
Imagenet classification with deep convolutional neural networks
Advances in Neural Information Processing Systems
(2012)
A. Alvergne et al.
Cross-cultural perceptions of facial resemblance between kin
J. Vis.
(2009)
G. Kaminski et al.
Human ability to detect kinship in strangers’ faces: effects of the degree of relatedness
Proc. R. Soc. Lond. B
(2009)
M.F. Dal Martello et al.
Lateralization of kin recognition signals in the human face
J. Vis.
(2010)
R. Fang et al.
Towards computational models of kinship verification
2010 IEEE International Conference on Image Processing
(2010)
S. Xia et al.
Kinship verification through transfer learning
IJCAI Proceedings-International Joint Conference on Artificial Intelligence
(2011)
X. Zhou et al.
Kinship verification from facial images under uncontrolled conditions
Proceedings of the 19th ACM International Conference on Multimedia
(2011)
J. Lu et al.
Neighborhood repulsed metric learning for kinship verification
Pattern Anal. Mach. Intell. IEEE Trans.
(2014)

R.G. Cinbis et al.

Unsupervised metric learning for face identification in TV video

2011 International Conference on Computer Vision

(2011)

M. Guillaumin et al.

Is that you? Metric learning approaches for face identification

2009 IEEE 12th International Conference on Computer Vision

(2009)

M. Köstinger et al.

Large scale metric learning from equivalence constraints

Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on

(2012)

Y. Taigman et al.

Deepface: closing the gap to human-level performance in face verification

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition

(2014)

A. Mignon et al.

Pcca: a new approach for distance learning from sparse pairwise constraints

Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on

(2012)

H.V. Nguyen et al.

Cosine similarity metric learning for face verification

Asian Conference on Computer Vision

(2010)

Z. Cui et al.

Fusing robust face region descriptors via multiple metric learning for face recognition in the wild

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition

(2013)

O.M. Parkhi et al.

Deep face recognition

British Machine Vision Conference

(2015)

W. Deng et al.

Transform-invariant PCA: a unified approach to fully automatic face alignment, representation, and recognition

IEEE Trans. Pattern Anal. Mach. Intell.

(2014)

X. Cai et al.

Deep nonlinear metric learning with independent subspace analysis for face verification

Proceedings of the 20th ACM international conference on Multimedia

(2012)

J. Lu et al.

Discriminative multimanifold analysis for face recognition from a single training sample per person

IEEE Trans. Pattern Anal. Mach. Intell.

(2013)

J. Lu et al.

Learning compact binary face descriptor for face recognition

IEEE Trans. Pattern Anal. Mach. Intell.

(2015)

J. Hu et al.

Discriminative deep metric learning for face verification in the wild

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition

(2014)

J. Hu et al.

Deep transfer metric learning

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition

(2015)

X. Zhou et al.

Multiple face tracking and recognition with identity-specific localized metric learning

Pattern Recognit.

(2018)

E.P. Xing et al.

Distance metric learning with application to clustering with side-information

Advances in Neural Information Processing Systems

(2003)

K.Q. Weinberger et al.

Distance metric learning for large margin nearest neighbor classification

Advances in Neural Information Processing Systems

(2005)

J.V. Davis et al.

Information-theoretic metric learning

Proceedings of the 24th International Conference On Machine Learning

(2007)

J. Hu et al.

Local large-margin multi-metric learning for face and kinship verification

IEEE Trans. Circuits Syst. Video Technol.

(2017)

H. Yan et al.

Discriminative multimetric learning for kinship verification

IEEE Trans. Inf. Forensics Secur.

(2014)

Y. LeCun et al.

Backpropagation applied to handwritten zip code recognition

Neural Comput.

(1989)

G. Guo et al.

Kinship measurement on salient facial features

IEEE Trans. Instrum. Meas.

(2012)

X. Zhou et al.

Gabor-based gradient orientation pyramid for kinship verification under uncontrolled environments

Proceedings of the 20th ACM International Conference on Multimedia

(2012)

Cited by (70)

Kinship verification using multi-level dictionary pair learning for multiple resolution images
2023, Pattern Recognition
Kinship verification using facial images is gaining substantial attention by computer vision researchers. The real challenge in kinship verification is to effectively represent the discriminative features to ease the differences between kinship image pairs. Further, existing kinship methods only focus on a single resolution, and ignore the variability of resolutions in practical scenarios. To address these issues, we propose a multi-level dictionary pair learning (MLDPL) method to learn dictionary pairs by incorporating multiple resolution images for kinship verification. We learn dictionary pairs jointly by transforming discriminative features of image pairs into different coding coefficients in the same space, thereby reducing the differences between them. Further, multiple resolution images are incorporated into dictionary pair learning to effectively deal with resolution variations in kinship verification. Extensive experiments are performed on different kinship datasets to validate the efficacy of proposed MLDPL method. Experimental results show that MLDPL achieves competitive performance on all kinship datasets.
A survey on kinship verification
2023, Neurocomputing
In this survey, kinship verification is defined as the automatic process of verifying whether two or more persons are blood relatives (kin) by analyzing images of their faces. Kinship verification is an important research field in computer vision with many applications such as finding missing persons, family album organization, and online image search. Although substantial progress has been made in kinship verification in the past decade, there are still challenges such as intrinsic (face i.e., differences in facial appearance) and extrinsic (acquisition i.e., varying imaging conditions) problems. And there is still a demand for more diverse datasets.
Therefore, this paper provides a survey on kinship verification methods and datasets. The survey starts with the definition of kinship verification and its corresponding intrinsic and extrinsic challenges. Then, an overview of kinship verification methods and datasets is given. Finally, a new multi-modal dataset (Nemo-Kinship Dataset) is proposed as a benchmark dataset addressing large inter-subject age variations consisting of 4216 videos of 248 persons from 85 families. The newly collected dataset is used to systematically test and analyze state-of-the-art methods.
Knowledge-based tensor subspace analysis system for kinship verification
2022, Neural Networks
Citation Excerpt :
However, there is a risk that the alternative paradigm is too large to be truly comprehensible to humans. Many of previous studies (Chen et al., 2020; Dornaika et al., 2019; Laiadi et al., 2019b; Liang et al., 2019; Zhou et al., 2019b) aims to project deep features from the unknown subspaces (black-box subspaces) to the known and more discriminative subspaces (e.g., microaggregation and shallow decision trees (Blanco-Justicia, Domingo-Ferrer, Martínez, & Sánchez, 2020)). Through this transformation, they benefit from the knowledge-based domain and their proposed frameworks guarantee the transparency, efficiency and robustness of the proposed models based on deep features.
Most existing automatic kinship verification methods focus on learning the optimal distance metrics between family members. However, learning facial features and kinship features simultaneously may cause the proposed models to be too weak. In this work, we explore the possibility of bridging this gap by developing knowledge-based tensor models based on pre-trained multi-view models. We propose an effective knowledge-based tensor similarity extraction framework for automatic facial kinship verification using four pre-trained networks (i.e., VGG-Face, VGG-F, VGG-M, and VGG-S). Therefore, knowledge-based deep face and general features (such as identity, age, gender, ethnicity, expression, lighting, pose, contour, edges, corners, shape, etc.) were successfully fused by our tensor design to understand the kinship cue. Multiple effective representations are learned for kinship verification statements (children and parents) using a margin maximization learning scheme based on Tensor Cross-view Quadratic Exponential Discriminant Analysis. Through the exponential learning process, the large gap between distributions of the same family can be reduced to the maximum, while the small gap between distributions of different families is simultaneously increased. The WCCN metric successfully reduces the intra-class variability problem caused by deep features. The explanation of black-box models and the problems of ubiquitous face recognition are considered in our system. The extensive experiments on four challenging datasets show that our system performs very well compared to state-of-the-art approaches.
Deep discriminant generation-shared feature learning for image-based kinship verification
2022, Signal Processing: Image Communication
Kinship verification based on facial images in the wild is an interesting and challenging research topic in the fields of computer vision. It has many potential applications, e.g., missing children search, image annotation, etc. In practice, the biggest obstacle in kinship verification is that there usually exists large divergence between the images of parent and children. How to effectively tackle such challenge is important for improving kinship verification performance. To this end, we proposed a novel Deep Discriminant Generation-Shared Feature Learning for Image-Based Kinship Verification (D $^{2}$ GFL) method for kinship verification, which consists of a two-stream generation-specific feature learning module and a generation-shared discrepancy reducing module. The generation-specific feature learning module can learn parent-specific and child-specific features. The generation-shared module aims to reduce the divergence between facial images of parent and child. In order to further relieve the cross-generation difference and improve the discriminability of learned features, we also design a cross-generation difference loss term and an intra-generation discriminant loss term, which simultaneously make use of the local and holistic features, as well as the family identity information. Experimental results on several widely used kinship datasets are presented to validate the effectiveness of our proposed approach by comparing with the state-of-the-art kinship verification methods. Specially, our approach can improve the average verification accuracy at least by 3.5%, 1.5% and 0.8% on KinFaceW-I, Cornell, and TSKinFace datasets, respectively.
Robust discriminative feature subspace analysis for kinship verification
2021, Information Sciences
Citation Excerpt :
Robinson et al. [36] presented Families in the Wild (FIW) as the largest kinship dataset and performed verification using pre-tained CNN models. Other relevant deep learning approaches are toward-young cross-generation model using sparse discriminative metric Loss (SDM-Loss) [37], DCTNet [38], supervised mixed norm autoEncoder (SMNAE) [39], kinship metric learning (KML) with a coupled deep neural network (DNN) model [40], kernelized bi-directional PCA (K-BDPCA) [41], and adversarial convolutional network (AdvKin) [42]. Wang et al. [37] employed SDM-Loss in a towards-young cross-generation model to extract deep features for effective kinship verification.
Kinship verification using single image of a person is a challenging task for real-world applications. In this paper, we propose a novel robust discriminative feature subspace analysis (RDFSA) method to address single sample per person (SSPP) problem in kinship verification. The proposed RDFSA method takes advantages of facial symmetry and patch-based analysis to extract discriminative features for kinship verification. Each face image is firstly divided into two halves about bilateral symmetry axis, and each halved face is then partitioned into equal sized non-overlapping patches. Multiple image-sets are formed by grouping these patches according to their positions at each halved face. Then, an SSPP is formulated as an RDFSA problem and a feature subspace is learned by maximizing inter-class separation and minimizing intra-class variance for different patches in each image-set. For a given test image pair, similarity is computed for each feature subspace and majority voting strategy is employed to determine if a given image pair is kin related or not. Proposed RDFSA method is extensively evaluated on different publicly available kinship datasets to validate kinship accuracy. Experimental results show that RDFSA achieves competitive accuracy on all kinship datasets while performing kinship verification under unconstrained environment.
Stationary wavelet transform features for kinship verification in childhood images
2024, Multimedia Tools and Applications

View all citing articles on Scopus

View full text

Learning deep compact similarity metric for kinship verification from face images

Highlights

Abstract

Introduction

Section snippets

Related work

Our approach

Experiments

Conclusion

Acknowledgment

Cross-cultural perceptions of facial resemblance between kin

J. Vis.

Human ability to detect kinship in strangers’ faces: effects of the degree of relatedness

Proc. R. Soc. Lond. B

Lateralization of kin recognition signals in the human face

J. Vis.

Towards computational models of kinship verification

2010 IEEE International Conference on Image Processing

Kinship verification through transfer learning

IJCAI Proceedings-International Joint Conference on Artificial Intelligence

Kinship verification from facial images under uncontrolled conditions

Proceedings of the 19th ACM International Conference on Multimedia

Neighborhood repulsed metric learning for kinship verification

Pattern Anal. Mach. Intell. IEEE Trans.

Unsupervised metric learning for face identification in TV video

2011 International Conference on Computer Vision

Is that you? Metric learning approaches for face identification

2009 IEEE 12th International Conference on Computer Vision

Large scale metric learning from equivalence constraints

Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on

Deepface: closing the gap to human-level performance in face verification

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition

Pcca: a new approach for distance learning from sparse pairwise constraints

Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on

Cosine similarity metric learning for face verification

Asian Conference on Computer Vision

Fusing robust face region descriptors via multiple metric learning for face recognition in the wild

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition

Deep face recognition

British Machine Vision Conference

Transform-invariant PCA: a unified approach to fully automatic face alignment, representation, and recognition

IEEE Trans. Pattern Anal. Mach. Intell.

Deep nonlinear metric learning with independent subspace analysis for face verification

Proceedings of the 20th ACM international conference on Multimedia

Discriminative multimanifold analysis for face recognition from a single training sample per person

IEEE Trans. Pattern Anal. Mach. Intell.

Learning compact binary face descriptor for face recognition

IEEE Trans. Pattern Anal. Mach. Intell.

Discriminative deep metric learning for face verification in the wild

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition

Deep transfer metric learning

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition

Multiple face tracking and recognition with identity-specific localized metric learning

Pattern Recognit.

Distance metric learning with application to clustering with side-information

Advances in Neural Information Processing Systems

Distance metric learning for large margin nearest neighbor classification

Advances in Neural Information Processing Systems

Information-theoretic metric learning

Proceedings of the 24th International Conference On Machine Learning

Local large-margin multi-metric learning for face and kinship verification

IEEE Trans. Circuits Syst. Video Technol.

Discriminative multimetric learning for kinship verification

IEEE Trans. Inf. Forensics Secur.

Backpropagation applied to handwritten zip code recognition

Neural Comput.

Kinship measurement on salient facial features

IEEE Trans. Instrum. Meas.

Gabor-based gradient orientation pyramid for kinship verification under uncontrolled environments

Proceedings of the 20th ACM International Conference on Multimedia