Geometric Preserving Local Fisher Discriminant Analysis for person re-identification

doi:10.1016/j.neucom.2016.05.003

Neurocomputing

Volume 205, 12 September 2016, Pages 92-105

https://doi.org/10.1016/j.neucom.2016.05.003

Highlights

• A novel metric learning method is proposed for person re-identification.
• A novel assumption that the re-id data lies on a nonlinear manifold is made.
• Geometric structure is incorporated with nearest neighbor graph.
• The problem is solved effectively without complex iteration.
• Kernel extension of the method is proposed.

Abstract

Recently, Local Fisher Discriminant Analysis (LFDA) has achieved impressive performance in person re-identification. However, the classic LFDA method pays little attention to the intrinsic geometrical structure of the complex person re-identification data. Due to large appearance variance, two images of the same person may be far apart in feature space, while images of different people may be quite close to each other. The linear topology exploited in LFDA is not sufficient to describe this nonlinear data structure. In this paper, we assume that the data reside on a manifold and propose an effective method termed Geometric Preserving Local Fisher Discriminant Analysis (GeoPLFDA). The method integrates the discriminative framework of LFDA with a geometric preserving method that approximates the local manifold using a nearest neighbor graph. LFDA provides discriminative information by separating samples with different labels and pulling samples with the same label together. The geometric preserving projection provides the local manifold structure of the nonlinear data induced by the graph topology. Taking advantage of the complementarity between the two, the proposed method achieves significant improvement over state-of-the-art approaches. Furthermore, a kernel extension of the GeoPLFDA method is proposed to handle the complex nonlinearity more effectively and to further improve re-identification accuracy. Experiments on the challenging iLIDS, VIPeR, CAVIAR and 3DPeS datasets demonstrate the effectiveness of the proposed method.

Introduction

Person re-identification, which aims at matching people across multiple non-overlapping camera networks, has attracted huge interest in recent decades [1]. The goal is that when a target disappears from one camera, he/she can be re-identified in another camera deployed far away. This can save much of the human effort otherwise spent exhaustively searching for a target in large amounts of video [2].

In the literature, re-identification methods can be divided into two categories: feature extraction and metric learning. Feature based approaches [3], [4] focus on extracting distinctive visual features to represent the human appearance. Metric learning approaches [5], [6], [7] aim at finding an optimal metric that maximizes the distance between samples from different classes while minimizing the distance between samples from the same class. Our approach belongs to the latter category.
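To make the notion of a learned metric concrete, the following minimal sketch shows how a linear projection induces a Mahalanobis-type distance. It is an illustration only: the projection `T`, the dimensions and the random data are hypothetical and are not taken from the paper.

```python
import numpy as np

def mahalanobis_distance(x, y, M):
    """Distance between x and y under a positive semidefinite metric matrix M."""
    diff = x - y
    return float(np.sqrt(diff @ M @ diff))

rng = np.random.default_rng(0)
d, r = 5, 2                        # feature and projected dimensions (illustrative)
T = rng.standard_normal((r, d))    # hypothetical learned projection, not from the paper
M = T.T @ T                        # induced metric, positive semidefinite by construction

x = rng.standard_normal(d)
y = rng.standard_normal(d)

# Measuring with M in the original space equals the Euclidean distance
# between the projected points T @ x and T @ y.
dist_metric = mahalanobis_distance(x, y, M)
dist_projected = float(np.linalg.norm(T @ x - T @ y))
```

Because the two quantities coincide, learning a metric of this form and learning a linear projection are two views of the same problem, which is why projection methods such as LFDA can be treated as metric learning.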

A typical metric learning approach, Large Margin Nearest Neighbor (LMNN) [5], [6], tries to learn a metric that minimizes the distance between each training point and its k nearest similarly labeled neighbors, while maximizing the distance between all differently labeled points. Inspired by LMNN, a number of metric learning methods for person re-identification have been proposed, such as ITML [8], RDC [9], PCCA [10], KISSME [11] and LFDA [7], [12], [13]. While these methods can achieve encouraging re-identification performance, they are limited by linearity and prone to overfitting, especially in large scale and high dimensional learning scenarios.

Traditional metric learning approaches [5], [6], [7], [8] often assume that data is linearly distributed, which does not hold true in re-identification. Samples in the same class may undergo dramatic appearance variations due to changes in view angle, illumination, background clutter and occlusion [14] (see Fig. 1). Meanwhile, samples of different people may share similar appearance, e.g., people wearing clothes with similar color or similar pattern. Therefore, traditional linear topology is not sufficient to model the re-identification data.

Furthermore, metric learning methods are prone to overfitting because of the small sample size (SSS) problem in person re-identification, i.e. the number of samples per subject is far smaller than the feature dimension. For instance, the VIPeR dataset [15] contains only two images of each subject, while the feature dimension is usually in the thousands or higher. In this case, metric learning methods tend to overfit because pair or triplet-based constraints become much easier to satisfy in a high-dimensional space, which leads to poor generalization performance. The absence of regularization further deteriorates recognition performance [16]. As for LFDA, the within-class scatter matrix $S^{W}$ cannot be accurately estimated because the number of within-class samples is very limited, so $S^{W}$ often becomes singular. This singularity can easily lead to overfitting.
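The singularity can be checked numerically. The sketch below is a simplified illustration, assuming the plain (unweighted) within-class scatter of classical Fisher analysis rather than the locality-weighted $S^{W}$ of LFDA; the sizes, the random data and the ridge weight `eps` are hypothetical.

```python
import numpy as np

def within_class_scatter(X, labels):
    """Sum over classes of the scatter of samples around their class mean.

    X has shape (n_samples, n_features); labels has shape (n_samples,).
    """
    d = X.shape[1]
    S_w = np.zeros((d, d))
    for c in np.unique(labels):
        Xc = X[labels == c]
        centered = Xc - Xc.mean(axis=0)
        S_w += centered.T @ centered
    return S_w

rng = np.random.default_rng(0)
n_subjects, imgs_per_subject, dim = 20, 2, 100   # illustrative sizes only
labels = np.repeat(np.arange(n_subjects), imgs_per_subject)
X = rng.standard_normal((n_subjects * imgs_per_subject, dim))

S_w = within_class_scatter(X, labels)
# The rank is bounded by n_samples - n_subjects, far below the dimension,
# so S_w cannot be inverted.
rank = np.linalg.matrix_rank(S_w)

# A simple ridge term (an assumption for illustration, not the paper's
# regularizer) restores full rank.
eps = 1e-3
rank_reg = np.linalg.matrix_rank(S_w + eps * np.eye(dim))
```

With two images per subject, each class contributes at most one independent direction to the scatter, which is why some form of regularization is needed before the matrix can be used in an eigenproblem.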

Motivated by these problems, we propose a novel algorithm termed Geometric Preserving Local Fisher Discriminant Analysis (GeoPLFDA), which makes the reasonable assumption that the re-identification data reside on a manifold and each sample corresponds to a point on the manifold. The method exploits a local manifold approximation derived from a nearest neighbor graph [17]. This graph topology provides a better approximation to the real world data structure than linear assumptions. To be compatible with LFDA, the data is then projected into a low dimensional linear subspace following the criterion that geometric information should be well preserved. In other words, nearby points on the manifold are mapped to nearby points in the subspace, and faraway points to faraway points. LFDA is performed to improve the intra-class compactness and inter-class separation. Through a linear weighting technique, the geometric preserving term is effectively incorporated into the LFDA scheme. In this way, the discriminant information is exploited and the geometrical structure is also effectively preserved. The geometric preserving term can serve as a regularization term, so overfitting is alleviated. Moreover, the proposed GeoPLFDA incorporates global information from the whole feature data, which compensates for the fact that the discriminative margin in LFDA is determined by a limited number of nearby data pairs [18]. In addition, we propose a kernel extension of GeoPLFDA, which handles the complicated nonlinear high dimensional data structure more effectively. Experimental results demonstrate the effectiveness of the proposed method.
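The idea of approximating a manifold with a nearest neighbor graph can be sketched as follows. This is a generic Isomap-style illustration, not the paper's exact construction: the function names, the choice of `k`, the symmetrization and the toy arc data are all assumptions, since the excerpt does not specify how the graph is built or weighted.

```python
import numpy as np

def knn_graph(X, k):
    """Dense weighted k-nearest-neighbor graph; np.inf marks a missing edge."""
    n = X.shape[0]
    diff = X[:, None, :] - X[None, :, :]
    dist = np.sqrt((diff ** 2).sum(axis=-1))
    graph = np.full((n, n), np.inf)
    np.fill_diagonal(graph, 0.0)
    # Column 0 of the sorted order is the point itself (assumes distinct points).
    neighbors = np.argsort(dist, axis=1)[:, 1:k + 1]
    rows = np.repeat(np.arange(n), k)
    cols = neighbors.ravel()
    graph[rows, cols] = dist[rows, cols]
    graph[cols, rows] = dist[cols, rows]   # symmetrize the edge set
    return graph

def geodesic_distances(graph):
    """All-pairs shortest path lengths on the graph (Floyd-Warshall)."""
    D = graph.copy()
    for m in range(D.shape[0]):
        D = np.minimum(D, D[:, m:m + 1] + D[m:m + 1, :])
    return D

# Toy data: 30 points on a half circle, a one-dimensional manifold in 2-D.
theta = np.linspace(0.0, np.pi, 30)
X = np.column_stack([np.cos(theta), np.sin(theta)])

geo = geodesic_distances(knn_graph(X, k=2))
geo_end = geo[0, -1]                              # distance along the arc
euclid_end = float(np.linalg.norm(X[0] - X[-1]))  # straight-line distance
```

On this toy arc the graph distance between the two endpoints approaches the arc length, while the straight-line distance cuts across the circle. A projection that preserves graph distances therefore keeps faraway points on the manifold far apart, which a purely linear view of the data would not.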

The main contributions of the proposed method are threefold:

• A more faithful representation of the data structure is proposed, which assumes that the data lies on a nonlinear manifold.
• The proposed method not only exploits discriminant structure utilizing techniques from LFDA, but also effectively incorporates local structure information by constructing the nearest neighbor graph.
• A closed form solution is achieved through generalized eigenvalue decomposition. Hence, complex iterative optimization schemes are not required.
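The closed form solution mentioned in the last contribution can be illustrated with a generic regularized Fisher-type criterion. The sketch below is not the paper's objective: the matrices `S_between`, `S_within` and `R_geometry`, the weight `alpha` and the small ridge term are hypothetical stand-ins, used only to show how a linearly weighted combination leads to a single generalized eigenproblem.

```python
import numpy as np

def generalized_eigh(A, B):
    """Solve A v = lam B v for symmetric A and symmetric positive definite B.

    Returns eigenvalues in descending order and matching eigenvectors as columns.
    """
    L = np.linalg.cholesky(B)          # B = L L^T
    L_inv = np.linalg.inv(L)
    C = L_inv @ A @ L_inv.T            # equivalent standard problem C u = lam u
    lam, U = np.linalg.eigh(C)         # eigenvalues in ascending order
    V = L_inv.T @ U                    # map back: v = L^{-T} u
    order = np.argsort(lam)[::-1]
    return lam[order], V[:, order]

rng = np.random.default_rng(0)
d, r = 6, 2                            # feature and target dimensions (illustrative)

# Hypothetical stand-ins for a between-class scatter, a within-class scatter
# and a geometry-preserving term; random symmetric PSD matrices.
G = rng.standard_normal((d, d))
H = rng.standard_normal((d, d))
K = rng.standard_normal((d, d))
S_between = G @ G.T
S_within = H @ H.T
R_geometry = K @ K.T

alpha = 0.5                            # hypothetical trade-off weight
B = (1 - alpha) * S_within + alpha * R_geometry + 1e-6 * np.eye(d)

lam, V = generalized_eigh(S_between, B)
T = V[:, :r]                           # projection from the top-r eigenvectors

# Relative residual of the generalized eigen-equation, for checking.
residual = np.linalg.norm(S_between @ V - B @ V @ np.diag(lam))
relative_residual = residual / np.linalg.norm(S_between)
```

The projection is read off from the leading eigenvectors in a single decomposition, so no iterative optimization is involved. In practice `scipy.linalg.eigh(A, B)` solves the same problem directly; the Cholesky reduction is used here only to keep the sketch dependent on NumPy alone.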

The rest of the paper is organized as follows: a brief review of related work is presented in Section 2. Section 3 introduces the proposed GeoPLFDA algorithm and its kernel extension. Experimental results on the iLIDS, VIPeR, CAVIAR and 3DPeS datasets are presented in Section 4. Finally, concluding remarks and suggestions for future work are discussed in Section 5.

Section snippets

Person re-identification

Existing person re-identification methods can be roughly divided into two categories.

Feature based approaches focus on designing a feature representation that is both distinctive and robust to large appearance variations. For instance, Farenzena et al. [3] propose a strategy to extract distinctive and stable features. This strategy is based on the localization of perceptually relevant human parts, driven by asymmetry/symmetry principles. Color Hexagonal-SIFT and Color Histogram

Proposed method

In this section, we introduce our proposed method in detail, organized into three parts. Section 3.1 describes how to model the nonlinear data structure. The GeoPLFDA process is presented in Section 3.2. In Section 3.3, we apply the method in kernel space. The basic flow of the proposed method is depicted in Fig. 2.

Experimental results

In this section, four challenging and commonly used datasets are adopted for evaluation, namely iLIDS [39], VIPeR [15], CAVIAR [40] and 3DPeS [41]. These datasets possess different characteristics (e.g. outdoor/indoor, large/small variations in view angle, constant/varying image scale, presence/absence of occlusion) and give a faithful representation of real-world challenges for person re-identification. The details of these datasets are listed in Table 1. We compare our algorithm with

Conclusion

We have introduced a novel distance learning algorithm called Geometric Preserving Local Fisher Discriminant Analysis (GeoPLFDA) for person re-identification. To model the complex data structure of person re-identification, we make the reasonable assumption that the data reside on a manifold and each sample corresponds to a point on the manifold. A geometric preserving approach, which approximates the local manifold using a nearest neighbor graph, is integrated with LFDA so that the two complement each other.

Acknowledgment

This work is funded by the Fundamental Research Funds for the Central Universities (K15JB00160).

Jieru Jia received the B.S. degree from Beijing Jiaotong University, Beijing, P.R. China, in 2012. She is currently a Ph.D. candidate in the Institute of Information Science, Beijing Jiaotong University. Her main research interests are in computer vision, pattern recognition and machine learning, in particular focusing on person re-identification.

References (53)

A. Bedagkar-Gala et al., A survey of approaches and trends in person re-identification, Image Vis. Comput. (2014)
H. Huang et al., Complete local Fisher discriminant analysis with Laplacian score ranking for face recognition, Neurocomputing (2012)
H. Liu et al., Set-label modeling and deep metric learning on person re-identification, Neurocomputing (2015)
C. Liu et al., On-the-fly feature importance mining for person re-identification, Pattern Recognit. (2014)
Z. Liu et al., Enhancing person re-identification by integrating gait biometric, Neurocomputing (2015)
S.C. Shi et al., Person re-identification with multi-level adaptive correspondence models, Neurocomputing (2015)
Z. Wang et al., Facial expression recognition using sparse local Fisher discriminant analysis, Neurocomputing (2016)
R. Zhao, W. Ouyang, X. Wang, Person re-identification by salience matching, in: Proceedings of IEEE International...
M. Farenzena, L. Bazzani, A. Perina, et al., Person re-identification by symmetry-driven accumulation of local...
N. Gheissari, T.B. Sebastian, R. Hartley, Person reidentification using spatiotemporal appearance, in: Proceedings of...
K.Q. Weinberger et al., Distance metric learning for large margin nearest neighbor classification, Adv. Neural Inf. Process. Syst. (2005)
M. Dikmen, E. Akbas, T.S. Huang, et al., Pedestrian recognition with a learned metric, Computer Vision-ACCV 2010, Asian...
S. Pedagadi, J. Orwell, S. Velastin, et al., Local fisher discriminant analysis for pedestrian re-identification, in:...
J.V. Davis, B. Kulis, P. Jain, et al., Information-theoretic metric learning, in: Proceedings of the 24th International...
W.S. Zheng et al., Reidentification by relative distance comparison, IEEE Trans. Pattern Anal. Mach. Intell. (2013)
A. Mignon, F. Jurie, PCCA: a new approach for distance learning from sparse pairwise constraints, in: Proceedings of...
M. Kostinger, M. Hirzer, P. Wohlhart, et al., Large scale metric learning from equivalence constraints, in: Proceedings...
M. Sugiyama, Local fisher discriminant analysis for supervised dimensionality reduction, in: Proceedings of the 23rd...
M. Sugiyama et al., Semi-supervised local Fisher discriminant analysis for dimensionality reduction, Mach. Learn. (2010)
Shaogang Gong (2014)
D. Gray, H. Tao, Viewpoint Invariant Pedestrian Recognition with an Ensemble of Localized Features, Computer Vision -...
A. Bellet, A. Habrard, M. Sebban, A survey on metric learning for feature vectors and structured data, arXiv preprint...
D. Cai, X. He, J. Han, Isometric projection, in: Proceedings of the National Conference on Artificial Intelligence,...
H. J. et al., Multi-camera handoff for person re-identification, Neurocomputing (2016)
L. An et al., Person re-identification via hypergraph-based matching, Neurocomputing (2015)
R. Zhao, W. Ouyang, X. Wang, Learning mid-level filters for person re-identification, in: Proceedings of IEEE...


Qiuqi Ruan received the B.S. and M.S. degrees from Northern Jiaotong University, P.R. China in 1969 and 1981, respectively.

From January 1987 to May 1990, he was a visiting scholar at the University of Pittsburgh, Pittsburgh, PA, and at the University of Cincinnati, Cincinnati, OH. Subsequently, he has been a Visiting Professor in the U.S. several times. He is currently a Professor and a Doctorate Supervisor at the Institute of Information Science, Beijing Jiaotong University, Beijing. He is IEEE Beijing Section Chairman. He has authored and co-authored eight books and more than 350 technical papers in image processing and information science, and holds one invention patent. His main research interests include digital signal processing, computer vision, pattern recognition, and virtual reality.

Yi Jin received the Ph.D. degree in Signal and Information Processing from the Institute of Information Science, Beijing Jiaotong University, Beijing, P.R. China, in 2010. She is currently an Associate Professor in the School of Computer Science and Information Technology, Beijing Jiaotong University.

She has been a visiting scholar in School of Electrical and Electronic Engineering, Nanyang Technological University of Singapore (2013–2014). Her research interests include computer vision, pattern recognition, image processing and machine learning.
