Elsevier

Neurocomputing

Volume 205, 12 September 2016, Pages 92-105
Neurocomputing

Geometric Preserving Local Fisher Discriminant Analysis for person re-identification

https://doi.org/10.1016/j.neucom.2016.05.003Get rights and content

Highlights

  • A novel metric learning method is proposed for person re-identification.

  • A novel assumption that the re-id data lies on a nonlinear manifold is made.

  • Geometric structure is incorporated with nearest neighbor graph.

  • The problem is solved effectively without complex iteration.

  • Kernel extension of the method is proposed.

Abstract

Recently, Local Fisher Discriminant Analysis (LFDA) has achieved impressive performance in person re-identification. However, the classic LFDA method pays little attention to the intrinsic geometrical structure of the complex person re-identification data. Due to large appearance variance, two images of the same person may be far away from each other in feature space while images of different people may be quite close to each other. The linear topology exploited in LFDA is not sufficient to describe this nonlinear data structure. In this paper, we assume that the data reside on a manifold and propose an effective method termed Geometric Preserving Local Fisher Discriminant Analysis (GeoPLFDA). The method integrates discriminative framework of LFDA with geometric preserving method which approximates local manifold utilizing a nearest neighbor graph. LFDA provides discriminative information by separating different labeled samples and pulling the same labeled samples together. The geometric preserving projection provides local manifold structure of the nonlinear data induced by graph topology. Taking advantage of the complementary between them, the proposed method achieves significant improvement over state-of-the-art approaches. Furthermore, a kernel extension of the GeoPLFDA method is proposed to handle the complex nonlinearity more effectively and to further improve re-identification accuracy. Experiments on the challenging iLIDS, VIPeR, CAVIAR and 3DPeS datasets demonstrate the effectiveness of the proposed method.

Introduction

Person re-identification, which aims at matching people across multiple non-overlapping camera networks, has attracted huge interest over the recent decades [1]. It manages to achieve that when a target disappears from one camera, he/she can be re-identified in another camera deployed far away. It can save a lot of human efforts on exhaustively searching for a target from large amounts of video sequences [2].

In the literature, the methods of re-identification can be divided into two categories: feature extraction and metric learning. Feature based approaches [3], [4] focus on extracting distinctive visual features to represent the human appearance. Metric learning [5], [6], [7] approaches aim at finding an optimal metric that can maximize the distance of samples from different class whilst minimize the distance of samples from the same class. Our approach belongs to the latter category.

Typical metric learning approach such as Large Margin Nearest Neighbor (LMNN) [5], [6] tries to learn a metric that minimizes the distance between each training point and its k nearest similarly labeled neighbors, while maximizing the distance between all differently labeled points. Inspired by LMNN, a bunch of metric learning methods for person re-identification have been proposed, such as ITML [8], RDC [9], PCCA [10], KISSME [11], LFDA [7], [12], [13].1 While these methods could achieve encouraging re-identification performance, they are limited by linearity and prone to overfitting especially in large scale and high dimensional learning scenarios.

Traditional metric learning approaches [5], [6], [7], [8] often assume that data is linearly distributed, which does not hold true in re-identification. Samples in the same class may undergo dramatic appearance variations due to changes in view angle, illumination, background clutter and occlusion [14] (see Fig. 1). Meanwhile, samples of different people may share similar appearance, e.g., people wearing clothes with similar color or similar pattern. Therefore, traditional linear topology is not sufficient to model the re-identification data.

Furthermore, the metric learning methods are prone to overfitting because of the small sample size (SSS) problem in person re-identification, i.e. the number of samples per subject is far less than the dimension of the feature. For instance, the VIPeR dataset [15] only contains two images of each subject, while the dimension of features is usually thousands or higher. In this case, metric learning methods tend to overfit because pair or triplet-based constraints become much easier to satisfy in a high-dimensional space and thereby lead to poor generalization performance. The absence of regularization further deteriorates recognition performance [16]. As for LFDA, the within-class scatter matrix SW cannot be accurately estimated because the number of within-class samples is very limited thus SW often becomes singular. The singularity can easily lead to overfitting.

Motivated by these problems, in this paper, we propose a novel algorithm termed Geometric Preserving Local Fisher Discriminate Analysis (GeoPLFDA) which makes a reasonable assumption that the re-identification data reside on a manifold and each sample corresponds to a point on the manifold. The method exploits local manifold approximation derived by nearest neighbor graph [17]. This graph topology provides better approximation to the real world data structure than linear assumptions. To accommodate with LFDA, the data is then projected into a low dimensional linear subspace following the criterion that geometric information should be well preserved. In other words, nearby points on the manifold are mapped to nearby points in the subspace, and faraway points to faraway points. LFDA is performed to improve the intra-class compactness and inter-class separation. Through a linear weighted technique, the geometric preserving techniques are effectively incorporated into the LFDA scheme. In this way, not only the discriminant information is exploited, the geometrical structure is also effectively preserved. The geometric preserving term can serve as a regularization term thus overfitting is alleviated. What׳s more, the proposed GeoPLFDA incorporates global information from the whole feature data which makes up for the fact that the discriminative margin in LFDA is determined by limited nearby data pairs [18]. In addition, we propose the kernel extension of GeoPLFDA which handles the complicated nonlinear high dimensional data structure more effectively. Experimental results demonstrate the effectiveness of the proposed method.

The main contribution of the proposed method is three-folds:

  • A more faithful representation of the data structure is proposed, which assumes that the data lies on a nonlinear manifold.

  • The proposed method not only exploits discriminant structure utilizing techniques from LFDA, but also effectively incorporates local structure information by constructing the nearest neighbor graph.

  • A closed form solution is achieved through generalized eigenvalue decomposition. Hence, complex iterative optimization schemes are not required.

The rest of the paper is organized as follows: a brief view of related works is presented in Section 2. Section 3 introduces the proposed GeoPLFDA algorithm and its kernel extension. Experimental results on iLIDS, VIPeR, CAVIAR and 3DPeS datasets are presented in Section 4. Finally, the concluding remarks and suggestions for future work are discussed in Section 5.

Section snippets

Person re-identification

Existing person re-identification methods can be roughly divided into two categories.

Feature based approaches focus on designing a feature representation that can be both distinctive and robust to large appearance variations. For instance, Farenzena et al. [3] try to utilize a strategy to extract distinctive and stable features. This strategy is based on the localization of perceptual relevant human parts, driven by asymmetry/ symmetry principles. Color Hexagonal-SIFT and Color Histogram

Proposed method

In this section, we introduce our proposed method in detail with organizing it into three parts. Section 3.1 describes how to model the nonlinear data structure. The GeoPLFDA process is presented in Section 3.2. In Section 3.3, we apply the method in kernel space. The basic flow of the proposed method is depicted in Fig. 2.

Experimental results

In this section, four most challenging and commonly used datasets are adopted for evaluation, namely iLIDS [39], VIPeR [15], CAVIAR [40] and 3DPeS [41]. These datasets possess different characteristics (e.g. outdoor/indoor, large/small variations in view angle, constant/varying image scale, presence/absence of occlusion) and give a faithful representation of real-word challenges for person re-identification. The details of these datasets are listed in Table 1. We compare our algorithm with

Conclusion

We have introduced a novel distance learning algorithm called Geometric Preserving Local Fisher Discriminant Analysis (GeoPLFDA) for person re-identification. To model the complex data structure of person re-identification, we make a reasonable assumption that the data reside on a manifold and each sample corresponds to a point on the manifold. A geometric preserving approach which approximates local manifold utilizing a nearest neighbor graph is integrated with LFDA to complement each other.

Acknowledgment

This work is funded by the Fundamental Research Funds for the Central Universities (K15JB00160).

Jieru Jia received the B.S. degree from Beijing Jiaotong University, Beijing, P.R. China, in 2012. She is currently a Ph.D. candidate in the Institute of Information Science, Beijing Jiaotong University. Her main research interests are in computer vision, pattern recognition and machine learning, in particular focusing on person re-identification.

References (53)

  • K.Q. Weinberger et al.

    Distance metric learning for large margin nearest neighbor classification

    Adv. Neural Inf. Process. Syst.

    (2005)
  • M. Dikmen, E. Akbas, T.S. Huang, et al., Pedestrian recognition with a learned metric, Computer Vision-ACCV 2010, Asian...
  • S. Pedagadi, J. Orwell, S. Velastin, et al., Local fisher discriminant analysis for pedestrian re-identification, in:...
  • J.V. Davis, B. Kulis, P. Jain, et al., Information-theoretic metric learning, in: Proceedings of the 24th International...
  • W.S. Zheng et al.

    Re identification by relative distance comparison

    IEEE Trans. Pattern Anal. Mach. Intell.

    (2013)
  • A. Mignon, F. Jurie, PCCA: a new approach for distance learning from sparse pairwise constraints, in: Proceedings of...
  • M. Kostinger, M. Hirzer, P. Wohlhart, et al., Large scale metric learning from equivalence constraints, in: Proceedings...
  • M. Sugiyama, Local fisher discriminant analysis for supervised dimensionality reduction, in: Proceedings of the 23rd...
  • M. Sugiyama et al.

    Semi-supervised local Fisher discriminant analysis for dimensionality reduction

    Mach. Learn.

    (2010)
  • Shaogang Gong
    (2014)
  • D. Gray, H. Tao, Viewpoint Invariant Pedestrian Recognition with an Ensemble of Localized Features, Computer Vision -...
  • A. Bellet, A. Habrard, M. Sebban, A survey on metric learning for feature vectors and structured data, arXiv preprint...
  • D. Cai, X. He, J. Han, Isometric projection, in: Proceedings of the National Conference on Artificial Intelligence,...
  • H. J. et al.

    Multi-camera handoff for person re-identi cation

    Neurocomputing

    (2016)
  • L. An et al.

    Person re-identification via hypergraph-based matching

    Neurocomputing

    (2015)
  • R. Zhao, W. Ouyang, X. Wang, Learning mid-level filters for person re-identification, in: Proceedings of IEEE...
  • Cited by (17)

    • Person re-identification: A taxonomic survey and the path ahead

      2022, Image and Vision Computing
      Citation Excerpt :

      However, this linear topology exploited in LFDA pays little attention to the intrinsic geometrical structure of the complex PRId data. To handle this, Jia et al. [82] proposed a method called Geometric Preserving Local Fisher Discriminant Analysis (GeoPLFDA). GeoPLFDA integrates a discriminative framework of LFDA with the geometric preserving method.

    • Prediction therapy outcomes of HCV patients treated with interferon/ribavirin

      2018, Biomedical Signal Processing and Control
      Citation Excerpt :

      Having a considerable overlap between positive and negative samples and the problem of excessive dimensionality of the extracted features, make us to use an effective and analytically tractable method for coping with these issues. Fisher discriminant analysis (FDA) [26,27] is a kind of linear discriminant analysis (LDA), which is named after Ronald Fisher. This procedure is applied for specifying or detaching two or more classes of objects by acquiring a linear combination of features.

    • Person re-identification by order-induced metric fusion

      2018, Neurocomputing
      Citation Excerpt :

      For instance, at rank-10, our framework is 12% ahead of [23]. Finally, regarding 3DPeS, we compare against [11],[24] and [32]. The best rank-1 rate was scored by [24], which is drastically improved by the order of 9%.

    • Multiple metric learning with query adaptive weights and multi-task re-weighting for person re-identification

      2017, Computer Vision and Image Understanding
      Citation Excerpt :

      The weaknesses are two folds: First, single metric learning is not robust against the complex nonlinear data structure in person re-id. Images of the same person may be far away from each other due to dramatic appearance variations, while images of different people may be very close to each other, e.g., two different people wearing similar color or pattern (Jia et al., 2016). Second, single metric learning encounters the bottleneck of Small Sample Size (SSS), i.e. the number of training samples is far less than the feature dimension.

    View all citing articles on Scopus

    Jieru Jia received the B.S. degree from Beijing Jiaotong University, Beijing, P.R. China, in 2012. She is currently a Ph.D. candidate in the Institute of Information Science, Beijing Jiaotong University. Her main research interests are in computer vision, pattern recognition and machine learning, in particular focusing on person re-identification.

    Qiuqi Ruan received the B.S. and M.S. degree from Northern Jiaotong University, P.R. China in 1969 and 1981, respectively.

    From January 1987 to May 1990, he was a visiting scholar at the University of Pittsburgh, Pittsburgh, PA, and at the University of Cincinnati, Cincinnati, OH. Subsequently, he has been a Visiting Professor in the U.S. for several times. He is currently a Professor and a Doctorate Supervisor at the Institute of Information Science, Beijing Jiaotong University, Beijing. He is IEEE Beijing Section Chairman. He has authored and co-authored eight books and more than 350 technical papers in the image processing and information science, and holds one invention patent. His main research interests include digital signal processing, computer vision, pattern recognition, and virtual reality.

    Yi Jin received the Ph.D. degree in Signal and Information Processing from the Institute of Information Science, Beijing Jiaotong University, Beijing, P.R. China, in 2010. She is currently an Associate Professor in the School of Computer Science and Information Technology, Beijing Jiaotong University.

    She has been a visiting scholar in School of Electrical and Electronic Engineering, Nanyang Technological University of Singapore (2013–2014). Her research interests include computer vision, pattern recognition, image processing and machine learning.

    View full text