Skip to main content

Advertisement

Log in

A deep multi-feature distance metric learning method for pedestrian re-identification

  • 1142T: Multi-view Representation Learning and Understanding
  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Consider the problem that handcrafted features are limited by not being directly applicable to practical problems. Additionally, the deep convolution feature is a high-dimensional feature, and if it is directly used to match the image, it will consume considerable time and memory. Moreover, features from higher levels may be contaminated by dramatic variations in the human pose or background clutter. This paper proposes a method based on deep multi-feature distance metric learning. First, each spatial position and channel are weighted after extracting the deep convolution feature from the last layer of the CNN, and the final aggregation result, that is, the feature of the image, is obtained by sum-pooling. Second, a new method to improve and integrate the convolution feature of the region is proposed. The convolution features are processed by the sliding frame technique, and the low-dimensional eigenvector with dimensions equal to the number of convolution layer channels is obtained. Third, a distance learning algorithm is proposed by cross-view quadratic discriminant analysis metric learning. Finally, the weighted fusion strategy is used to accomplish the collaboration between the handcrafted and deep convolution features. On the Market-1501 and VIPeR datasets, the experimental results show that the rank 1 values of the proposed method on three experimental datasets reach 90.02% and 68.74%, respectively. Under the new classification rules of the CHUK03 dataset, the rank 1 performance of the proposed method reaches 34.2%. The experimental results show that the accuracy of pedestrian re-identification after distance-weighted fusion is higher than that obtained by the separate feature distance metric.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4

Similar content being viewed by others

References

  1. Ahmed E, Jones M, Marks TK (2015) An improved deep learning architecture for person re-identification, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 3908–3916

  2. Bai S, Bai X, Tian Q (2017) Scalable Person Re-identification on Supervised Smoothed Manifold, in 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 3356–3365

  3. Bai S, Tang P, Torr PHS, Latecki LJ (2019) Re-Ranking via Metric Fusion for Object Retrieval and Person Re-Identification, in 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 40–749

  4. Chen C-H, Chen J-C, Lin KW (2018) Viewpoint Invariant Person Re-identification with Pose and Weighted Local Features. In: Sieminski A, Kozierkiewicz A, Nunez M, Ha QT (eds) Modern Approaches for Intelligent Information and Database Systems. Springer International Publishing, Cham, pp 387–396

    Chapter  Google Scholar 

  5. Cheng D, Gong Y, Shi W, Zhang S (2018) Person re-identification by the asymmetric triplet and identification loss function. Multimedia Tools and Applications 77(3):3533–3550 2018/02/01

    Article  Google Scholar 

  6. Dai Z, Chen M, Gu X, Zhu S, Tan P (2018) Batch DropBlock Network for Person Re-identification and Beyond, arXiv: Computer Vision and Pattern Recognition

  7. Ding S, Lin L, Wang G, Chao H (2015) Deep feature learning with relative distance comparison for person re-identification. Pattern Recognition 48(10):2993–3003

    Article  Google Scholar 

  8. Engel C, Baumgartner P, Holzmann M, Nutzel JF (2010) Person Re-Identification by Support Vector Ranking, in British Machine Vision Conference

  9. Fan X, Jiang W, Luo H, Fei M (2019) SphereReID: Deep hypersphere manifold embedding for person re-identification, J Vis Commun Image Represent, 60, 51–58, 2019/04/01/

  10. Farenzena M, Bazzani L, Perina A, Murino V, Cristani M (2010) Person re-identification symmetry-driven accumulation of local features, in Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on, pp. 2360–2367: IEEE

  11. Fu M, Sun S, Chen N, Wang D, Tong X (2019) Deep Fusion Feature Presentations for Nonaligned Person Re-Identification. IEEE Access 7:73253–73261

    Article  Google Scholar 

  12. Gray D, Tao H (2008) Viewpoint Invariant Pedestrian Recognition with an Ensemble of Localized Features, 5302, 262–275

  13. Gray D, Brennan S, Tao H (2007) Evaluating appearance models for recognition, reacquisition, and tracking, in Proc. IEEE International Workshop on Performance Evaluation for Tracking and Surveillance (PETS), 3, 5, 1–7: Citeseer

  14. He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition, in Proceedings of the IEEE conference on computer vision and pattern recognition, 770–778

  15. Hermans A, Beyer L, Leibe B (2017) In Defense of the Triplet Loss for Person Re-Identification, arXiv: Computer Vision and Pattern Recognition

  16. Hirzer M (2012) Large scale metric learning from equivalence constraints, in IEEE Conference on Computer Vision and Pattern Recognition 2288–2295

  17. Johnson DM, Xiong C, Corso JJ (2016) Semi-Supervised Nonlinear Distance Metric Learning via Forests of Max-Margin Cluster Hierarchies. IEEE Transactions on Knowledge & Data Engineering 28(4):1035–1046

    Article  Google Scholar 

  18. Kalantidis Y, Mellina C, Osindero S (2016) Cross-Dimensional Weighting for Aggregated Deep Convolutional Features, European conference on computer vision, 685–701

  19. Kan S, Cen Y, He Z, Zhang Z, Zhang L, Wang Y (2019) Supervised Deep Feature Embedding with Hand Crafted Feature, IEEE Trans Image Process, 1–1

  20. Li W, Zhao R, Xiao T, Wang X (2014) DeepReID: Deep Filter Pairing Neural Network for Person Re-identification, in 2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

  21. Li X, Wang Z, Lu X (2015) Surveillance Video Synopsis via Scaling Down Objects. IEEE Trans Image Process Pub IEEE Signal Process Soc 25(2):740–755

    Article  MathSciNet  Google Scholar 

  22. Liao S, Li SZ (2015) Efficient PSD Constrained Asymmetric Metric Learning for Person Re-Identification, in IEEE International Conference on Computer Vision

  23. Liao S, Hu Y, Zhu X, Li SZ (2015) Person re-identification by Local Maximal Occurrence representation and metric learning, in IEEE Conference on Computer Vision and Pattern Recognition, pp. 2197–2206

  24. Liu H, Feng J, Qi M, Jiang J, Yan S (2016) End-to-end comparative attention networks for person re-identification, arXiv preprint arXiv:1606.04404

  25. Lowe DG (2004) Distinctive Image Features from Scale-Invariant Keypoints. Int J Comput Vis 60(2):91–110

    Article  Google Scholar 

  26. Lu X, Yuan Y, Yan P (2014) Alternatively Constrained Dictionary Learning for Image Superresolution. IEEE Trans Cybern 44(3):366–377

    Article  Google Scholar 

  27. Lu X, Zheng X, Li X (2016) Latent Semantic Minimal Hashing for Image Retrieval, IEEE Trans Image Process, vol. PP, no. 99, pp. 1–1

  28. Luo H, Gu Y, Liao X, Lai S, Jiang W (2019) Bag of Tricks and a Strong Baseline for Deep Person Re-Identification, in computer vision and pattern recognition, 0–0

  29. Ma B, Su Y, Jurie F (2014) Covariance Descriptor based on Bio-inspired Features for Person re-Identification and Face Verification. Image and Vision Comput 32(6):379–390

    Article  Google Scholar 

  30. Moghaddam B, Jebara T, Pentland A (2000) Bayesian face recognition. Pattern Recognition 33(11):1771–1782

    Article  Google Scholar 

  31. Nguyen T-B, Le T-L, Nguyen D-D, Pham D-T (2018) A Reliable Image-to-Video Person Re-identification Based on Feature Fusion, in Intelligent Information and Database Systems, Cham, 433–442: Springer International Publishing

  32. Ojala T (2002) Multiresolution Gray-Scale and Rotation Invariant Texure. Classification with Local Binary Patterns 24(7):971–987

    Google Scholar 

  33. Paisitkriangkrai S, Shen C, Hengel AVD (2015) Learning to rank in person re-identification with metric ensembles, in Comput Vis Pattern Recogn, 1846–1855

  34. Ristani E, Tomasi C (2018) Features for Multi-target Multi-camera Tracking and Re-identification, in 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 6036–6046

  35. Saquib Sarfraz M, Schumann A, Eberle A, Stiefelhagen R (2017) A Pose-Sensitive Embedding for Person Re-Identification with Expanded Cross Neighborhood Re-Ranking, arXiv e-prints, Accessed on: November 01, Available: https://ui.adsabs.harvard.edu/abs/2017arXiv171110378S

  36. Shet V, Khamis S, Kuo CH (2013) Person re-identification using semantic color names and RankBoost," in IEEE Workshop on Applications of Computer Vision

  37. Sun Y, Zheng L, Yang Y, Tian Q, Wang S (2018) Beyond Part Models: Person Retrieval with Refined Part Pooling (and A Strong Convolutional Baseline), in european conference on computer vision, 501–518

  38. Tao D, Guo Y, Yu B, Pang J, Yu Z (2017) Deep Multi-View Feature Learning for Person Re-Identification, IEEE Trans Circuits Syst Video Technol

  39. Wang X, Zheng WS, Li X, Zhang J (2016) Cross-scenario Transfer Person Re-identification. IEEE Transactions on Circuits & Systems for Video Technology 26(8):1447–1460

    Article  Google Scholar 

  40. Wang G, Lai J, Huang P, Xie X (2019) Spatial-Temporal Person Re-identification, in national conference on artificial intelligence, 33, 01, 8933–8940

  41. Wu S, Chen Y-C, Li X, Wu A-C, You J-J, Zheng W-S (2016) An enhanced deep feature representation for person re-identification," in Applications of Computer Vision (WACV), 2016 IEEE Winter Conference on, 1–8: IEEE

  42. Wu L, Shen C, Hengel AVD (2016) Personnet: Person re-identification with deep convolutional neural networks," arXiv preprint arXiv:1601.07255

  43. Yan Y, Zhang Q, Ni B, Zhang W, Xu M, Yang X (2019) Learning Context Graph for Person Search, in computer vision and pattern recognition, pp. 2158–2167

  44. Yi D, Lei Z, Liao S, Li SZ (2014) Deep metric learning for person re-identification, in Pattern Recognition (ICPR), 2014 22nd International Conference on, 34–39: IEEE

  45. Yi D, Lei Z, Li SZ (2014) Deep Metric Learning for Practical Person Re-Identification, arXiv: Computer Vision and Pattern Recognition

  46. Zhao L, Li X, Zhuang Y, Wang J (2017) Deeply-Learned Part-Aligned Representations for Person Re-identification, in international conference on computer vision, 3239–3248

  47. Zhao H et al. (2017) Spindle net: Person re-identification with human body region guided feature decomposition and fusion, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 1077–1085

  48. Zheng L, Wang S, Tian L, He F, Liu Z, Tian Q (2015) Query-adaptive late fusion for image search and person re-identification, in Proceedings of the IEEE conference on computer vision and pattern recognition, 1741–1750

  49. Zheng L, Shen L, Tian L, Wang S, Wang J, Tian Q (2015) Scalable person re-identification: A benchmark, in Proceedings of the IEEE International Conference on Computer Vision, 1116–1124

  50. Zheng L, Zhang H, Sun S, Chandraker M, Yang Y, Tian Q (2017) Person Re-identification in the Wild, in CVPR, vol. 1, p. 2

  51. Zheng L, Huang Y, Lu H, Yang Y (2017) Pose invariant embedding for deep person re-identification, arXiv preprint arXiv:1701.07732

  52. Zhong Z, Zheng L, Kang G, Li S, Yang Y (2017) Random Erasing Data Augmentation, arXiv: Computer Vision and Pattern Recognition

  53. Zhong Z, Zheng L, Cao D, Li S (2017) Re-ranking Person Re-identification with k-reciprocal Encoding, arXiv e-prints, Accessed on: January 01 Available: https://ui.adsabs.harvard.edu/abs/2017arXiv170108398Z

Download references

Acknowledgments

This work was supported in part by the National Natural Science Foundation of China under Grant 61671376 and Grant 61771386 and in part by the Scientific Research Project of Shaanxi Provincial Department of Education under Grant 18JK0556.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Kaiyang Liao.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Deng, X., Liao, K., Zheng, Y. et al. A deep multi-feature distance metric learning method for pedestrian re-identification. Multimed Tools Appl 80, 23113–23131 (2021). https://doi.org/10.1007/s11042-020-10458-8

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-020-10458-8

Keywords

Navigation