Abstract
Handcrafted features are limited because they are not directly applicable to practical problems. Deep convolutional features, in turn, are high-dimensional, so using them directly to match images consumes considerable time and memory. Moreover, features from higher levels may be contaminated by dramatic variations in human pose or by background clutter. This paper proposes a pedestrian re-identification method based on deep multi-feature distance metric learning. First, after the deep convolutional feature is extracted from the last layer of the CNN, each spatial position and each channel is weighted, and the final aggregation result, that is, the feature of the image, is obtained by sum-pooling. Second, a new method to improve and integrate the regional convolutional features is proposed: the convolutional features are processed with a sliding-window technique to obtain a low-dimensional feature vector whose dimension equals the number of channels in the convolutional layer. Third, a distance learning algorithm based on cross-view quadratic discriminant analysis (XQDA) metric learning is proposed. Finally, a weighted fusion strategy is used to combine the handcrafted and deep convolutional features. On the Market-1501 and VIPeR datasets, the proposed method reaches rank-1 accuracies of 90.02% and 68.74%, respectively. Under the new classification rules of the CUHK03 dataset, the rank-1 accuracy of the proposed method reaches 34.2%. The experimental results show that pedestrian re-identification accuracy after weighted distance fusion is higher than that obtained with the distance metric of any single feature.
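As a rough illustration of the first and last steps described in the abstract, the following Python sketch weights a convolutional feature map by spatial position and by channel, sum-pools it into a vector with one entry per channel, and then fuses two distance matrices with a single weight. The specific weight formulas, the function names (`aggregate`, `fuse_distances`, `rank1_accuracy`), and the fusion weight `alpha` are assumptions made for illustration only; the abstract does not specify them, and this is not the authors' implementation.

```python
import math
from typing import List

Matrix = List[List[float]]
Tensor = List[Matrix]  # indexed as [channel][row][col], assumed non-negative


def aggregate(feature_map: Tensor) -> List[float]:
    """Weight each spatial position and channel, then sum-pool to a C-dim vector."""
    channels = len(feature_map)
    height = len(feature_map[0])
    width = len(feature_map[0][0])

    # Spatial weight (assumed form): channel-summed activation at each
    # position, normalised so that all spatial weights sum to 1.
    spatial = [[sum(feature_map[c][i][j] for c in range(channels))
                for j in range(width)] for i in range(height)]
    total = sum(sum(row) for row in spatial) or 1.0
    spatial = [[v / total for v in row] for row in spatial]

    pooled = []
    for c in range(channels):
        # Channel weight (assumed form): channels that fire rarely get a
        # larger weight, 1 + log(1 / fraction of non-zero positions).
        nonzero = sum(1 for i in range(height) for j in range(width)
                      if feature_map[c][i][j] > 0)
        frac = max(nonzero / (height * width), 1e-6)
        channel_weight = 1.0 + math.log(1.0 / frac)
        # Sum-pooling of the spatially weighted activations of this channel.
        s = sum(spatial[i][j] * feature_map[c][i][j]
                for i in range(height) for j in range(width))
        pooled.append(channel_weight * s)
    return pooled


def fuse_distances(d_hand: Matrix, d_deep: Matrix, alpha: float) -> Matrix:
    """Element-wise weighted fusion: alpha * d_hand + (1 - alpha) * d_deep."""
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must lie in [0, 1]")
    return [[alpha * h + (1.0 - alpha) * d for h, d in zip(row_h, row_d)]
            for row_h, row_d in zip(d_hand, d_deep)]


def rank1_accuracy(dist: Matrix, query_ids: List[int],
                   gallery_ids: List[int]) -> float:
    """Fraction of queries whose nearest gallery item has the same identity."""
    hits = 0
    for q, row in enumerate(dist):
        best = min(range(len(row)), key=row.__getitem__)
        if gallery_ids[best] == query_ids[q]:
            hits += 1
    return hits / len(dist)


# Toy data: 2 channels on a 2x2 grid, and 2 queries against 2 gallery images.
toy_map = [[[1.0, 0.0], [0.0, 1.0]],
           [[1.0, 1.0], [1.0, 1.0]]]
descriptor = aggregate(toy_map)

d_hand = [[0.1, 0.9], [0.4, 0.5]]   # hypothetical handcrafted-feature distances
d_deep = [[0.6, 0.5], [0.9, 0.2]]   # hypothetical deep-feature distances
fused = fuse_distances(d_hand, d_deep, alpha=0.5)
ids = [0, 1]
print(descriptor)
print(rank1_accuracy(d_hand, ids, ids),
      rank1_accuracy(d_deep, ids, ids),
      rank1_accuracy(fused, ids, ids))
```

In this toy case each distance matrix ranks one of the two queries incorrectly, while the fused matrix ranks both correctly, which mirrors the abstract's claim in miniature. The sketch assumes the two distance matrices are already on comparable scales; in practice they would likely need normalisation before fusion, and `alpha` would be chosen on validation data.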
Acknowledgments
This work was supported in part by the National Natural Science Foundation of China under Grant 61671376 and Grant 61771386 and in part by the Scientific Research Project of Shaanxi Provincial Department of Education under Grant 18JK0556.
About this article
Cite this article
Deng, X., Liao, K., Zheng, Y. et al. A deep multi-feature distance metric learning method for pedestrian re-identification. Multimed Tools Appl 80, 23113–23131 (2021). https://doi.org/10.1007/s11042-020-10458-8
DOI: https://doi.org/10.1007/s11042-020-10458-8