A divide-and-unite deep network for person re-identification

Li, Rui; Zhang, Baopeng; Teng, Zhu; Fan, Jianping

doi:10.1007/s10489-020-01880-4

A divide-and-unite deep network for person re-identification

Published: 28 September 2020

Volume 51, pages 1479–1491, (2021)
Cite this article

Applied Intelligence Aims and scope Submit manuscript

Rui Li¹,
Baopeng Zhang¹,
Zhu Teng¹ &
…
Jianping Fan²

622 Accesses
18 Citations
Explore all metrics

Abstract

Person re-identification (person re-ID) is one of the most challenging tasks in the field of computer vision as it involves large variations in human appearances, human poses, background illuminations, camera views, etc. In recent literature, using part-level features for the person re-ID task provides fine-grained information, and has been proven to be effective. Instead of relying on additional skeleton key points or pose estimation models, this paper proposes a Divide-and-Unite Network to obtain feature embedding end-to-end. We design a deep network guided by image contents, which divides pedestrians into parts and obtains the part features with different contributions. These part features and the global feature are united to obtain the pedestrian descriptor for person re-ID. To summarize, the contributions of this work are two-fold. Firstly, a novel architecture of discriminative descriptor learning is proposed, which is based on the global feature and supplemented by part features. Secondly, a Feature Division Network is constructed to generate the part features with different contributions, where the divided parts maintain the consistency of content between different images. Extensive experiments are conducted on three widely-used benchmarks including Market1501, CUHK03, and DukeMTMC-reID. The results have demonstrated that the proposed model can achieve remarkable performance against numerous state-of-the-arts.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Object detection using YOLO: challenges, architectural successors, datasets and applications

Article 08 August 2022

SSD: Single Shot MultiBox Detector

YOLO-based Object Detection Models: A Review and its Applications

Article 14 March 2024

References

Chen W, Chen X, Zhang J, Huang K (2017) Beyond triplet loss: a deep quadruplet network for person re-identification. In: IEEE Conference on computer vision and pattern recognition(CVPR), pp 1320–1329
Chen W, Chen X, Zhang J, Huang K (2017) A multi-task deep network for person re-identification. In: 31St AAAI conference on artificial intelligence, pp 3988–3994
Chen Y, Zhu X, Gong S (2017) Person re-identification by deep learning multi-scale representations. In: IEEE International conference on computer vision workshop
Deng J, Dong W, Socher R, Li JL, Li K, Li FF (2009) Imagenet: a large-scale hierarchical image database. In: IEEE Conference on computer vision and pattern recognition
Felzenszwalb PF, Mcallester DA, Ramanan D (2008) A discriminatively trained, multiscale, deformable part model. in cvpr. In: IEEE Conference on computer vision and pattern recognition
Gao P, Yuan R, Wang F, Xiao L, Fujita H, Zhang Y (2020) Siamese attentional keypoint network for high performance visual tracking. Knowledge-based systems 193
Gao P, Zhang Q, Wang F, Xiao L, Fujita H, Zhang Y (2020) Learning reinforced attentional representation for end-to-end visual tracking. Inform Sci 517:52–67
Article Google Scholar
Geng M, Wang Y, Xiang T, Tian Y (2016) Deep transfer learning for person reidentification. arXiv:1611.05244
He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE computer society conference on computer vision and pattern recognition, pp 770–778
Hermans A, Beyer L, Leibe B (2017) Defense of the triplet loss for person re-identification. arXiv:1703.07737
Hirzer M (2012) Large scale metric learning from equivalence constraints. In: IEEE Conference on computer vision and pattern recognition(CVPR), pp 2288–2295
Jose C, Fleuret F (2016) Scalable metric learning via weighted approximate rank component analysis. In: European conference on computer vision
Juengling K, Bodensteiner C, Arens M (2010) Person re-identification in multi-camera networks. In: Computer vision and pattern recognition workshops, pp 55–61
Karanam S, Gou M, Ziyan W, Rates-Borras A, Camps O, Radke RJ (2016) A systematic evaluation and benchmark for person re-identification: Features, metrics, and datasets. IEEE Transactions on Pattern Analysis and Machine Intelligence, pp 1–1
Layne R, Hospedales TM, Gong S (2012) Person re-identification by attributes. In: BMVC
Li R, Zhang B, Kang D-J, Teng Z (2019) Deep attention network for person re-identification with multi-loss. Computers & Electrical Engineering 79:106455
Article Google Scholar
Li W, Zhu X, Gong S (2017) Person re-identification by deep joint learning of multi-loss classification. In: IJCAI International joint conference on artificial intelligence, pp 2194–2200
Li W, Zhu X, Gong S (2018) Harmonious attention network for person re-identification. In: The IEEE conference on computer vision and pattern recognition (CVPR)
Liao S, Hu Y, Zhu X, Li SZ (2015) Person re-identification by local maximal occurrence representation and metric learning. In: IEEE Conference on computer vision and pattern recognition (CVPR), pp 2197–2206
Lin W, Shen C, Van Den Hengel A (2016) Personnet: Person re-identification with deep convolutional neural networks. arXiv:1601.07255
Lin Y, Zheng L, Zheng Z, Wu Y, Hu Z, Yan C, Yang Y (2019) Improving person re-identification by attribute and identity learning. Pattern Recognition 95:151–161
Article Google Scholar
Liu H, Feng J, Qi M, Jiang J, Yan S (2017) End-to-end comparative attention networks for person re-identification. IEEE Trans Image Process 26(7):3492–506
Article MathSciNet Google Scholar
Martinel N, Das A, Micheloni C, Roy-Chowdhury AK (2016) Temporal model adaptation for person re-identification. In: European conference on computer vision
Matsukawa T, Suzuki E (2016) Person re-identification using cnn features learned from combination of attributes. In: 23Rd international conference on pattern recognition (ICPR), pp 2428–2433
Oreifej O, Mehran R, Shah M (2010) Human identity recognition in aerial images. In: IEEE Conference on computer vision and pattern recognition (CVPR), pp 709–716
Ristani E, Solera F, Zou R, Cucchiara R, Tomasi C (2016) Performance measures and a data set for multi-target, multi-camera tracking. In: European conference on computer vision
Schroff F, Kalenichenko D, Philbin J (2015) Facenet: a unified embedding for face recognition and clustering. In: IEEE Conference on computer vision and pattern recognition (CVPR), pp 815–823
Shen C, Qi G-J, Jiang R, Jin Z, Yong H, Chen Y, Hua X-S (2019) Sharp Attention Network via Adaptive Sampling for Person Re-Identification. IEEE Trans Circ Syst Vid Technol 29:3016–3027
Article Google Scholar
Chi S, Li J, Zhang S, Xing J, Gao W, Qi T (2017) Pose-driven deep convolutional model for person re-identification. In: IEEE International conference on computer vision (ICCV), pp 3980–3989, 10
Sun Y, Liang Z, Yi Y, Qi T, Wang S (2018) Beyond part models: Person retrieval with refined part pooling (and a strong convolutional baseline). In: European conference on computer vision
Sun Y, Zheng L, Deng W, Wang S (2017) Svdnet for pedestrian retrieval. In: IEEE International conference on computer vision
Tong X, Li H, Ouyang W, Wang X (2016) Learning deep feature representations with domain guided dropout for person re-identification. In: Computer vision and pattern recognition(CVPR)
Varior RR, Haloi M, Wang G (2016) Gated siamese convolutional neural network architecture for human re-identification. In: Computer vision - ECCV 2016. 14th european conference., pp 791–808
Varior RR, Shuai B, Jiwen L, Dong X, Wang G (2016) A siamese long short-term memory architecture for human re-identification. In: Computer vision - ECCV 2016. 14th european conference, pp 135–153
Wang H, Gong S, Zhu X, Tao X (2016) Human-in-the-loop person re-identification. In: European conference on computer vision
Wang Z, Jiang J, Wu Y, Ye M, Bai X, Satoh S (2020) Learning Sparse and Identity-Preserved Hidden Attributes for Person Re-Identification. IEEE Trans Image process 29(1):2013– 2025
Article Google Scholar
Li W, Rui Z, Tong X, Wang XG (2014) Deepreid: Deep filter pairing neural network for person re-identification. In: Computer vision and pattern recognition
Wei L, Zhang S, Yao H, Gao W, Qi T (2019) Glad: Global-local-alignment descriptor for pedestrian retrieval. IEEE Transactions on Multimedia 21(4):986–999
Article Google Scholar
Weinberger KQ, Saul LK (2009) Distance metric learning for large margin nearest neighbor classification. J Mach Learn Res 10:207–244
MATH Google Scholar
Wen Y, Zhang K, Li Z, Yu Q (2016) A discriminative feature learning approach for deep face recognition. In: European conference on computer vision (ECCV)
Xiao Q, Luo H, Zhang C (2017) Margin sample mining loss: A deep learning based method for person re-identification. arXiv:1710.00478
Jing X, Zhao R, Zhu F, Wang H, Ouyang W (2018) Attention-aware compositional network for person re-identification. arXiv:1805.03344
Yang K, He Z, Zhou Z, Fan N (2020) Siamatt: Siamese attention network for visual tracking. Knowledge-based systems 203
Yang X, Wang M, Tao D (2018) Person re-identification with metric learning using privileged information. IEEE Trans Image Process PP(99):1–1
MathSciNet MATH Google Scholar
Yao H, Zhang S, Zhang Y, Li J, Qi T (2017) Deep representation learning with part loss for person re-identification. IEEE Trans Image Process PP(99):1–1
Google Scholar
Yi D, Lei Z, Li SZ (2014) Deep metric learning for practical person re-identification. Computer Science, pp 34–39
Li Z, Xiang T, Gong S (2016) Learning a discriminative null space for person re-identification. In: Proceedings of the IEEE computer society conference on computer vision and pattern recognition, pp 1239–1248
Zhao H, Tian M, Sun S, Shao J, Yan J, Yi S, Wang X, Tang X (2017) Spindle net: Person re-identification with human body region guided feature decomposition and fusion. In: Computer vision and pattern recognition(CVPR), pp 907–915
Zhao L, Xi L, Zhuang Y, Wang J (2017) Deeply-learned part-aligned representations for person re-identification. In: IEEE International conference on computer vision (ICCV), pp 3239–3248
Zhao R, Ouyang W, Wang X (2013) Unsupervised salience learning for person re-identification. In: IEEE Conference on computer vision and pattern recognition (CVPR), pp 3586– 3593
Zhedong Z, Liang Z, Yi Y (2018) A discriminatively learned cnn embedding for person re-identification. Acm Transactions on Multimedia Computing Communications and Applications 14(1):13:1–13:20
Google Scholar
Zheng L, Huang Y, Huchuan L, Yi Y (2019) Pose-invariant embedding for deep person re-identification. IEEE Trans Image Process 28(9):4500–4509
Article MathSciNet Google Scholar
Zheng L, Shen L, Tian L, Wang S, Wang J, Qi T (2015) Scalable person re-identification: a benchmark. In: IEEE International conference on computer vision
Zheng Z, Zheng L, Yi Y (2017) Pedestrian alignment network for large-scale person re-identification. IEEE Transactions on Circuits and Systems for Video Technology
Zheng Z, Zheng L, Yi Y (2017) Unlabeled samples generated by gan improve the person re-identification baseline in vitro. In: IEEE International conference on computer vision
Zhong Z, Zheng L, Cao D, Li S (2017) Re-ranking person re-identification with k-reciprocal encoding. In: IEEE Conference on computer vision and pattern recognition
Zhong Z, Zheng L, Zheng Z, Li S, Yi Y (2018) Camera style adaptation for person re-identification. In: IEEE Conference on computer vision and pattern recognition

Download references

Acknowledgments

This work was supported by the Fundamental Research Funds for the Central Universities of China (2020YJS040) and the Natural Science Foundation of China (61972027). We gratefully acknowledge the support of NVIDIA Corporation with the donation of the Titan X Pascal GPU used for this research.

Author information

Authors and Affiliations

School of Computer and Information Technology, Beijing Jiaotong University, Beijing, China
Rui Li, Baopeng Zhang & Zhu Teng
AI Lab, Lenovo Research, Beijing, China
Jianping Fan

Authors

Rui Li
View author publications
You can also search for this author in PubMed Google Scholar
Baopeng Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Zhu Teng
View author publications
You can also search for this author in PubMed Google Scholar
Jianping Fan
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Zhu Teng.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Li, R., Zhang, B., Teng, Z. et al. A divide-and-unite deep network for person re-identification. Appl Intell 51, 1479–1491 (2021). https://doi.org/10.1007/s10489-020-01880-4

Download citation

Published: 28 September 2020
Issue Date: March 2021
DOI: https://doi.org/10.1007/s10489-020-01880-4

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A divide-and-unite deep network for person re-identification

Abstract

Access this article

Similar content being viewed by others

Object detection using YOLO: challenges, architectural successors, datasets and applications

SSD: Single Shot MultiBox Detector

YOLO-based Object Detection Models: A Review and its Applications

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

A divide-and-unite deep network for person re-identification

Abstract

Access this article

Similar content being viewed by others

Object detection using YOLO: challenges, architectural successors, datasets and applications

SSD: Single Shot MultiBox Detector

YOLO-based Object Detection Models: A Review and its Applications

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation