Skip to main content

Advertisement

A divide-and-unite deep network for person re-identification

  • Published:
Applied Intelligence Aims and scope Submit manuscript

Abstract

Person re-identification (person re-ID) is one of the most challenging tasks in the field of computer vision as it involves large variations in human appearances, human poses, background illuminations, camera views, etc. In recent literature, using part-level features for the person re-ID task provides fine-grained information, and has been proven to be effective. Instead of relying on additional skeleton key points or pose estimation models, this paper proposes a Divide-and-Unite Network to obtain feature embedding end-to-end. We design a deep network guided by image contents, which divides pedestrians into parts and obtains the part features with different contributions. These part features and the global feature are united to obtain the pedestrian descriptor for person re-ID. To summarize, the contributions of this work are two-fold. Firstly, a novel architecture of discriminative descriptor learning is proposed, which is based on the global feature and supplemented by part features. Secondly, a Feature Division Network is constructed to generate the part features with different contributions, where the divided parts maintain the consistency of content between different images. Extensive experiments are conducted on three widely-used benchmarks including Market1501, CUHK03, and DukeMTMC-reID. The results have demonstrated that the proposed model can achieve remarkable performance against numerous state-of-the-arts.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8

Similar content being viewed by others

Explore related subjects

Discover the latest articles, news and stories from top researchers in related subjects.

References

  1. Chen W, Chen X, Zhang J, Huang K (2017) Beyond triplet loss: a deep quadruplet network for person re-identification. In: IEEE Conference on computer vision and pattern recognition(CVPR), pp 1320–1329

  2. Chen W, Chen X, Zhang J, Huang K (2017) A multi-task deep network for person re-identification. In: 31St AAAI conference on artificial intelligence, pp 3988–3994

  3. Chen Y, Zhu X, Gong S (2017) Person re-identification by deep learning multi-scale representations. In: IEEE International conference on computer vision workshop

  4. Deng J, Dong W, Socher R, Li JL, Li K, Li FF (2009) Imagenet: a large-scale hierarchical image database. In: IEEE Conference on computer vision and pattern recognition

  5. Felzenszwalb PF, Mcallester DA, Ramanan D (2008) A discriminatively trained, multiscale, deformable part model. in cvpr. In: IEEE Conference on computer vision and pattern recognition

  6. Gao P, Yuan R, Wang F, Xiao L, Fujita H, Zhang Y (2020) Siamese attentional keypoint network for high performance visual tracking. Knowledge-based systems 193

  7. Gao P, Zhang Q, Wang F, Xiao L, Fujita H, Zhang Y (2020) Learning reinforced attentional representation for end-to-end visual tracking. Inform Sci 517:52–67

    Article  Google Scholar 

  8. Geng M, Wang Y, Xiang T, Tian Y (2016) Deep transfer learning for person reidentification. arXiv:1611.05244

  9. He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE computer society conference on computer vision and pattern recognition, pp 770–778

  10. Hermans A, Beyer L, Leibe B (2017) Defense of the triplet loss for person re-identification. arXiv:1703.07737

  11. Hirzer M (2012) Large scale metric learning from equivalence constraints. In: IEEE Conference on computer vision and pattern recognition(CVPR), pp 2288–2295

  12. Jose C, Fleuret F (2016) Scalable metric learning via weighted approximate rank component analysis. In: European conference on computer vision

  13. Juengling K, Bodensteiner C, Arens M (2010) Person re-identification in multi-camera networks. In: Computer vision and pattern recognition workshops, pp 55–61

  14. Karanam S, Gou M, Ziyan W, Rates-Borras A, Camps O, Radke RJ (2016) A systematic evaluation and benchmark for person re-identification: Features, metrics, and datasets. IEEE Transactions on Pattern Analysis and Machine Intelligence, pp 1–1

  15. Layne R, Hospedales TM, Gong S (2012) Person re-identification by attributes. In: BMVC

  16. Li R, Zhang B, Kang D-J, Teng Z (2019) Deep attention network for person re-identification with multi-loss. Computers & Electrical Engineering 79:106455

    Article  Google Scholar 

  17. Li W, Zhu X, Gong S (2017) Person re-identification by deep joint learning of multi-loss classification. In: IJCAI International joint conference on artificial intelligence, pp 2194–2200

  18. Li W, Zhu X, Gong S (2018) Harmonious attention network for person re-identification. In: The IEEE conference on computer vision and pattern recognition (CVPR)

  19. Liao S, Hu Y, Zhu X, Li SZ (2015) Person re-identification by local maximal occurrence representation and metric learning. In: IEEE Conference on computer vision and pattern recognition (CVPR), pp 2197–2206

  20. Lin W, Shen C, Van Den Hengel A (2016) Personnet: Person re-identification with deep convolutional neural networks. arXiv:1601.07255

  21. Lin Y, Zheng L, Zheng Z, Wu Y, Hu Z, Yan C, Yang Y (2019) Improving person re-identification by attribute and identity learning. Pattern Recognition 95:151–161

    Article  Google Scholar 

  22. Liu H, Feng J, Qi M, Jiang J, Yan S (2017) End-to-end comparative attention networks for person re-identification. IEEE Trans Image Process 26(7):3492–506

    Article  MathSciNet  Google Scholar 

  23. Martinel N, Das A, Micheloni C, Roy-Chowdhury AK (2016) Temporal model adaptation for person re-identification. In: European conference on computer vision

  24. Matsukawa T, Suzuki E (2016) Person re-identification using cnn features learned from combination of attributes. In: 23Rd international conference on pattern recognition (ICPR), pp 2428–2433

  25. Oreifej O, Mehran R, Shah M (2010) Human identity recognition in aerial images. In: IEEE Conference on computer vision and pattern recognition (CVPR), pp 709–716

  26. Ristani E, Solera F, Zou R, Cucchiara R, Tomasi C (2016) Performance measures and a data set for multi-target, multi-camera tracking. In: European conference on computer vision

  27. Schroff F, Kalenichenko D, Philbin J (2015) Facenet: a unified embedding for face recognition and clustering. In: IEEE Conference on computer vision and pattern recognition (CVPR), pp 815–823

  28. Shen C, Qi G-J, Jiang R, Jin Z, Yong H, Chen Y, Hua X-S (2019) Sharp Attention Network via Adaptive Sampling for Person Re-Identification. IEEE Trans Circ Syst Vid Technol 29:3016–3027

    Article  Google Scholar 

  29. Chi S, Li J, Zhang S, Xing J, Gao W, Qi T (2017) Pose-driven deep convolutional model for person re-identification. In: IEEE International conference on computer vision (ICCV), pp 3980–3989, 10

  30. Sun Y, Liang Z, Yi Y, Qi T, Wang S (2018) Beyond part models: Person retrieval with refined part pooling (and a strong convolutional baseline). In: European conference on computer vision

  31. Sun Y, Zheng L, Deng W, Wang S (2017) Svdnet for pedestrian retrieval. In: IEEE International conference on computer vision

  32. Tong X, Li H, Ouyang W, Wang X (2016) Learning deep feature representations with domain guided dropout for person re-identification. In: Computer vision and pattern recognition(CVPR)

  33. Varior RR, Haloi M, Wang G (2016) Gated siamese convolutional neural network architecture for human re-identification. In: Computer vision - ECCV 2016. 14th european conference., pp 791–808

  34. Varior RR, Shuai B, Jiwen L, Dong X, Wang G (2016) A siamese long short-term memory architecture for human re-identification. In: Computer vision - ECCV 2016. 14th european conference, pp 135–153

  35. Wang H, Gong S, Zhu X, Tao X (2016) Human-in-the-loop person re-identification. In: European conference on computer vision

  36. Wang Z, Jiang J, Wu Y, Ye M, Bai X, Satoh S (2020) Learning Sparse and Identity-Preserved Hidden Attributes for Person Re-Identification. IEEE Trans Image process 29(1):2013– 2025

    Article  Google Scholar 

  37. Li W, Rui Z, Tong X, Wang XG (2014) Deepreid: Deep filter pairing neural network for person re-identification. In: Computer vision and pattern recognition

  38. Wei L, Zhang S, Yao H, Gao W, Qi T (2019) Glad: Global-local-alignment descriptor for pedestrian retrieval. IEEE Transactions on Multimedia 21(4):986–999

    Article  Google Scholar 

  39. Weinberger KQ, Saul LK (2009) Distance metric learning for large margin nearest neighbor classification. J Mach Learn Res 10:207–244

    MATH  Google Scholar 

  40. Wen Y, Zhang K, Li Z, Yu Q (2016) A discriminative feature learning approach for deep face recognition. In: European conference on computer vision (ECCV)

  41. Xiao Q, Luo H, Zhang C (2017) Margin sample mining loss: A deep learning based method for person re-identification. arXiv:1710.00478

  42. Jing X, Zhao R, Zhu F, Wang H, Ouyang W (2018) Attention-aware compositional network for person re-identification. arXiv:1805.03344

  43. Yang K, He Z, Zhou Z, Fan N (2020) Siamatt: Siamese attention network for visual tracking. Knowledge-based systems 203

  44. Yang X, Wang M, Tao D (2018) Person re-identification with metric learning using privileged information. IEEE Trans Image Process PP(99):1–1

    MathSciNet  MATH  Google Scholar 

  45. Yao H, Zhang S, Zhang Y, Li J, Qi T (2017) Deep representation learning with part loss for person re-identification. IEEE Trans Image Process PP(99):1–1

    Google Scholar 

  46. Yi D, Lei Z, Li SZ (2014) Deep metric learning for practical person re-identification. Computer Science, pp 34–39

  47. Li Z, Xiang T, Gong S (2016) Learning a discriminative null space for person re-identification. In: Proceedings of the IEEE computer society conference on computer vision and pattern recognition, pp 1239–1248

  48. Zhao H, Tian M, Sun S, Shao J, Yan J, Yi S, Wang X, Tang X (2017) Spindle net: Person re-identification with human body region guided feature decomposition and fusion. In: Computer vision and pattern recognition(CVPR), pp 907–915

  49. Zhao L, Xi L, Zhuang Y, Wang J (2017) Deeply-learned part-aligned representations for person re-identification. In: IEEE International conference on computer vision (ICCV), pp 3239–3248

  50. Zhao R, Ouyang W, Wang X (2013) Unsupervised salience learning for person re-identification. In: IEEE Conference on computer vision and pattern recognition (CVPR), pp 3586– 3593

  51. Zhedong Z, Liang Z, Yi Y (2018) A discriminatively learned cnn embedding for person re-identification. Acm Transactions on Multimedia Computing Communications and Applications 14(1):13:1–13:20

    Google Scholar 

  52. Zheng L, Huang Y, Huchuan L, Yi Y (2019) Pose-invariant embedding for deep person re-identification. IEEE Trans Image Process 28(9):4500–4509

    Article  MathSciNet  Google Scholar 

  53. Zheng L, Shen L, Tian L, Wang S, Wang J, Qi T (2015) Scalable person re-identification: a benchmark. In: IEEE International conference on computer vision

  54. Zheng Z, Zheng L, Yi Y (2017) Pedestrian alignment network for large-scale person re-identification. IEEE Transactions on Circuits and Systems for Video Technology

  55. Zheng Z, Zheng L, Yi Y (2017) Unlabeled samples generated by gan improve the person re-identification baseline in vitro. In: IEEE International conference on computer vision

  56. Zhong Z, Zheng L, Cao D, Li S (2017) Re-ranking person re-identification with k-reciprocal encoding. In: IEEE Conference on computer vision and pattern recognition

  57. Zhong Z, Zheng L, Zheng Z, Li S, Yi Y (2018) Camera style adaptation for person re-identification. In: IEEE Conference on computer vision and pattern recognition

Download references

Acknowledgments

This work was supported by the Fundamental Research Funds for the Central Universities of China (2020YJS040) and the Natural Science Foundation of China (61972027). We gratefully acknowledge the support of NVIDIA Corporation with the donation of the Titan X Pascal GPU used for this research.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Zhu Teng.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Li, R., Zhang, B., Teng, Z. et al. A divide-and-unite deep network for person re-identification. Appl Intell 51, 1479–1491 (2021). https://doi.org/10.1007/s10489-020-01880-4

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10489-020-01880-4

Keywords