Skip to main content
Log in

Joint attribute soft-sharing and contextual local: a multi-level features learning network for person re-identification

  • Original article
  • Published:
The Visual Computer Aims and scope Submit manuscript

Abstract

In person re-identification (re-id), the key to retrieving the correct person image is to extract discriminative features. The features at different levels are considered complementary. In this work, we design a person re-id learning network that can extract mutually multi-level features called ASCLNet. ASCLNet contains three feature branches, and each branch can extract mutually different levels of features. Furthermore, we propose two novel modules and apply them to learning local and attribute features in ASCLNet. One is the contextual local module, which can learn the local feature with context information from the local body part; the other is the attribute soft-sharing module, which enables shared feature representation among attributes. With the support of these two modules, ASCLNet can extract multi-level features that are more discriminative. Moreover, experimental results show that ASCLNet achieves excellent performances on Market-1501 and DukeMTMC-reID datasets with mAP of 88.85% and 80.18%, respectively.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11

Similar content being viewed by others

Data availability

The datasets used in this paper are public datasets.

References

  1. Bedagkar-Gala, A., Shah, S.K.: A survey of approaches and trends in person re-identification. Image Vis. Comput. 32(4), 270–286 (2014)

    Article  Google Scholar 

  2. Luo, H., Gu, Y.Z., Liao, X.Y., Lai, S.Q., Jiang, W.: Bag of tricks and a strong baseline for deep person re-identification. In: 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW 2019), pp. 1487–1495 (2019)

  3. Zhang, L., Jiang, N., Diao, Q., Zhou, Z., Wu, W.: Person re-identification with pose variation aware data augmentation. Neural Comput. Appl. 34(14), 11817–11830 (2022). https://doi.org/10.1007/s00521-022-07071-1

    Article  Google Scholar 

  4. Ye, M., Shen, J.B., Lin, G.J., Xiang, T., Shao, L., Hoi, S.C.H.: Deep learning for person re-identification: a survey and outlook. IEEE Trans. Pattern Anal. Mach. Intell. 44(6), 2872–2893 (2022)

    Article  Google Scholar 

  5. Yang, J., Zhang, C., Tang, Y., Li, Z.: PAFM: pose-drive attention fusion mechanism for occluded person re-identification. Neural Comput. Appl. 34(10), 8241–8252 (2022). https://doi.org/10.1007/s00521-022-06903-4

    Article  Google Scholar 

  6. Wang, X., Zheng, S., Yang, R., Zheng, A., Chen, Z., Tang, J., Luo, B.: Pedestrian attribute recognition: a survey. Pattern Recogn. 121, 108220 (2022)

    Article  Google Scholar 

  7. Kostinger, M., Hirzer, M., Wohlhart, P., Roth, P.M., Bischof, H.: Large scale metric learning from equivalence constraints. In: 2012 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2288–2295 (2012)

  8. Liao, S.C., Hu, Y., Zhu, X., Li, S.Z.: Person re-identification by local maximal occurrence representation and metric learning. In: 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2197–2206 (2015)

  9. Fan, X., Jiang, W., Luo, H., Fei, M.: Spherereld: deep hypersphere manifold embedding for person re-identification. J. Vis. Commun. Image Represent. 60, 51–58 (2019)

    Article  Google Scholar 

  10. Qian, X., Fu, Y., Jiang, Y.-G., Xiang, T., Xue, X.: Multi-scale deep learning architectures for person re-identification. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 5399–5408 (2017)

  11. Wang, F.Q., Zuo, W.M., Lin, L., Zhang, D., Zhang, L.: Joint learning of single-image and cross-image representations for person re-identification. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1288–1296 (2016)

  12. Yang, F., Yan, K., Lu, S., Jia, H., Xie, X., Gao, W.: Attention driven person re-identification. Pattern Recogn. 86, 143–155 (2019)

    Article  Google Scholar 

  13. Xie, J., Ge, Y., Zhang, J., Huang, S., Chen, F., Wang, H.: Low-resolution assisted three-stream network for person re-identification. Vis. Comput. (2021). https://doi.org/10.1007/s00371-021-02127-0

    Article  Google Scholar 

  14. Ye, M., Shen, J.B., Lin, G.J., Xiang, T., Shao, L., Hoi, S.C.H.: Deep learning for person re-identification: a survey and outlook. IEEE Trans. Pattern Anal. Mach. Intell. 44(6), 2872–2893 (2022)

    Article  Google Scholar 

  15. Zhao, H.Y., Tian, M.Q., Sun, S.Y., Shao, J., Yan, J.J., Yi, S., Wang, X.G., Tang, X.O.: Spindle net: Person re-identification with human body region guided feature decomposition and fusion. In: 30th IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2017), pp. 907–915 (2017)

  16. Zheng, L., Huang, Y.J., Lu, H.C., Yang, Y.: Pose-invariant embedding for deep person re-identification. IEEE Trans. Image Process. 28(9), 4500–4509 (2019)

    Article  MathSciNet  Google Scholar 

  17. Sun, Y.F., Zheng, L., Yang, Y., Tian, Q., Wang, S.J.: Beyond part models: person retrieval with refined part pooling (and a strong convolutional baseline). In: Computer Vision—ECCV 2018, Pt Iv 11208, pp. 501–518 (2018)

  18. Wang, G.S., Yuan, Y.F., Chen, X., Li, J.W., Zhou, X.: Learning discriminative features with multiple granularities for person re-identification. In: Proceedings of the 2018 ACM Multimedia Conference (Mm’18), pp. 274–282 (2018)

  19. Wang, P., Wang, M., He, D.: Multi-scale feature pyramid and multi-branch neural network for person re-identification. Vis. Comput. 1–13 (2022)

  20. Su, C., Yang, F., Zhang, S.L., Tian, Q., Davis, L.S., Gao, W.: Multi-task learning with low rank attribute embedding for multi-camera person re-identification. IEEE Trans. Pattern Anal. Mach. Intell. 40(5), 1167–1181 (2018)

    Article  Google Scholar 

  21. Lin, Y.T., Zheng, L., Zheng, Z.D., Wu, Y., Hu, Z.L., Yan, C.G., Yang, Y.: Improving person re-identification by attribute and identity learning. Pattern Recogn. 95, 151–161 (2019)

    Article  Google Scholar 

  22. Tay, C.-P., Roy, S., Yap, K.-H., Soc, I.C.: Aanet: Attribute attention network for person re-identifications. In: 32nd IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE Conference on Computer Vision and Pattern Recognition, pp. 7127–7136 (2019)

  23. Liu, J.W., Zha, Z.J., Xie, H.T., Xiong, Z.W., Zhang, Y.D.: Ca(3)net: Contextual-attentional attribute-appearance network for person re-identification. In: Proceedings of the 2018 ACM Multimedia Conference (Mm’18), pp. 737–745 (2018)

  24. Wang, X., Zheng, S.F., Yang, R., Zheng, A.H., Chen, Z., Tang, J., Luo, B.: Pedestrian attribute recognition: a survey. Pattern Recogn. 121 (2022)

  25. Zeng, H.T., Ai, H.Z., Zhuang, Z.J., Chen, L.: Multi-task learning via co-attentive sharing for pedestrian attribute recognition. In: 2020 IEEE International Conference on Multimedia and Expo (ICME) (2020)

  26. He, K.M., Zhang, X.Y., Ren, S.Q., Sun, J.: Deep residual learning for image recognition. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 770–778 (2016)

  27. Wang, Z.Q., Li, Z., Sun, J., Xu, Y.L.: Selective convolutional features based generalized-mean pooling for fine-grained image retrieval. In: 2018 IEEE International Conference on Visual Communications and Image Processing (IEEE VCIP) (2018)

  28. Woo, S.H., Park, J., Lee, J.Y., Kweon, I.S.: Cbam: Convolutional block attention module. In: Computer Vision—ECCV 2018, Pt Vii 11211, pp. 3–19 (2018)

  29. Shelhamer, E., Long, J., Darrell, T.: Fully convolutional networks for semantic segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 39(4), 640–651 (2017)

    Article  Google Scholar 

  30. Huang, G., Liu, Z., van der Maaten, L., Weinberger, K.Q.: Densely connected convolutional networks. In: 30th IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2017), pp. 2261–2269 (2017)

  31. Zheng, L., Shen, L.Y., Tian, L., Wang, S.J., Wang, J.D., Tian, Q.: Scalable person re-identification: a benchmark. In: 2015 IEEE International Conference on Computer Vision (ICCV), pp. 1116–1124 (2015)

  32. Zheng, Z.D., Zheng, L., Yang, Y.: Unlabeled samples generated by GAN improve the person re-identification baseline in vitro. In: 2017 IEEE International Conference on Computer Vision (ICCV), pp. 3774–3782 (2017)

  33. Li, W., Zhao, R., Xiao, T., Wang, X.: Deepreid: deep filter pairing neural network for person re-identification. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 152–159 (2014)

  34. Wei, L.H., Zhang, S.L., Gao, W., Tian, Q.: Person transfer GAN to bridge domain gap for person re-identification. In: 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 79–88 (2018)

  35. Arandjelovic, R., Zisserman, A.: Multiple queries for large scale specific object retrieval. In: BMVC, vol. 2, p. 6 (2012)

  36. Zhong, Z., Zheng, L., Cao, D.L., Li, S.Z.: Re-ranking person re-identification with k-reciprocal encoding. In: 30th IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2017), pp. 3652–3661 (2017)

  37. Li, W., Zhu, X.T., Gong, S.G.: Harmonious attention network for person re-identification. In: 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2285–2294 (2018)

  38. Sun, Y.F., Xu, Q., Li, Y.L., Zhang, C., Li, Y.K., Wang, S.J., Sun, J.: Perceive where to focus: Learning visibility-aware part-level features for partial person re-identification. In: 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2019), pp. 393–402 (2019)

  39. Zheng, M., Karanam, S., Wu, Z.Y., Radke, R.J.: Re-identification with consistent attentive siamese networks. In: 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2019), pp. 5728–5737 (2019)

  40. Park, H., Ham, B.: Relation network for person re-identification. In: Thirty-Fourth AAAI Conference on Artificial Intelligence, the Thirty-Second Innovative Applications of Artificial Intelligence Conference and the Tenth AAAI Symposium on Educational Advances in Artificial Intelligence, vol. 34, pp. 11839–11847 (2020)

  41. Li, H.J., Wu, G.J., Zheng, W.S.: Combined depth space based architecture search for person re-identification. In: 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2021), pp. 6725–6734 (2021)

  42. Zhu, K., Guo, H., Liu, Z., Tang, M., Wang, J.: Identity-guided human semantic parsing for person re-identification. In: European Conference on Computer Vision, pp. 346–363. Springer (2020)

  43. Zheng, F., Deng, C., Sun, X., Jiang, X.Y., Guo, X.W., Yu, Z.Q., Huang, F.Y., Ji, R.R.: Pyramidal person re-identification via multi-loss dynamic training. In: 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2019), pp. 8506–8514 (2019)

  44. Mansouri, N., Ammar, S., Kessentini, Y.: Re-ranking person re-identification using attributes learning. Neural Comput. Appl. 33(19), 12827–12843 (2021). https://doi.org/10.1007/s00521-021-05936-5

    Article  Google Scholar 

  45. Chikontwe, P., Lee, H.J.: Deep multi-task network for learning person identity and attributes. IEEE Access 6, 60801–60811 (2018)

    Article  Google Scholar 

  46. Wang, C., Zhang, Q., Huang, C., Liu, W.Y., Wang, X.G.: Mancs: a multi-task attentional network with curriculum sampling for person re-identification. In: Computer Vision—ECCV 2018, Pt Iv 11208, pp. 384–400 (2018)

  47. Jin, H.Y., Lai, S.Q., Qian, X.M.: Occlusion-sensitive person re-identification via attribute-based shift attention. IEEE Trans. Circuits Syst. Video Technol. 32(4), 2170–2185 (2022)

    Article  Google Scholar 

  48. Tay, C.P., Yap, K.H.: Apnet: attribute parsing network for person re-identification. In: 2021 IEEE International Conference on Image Processing (ICIP), pp. 1144–1148 (2021)

  49. Wu, G., Zhu, X., Gong, S.: Learning hybrid ranking representation for person re-identification. Pattern Recogn. 121, 108239 (2022)

    Article  Google Scholar 

  50. Zhou, Y., Liu, P., Cui, Y., Liu, C., Duan, W.: Integration of multi-head self-attention and convolution for person re-identification. Sensors (2022). https://doi.org/10.3390/s22166293

  51. Xi, J., Huang, J., Zheng, S., Zhou, Q., Schiele, B., Hua, X.-S., Sun, Q.: Learning comprehensive global features in person re-identification: ensuring discriminativeness of more local regions. Pattern Recogn. 134, 109068 (2023)

    Article  Google Scholar 

  52. Pervaiz, N., Fraz, M.M., Shahzad, M.: Per-former: rethinking person re-identification using transformer augmented with self-attention and contextual mapping. Vis. Comput. (2022). https://doi.org/10.1007/s00371-022-02577-0

  53. Wei, L.H., Zhang, S.L., Yao, H.T., Gao, W., Tian, Q.: Glad: global-local-alignment descriptor for scalable person re-identification. IEEE Trans. Multimed. 21(4), 986–999 (2019)

    Article  Google Scholar 

  54. Hou, R., Ma, B., Chang, H., Gu, X., Shan, S., Chen, X., Soc, I.C.: Interaction-and-aggregation network for person re-identification. In: 32nd IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE Conference on Computer Vision and Pattern Recognition, pp. 9309–9318 (2019). https://doi.org/10.1109/cvpr.2019.00954

  55. Zhao, Q., Du, N., Ouyang, Z., Kang, N., Liu, Z., Wang, X., He, Q., Xu, Y., Ge, S., Song, J.: Part-level attention networks for cross-domain person re-identification. IET Image Proc. 15(14), 3599–3607 (2021). https://doi.org/10.1049/ipr2.12292

    Article  Google Scholar 

  56. Hu, J., Shen, L., Sun, G.: Squeeze-and-excitation networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 7132–7141 (2018)

  57. Wang, X., Girshick, R., Gupta, A., He, K.: Non-local neural networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 7794–7803 (2018)

  58. Pan, X., Luo, P., Shi, J., Tang, X.: Two at once: Enhancing learning and generalization capacities via ibn-net. In: Proceedings of the European Conference on Computer Vision (ECCV), pp. 464–479 (2018)

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yanbing Chen.

Ethics declarations

Conflict of interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Ethics approval

This paper strictly abides by the moral standards of this journal.

Consent to participate

All the authors of this paper have reviewed and agreed to contribute to your journal by consensus.

Consent for publication

Once this paper is hired, we agree to publish it in your journal.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Wang, W., Chen, Y., Wang, D. et al. Joint attribute soft-sharing and contextual local: a multi-level features learning network for person re-identification. Vis Comput 40, 2251–2264 (2024). https://doi.org/10.1007/s00371-023-02914-x

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00371-023-02914-x

Keywords

Navigation