Abstract
In person re-identification (re-id), the key to retrieving the correct person image is to extract discriminative features. Features at different levels are considered complementary. In this work, we design a person re-id network, called ASCLNet, that extracts complementary multi-level features. ASCLNet contains three feature branches, and each branch extracts a different level of features. Furthermore, we propose two novel modules and apply them to learning local and attribute features in ASCLNet. One is the contextual local module, which learns local features enriched with context information from the local body part; the other is the attribute soft-sharing module, which enables a shared feature representation among attributes. With the support of these two modules, ASCLNet extracts multi-level features that are more discriminative. Experimental results show that ASCLNet achieves mAP of 88.85% and 80.18% on the Market-1501 and DukeMTMC-reID datasets, respectively.
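For readers unfamiliar with the mAP figures quoted above, the following is a minimal sketch of how mean average precision is commonly computed for a retrieval task. It is not taken from the paper: the function names are hypothetical, and the sketch assumes each query already has a ranked gallery list marked as match or non-match. It also omits the same-camera and junk-image filtering that standard re-id evaluation protocols typically apply.

```python
from typing import List, Sequence


def average_precision(ranked_matches: Sequence[bool]) -> float:
    """Average precision for one query.

    ranked_matches[i] is True when the gallery image at rank i + 1
    shares the identity of the query (hypothetical input format).
    """
    hits = 0
    precision_sum = 0.0
    for rank, is_match in enumerate(ranked_matches, start=1):
        if is_match:
            hits += 1
            # Precision measured at the rank of each correct match.
            precision_sum += hits / rank
    # A query with no correct match in the gallery contributes 0.
    return precision_sum / hits if hits else 0.0


def mean_average_precision(all_ranked_matches: List[Sequence[bool]]) -> float:
    """Mean of the per-query average precision values."""
    if not all_ranked_matches:
        return 0.0
    total = sum(average_precision(r) for r in all_ranked_matches)
    return total / len(all_ranked_matches)
```

Under this definition, a query whose correct matches sit at ranks 1 and 3 has an average precision of (1/1 + 2/3) / 2, so ranking correct matches earlier raises the score.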
Data availability
The datasets used in this paper are public datasets.
Ethics declarations
Conflict of interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Ethics approval
This paper strictly abides by the ethical standards of this journal.
Consent to participate
All authors of this paper have reviewed it and unanimously agreed to submit it to this journal.
Consent for publication
Once this paper is accepted, we agree to its publication in this journal.
About this article
Cite this article
Wang, W., Chen, Y., Wang, D. et al. Joint attribute soft-sharing and contextual local: a multi-level features learning network for person re-identification. Vis Comput 40, 2251–2264 (2024). https://doi.org/10.1007/s00371-023-02914-x
DOI: https://doi.org/10.1007/s00371-023-02914-x