Fine-grained alignment network and local attention network for person re-identification

Zhou, Dongming; Zhang, Canlong; Tang, Yanping; Li, Zhixin

doi:10.1007/s11042-022-12638-0

Fine-grained alignment network and local attention network for person re-identification

Published: 21 May 2022

Volume 81, pages 43267–43281, (2022)
Cite this article

Multimedia Tools and Applications Aims and scope Submit manuscript

Dongming Zhou¹,
Canlong Zhang ORCID: orcid.org/0000-0003-4375-1405¹,
Yanping Tang² &
…
Zhixin Li¹

296 Accesses
Explore all metrics

Abstract

Due to the influence of person posture changes, light angle of view, background and other factors, person re-identification is a challenging task. To improve the identification accuracy, recent studies have divided the pedestrians in the dataset into several blocks to extract the local features of the image for re-identification. However, these methods have such problems as the mismatch of local features of the human body and the loss of contextual clues of non-human body parts. To solve the above problems, this paper proposes a partially aligned network that can be used for person re-identification, which uses accurate local features to increase the ability of human body semantic parsing to model arbitrary contours. On this basis, the local attention network captures contextual cues that are not part of the human body. In addition, by aligning the local features of human body semantic parsing, the robustness and mobility of the model can be effectively increased. The experimental results obtained with the three datasets, Market-1501, DukeMTMC and CUHK03, show the effectiveness of the proposed model.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

SSD: Single Shot MultiBox Detector

Object detection using YOLO: challenges, architectural successors, datasets and applications

Article 08 August 2022

Tausif Diwan, G. Anirudh & Jitendra V. Tembhurne

End-to-End Object Detection with Transformers

References

Bai S, Bai X, Tian Q (2017) Scalable person re-identification on supervised smoothed manifold. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2530–2539
Cakir F, He K, Xia X, Kulis B, Sclaroff S (2019) Deep metric learning to rank. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1861–1870
Chen B, Deng W, Hu J (2019) Mixed high-order attention network for person re-identification. In: Proceedings of the IEEE international conference on computer vision, pp 371–381
Chen L-C, Papandreou G, Schroff F, Adam H (2017) Rethinking atrous convolution for semantic image segmentation. arXiv:1706.05587
Chen W, Cai F, Chen H, de Rijke M (2020) Hierarchical neural query suggestion with an attention mechanism. Inf Process Manag 57(6):102040
Article Google Scholar
Chen X, Fu C, Zhao Y, Zheng F, Song J, Ji R, Yang Y (2020) Salience-guided cascaded suppression network for person re-identification. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 3300–3310
Cheng D, Gong Y, Zhou S, Wang J, Zheng N (2016) Person re-identification by multi-channel parts-based cnn with improved triplet loss function. In: Proceedings of the iEEE conference on computer vision and pattern recognition, pp 1335–1344
Deng J, Dong W, Socher R, Li L-J, Li K, Fei-Fei L (2009) Imagenet: A large-scale hierarchical image database. In: 2009 IEEE conference on computer vision and pattern recognition. IEEE, pp 248–255
Elnagar A, Al-Debsi R, Einea O (2020) Arabic text classification using deep learning models. Inf Process Manag 57(1):102121
Article Google Scholar
Fu J, Liu J, Tian H, Li Y, Bao Y, Fang Z, Lu H (2019) Dual attention network for scene segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 3146–3154
Han K, Guo J, Zhang C, Zhu M (2018) Attribute-aware attention model for fine-grained representation learning. In: Proceedings of the 26th ACM international conference on multimedia, pp 2040–2048
He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 770–778
Laenen K, Moens M-F (2020) A comparative study of outfit recommendation methods with a focus on attention-based fusion. Inf Process Manag 57 (6):102316
Article Google Scholar
Li C, Bao Z, Li L, Zhao Z (2020) Exploring temporal representations by leveraging attention-based bidirectional lstm-rnns for multi-modal emotion recognition. Inf Process Manag 57(3):102185
Article Google Scholar
Li S, Yu H, Hu R (2020) Attributes-aided part detection and refinement for person re-identification. Pattern Recogn 97:107016
Article Google Scholar
Li W, Zhao R, Xiao T, Wang X (2014) Deepreid: Deep filter pairing neural network for person re-identification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 152–159
Li W, Zhu X, Gong S (2018) Harmonious attention network for person re-identification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2285–2294
Liu H, Feng J, Qi M, Jiang J, Yan S (2017) End-to-end comparative attention networks for person re-identification. IEEE Trans Image Process 26(7):3492–3506
Article MathSciNet MATH Google Scholar
Liu P, Zhang L, Gulla J A (2020) Dynamic attention-based explainable recommendation with textual and visual fusion. Inf Process Manag 57 (6):102099
Article Google Scholar
Liu X, Zhao H, Tian M, Sheng L, Shao J, Yi S, Yan J, Wang X (2017) Hydraplus-net: Attentive deep features for pedestrian analysis. In: Proceedings of the IEEE international conference on computer vision, pp 350–359
Ristani E, Tomasi C (2018) Features for multi-target multi-camera tracking and re-identification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 6036–6046
Ruan T, Liu T, Huang Z, Wei Y, Wei S, Zhao Y (2019) Devil in the details: Towards accurate single and multiple human parsing. In: Proceedings of the AAAI conference on artificial intelligence, vol 33, pp 4814–4821
Saquib Sarfraz M, Schumann A, Eberle A, Stiefelhagen R (2018) A pose-sensitive embedding for person re-identification with expanded cross neighborhood re-ranking. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 420–429
Shen Y, Li H, Xiao T, Yi S, Chen D, Wang X (2018) Deep group-shuffling random walk for person re-identification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2265–2274
Su Y, Fan K, Bach N, Kuo C-C J, Huang F (2019) Unsupervised multi-modal neural machine translation. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 10482–10491
Sun Y, Zheng L, Yang Y, Tian Q, Wang S (2018) Beyond part models: Person retrieval with refined part pooling (and a strong convolutional baseline). In: Proceedings of the European Conference on Computer Vision (ECCV), pp 480–496
Tao Z, Wei Y, Wang X, He X, Huang X, Chua T-S (2020) Mgat: Multimodal graph attention network for recommendation. Inf Process Manag 57(5):102277
Article Google Scholar
Varior R R, Haloi M, Wang G (2016) Gated siamese convolutional neural network architecture for human re-identification. In: European conference on computer vision. Springer, pp 791–808
Wang Q, Chan A B (2019) Describing like humans: on diversity in image captioning. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 4195–4203
Wang X, Girshick R, Gupta A, He K (2018) Non-local neural networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 7794–7803
Xu J, Zhao R, Zhu F, Wang H, Ouyang W (2018) Attention-aware compositional network for person re-identification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2119–2128
Yao H, Zhang S, Hong R, Zhang Y, Xu C, Tian Q (2019) Deep representation learning with part loss for person re-identification. IEEE Trans Image Process 28(6):2860–2871
Article MathSciNet MATH Google Scholar
Yuan Y, Su H, Liu J, Zeng G (2020) Locally and multiply distorted image quality assessment via multi-stage cnns. Inf Process Manag 57(4):102175
Article Google Scholar
Zhang Z, Lan C, Zeng W, Chen Z (2019) Densely semantically aligned person re-identification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 667–676
Zhang Z, Lan C, Zeng W, Jin X, Chen Z (2020) Relation-aware global attention for person re-identification. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 3186–3195
Zheng L, Shen L, Tian L, Wang S, Wang J, Tian Q (2015) Scalable person re-identification: A benchmark. In: Proceedings of the IEEE international conference on computer vision, pp 1116–1124
Zheng M, Karanam S, Wu Z, Radke R J (2019) Re-identification with consistent attentive siamese networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 5735–5744
Zheng Z, Zheng L, Yang Y (2017) Unlabeled samples generated by gan improve the person re-identification baseline in vitro. In: Proceedings of the IEEE International conference on computer vision, pp 3754–3762
Zhong Z, Zheng L, Cao D, Li S (2017) Re-ranking person re-identification with k-reciprocal encoding. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1318–1327

Download references

Funding

This work is supported by the National Natural Science Foundation of China (Nos.61866004, 61966004, 61962007), the Guangxi Natural Science Foundation (Nos.2018GXNSFDA281009, 2019GXNSFDA245018,2018GXNSFDA294001), Research Fund of Guangxi Key Lab of Multi-source Information Mining and Security (No.20-A-03-01), and Guangxi “Bagui Scholar” Teams for Innovation Research Project.

Author information

Authors and Affiliations

Guangxi Key Lab of Multi-source Information Mining & Security, Guangxi Normal University, Guilin, 541004, China
Dongming Zhou, Canlong Zhang & Zhixin Li
Guilin University of Electronic Technology, Guilin, 541004, China
Yanping Tang

Authors

Dongming Zhou
View author publications
You can also search for this author in PubMed Google Scholar
Canlong Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Yanping Tang
View author publications
You can also search for this author in PubMed Google Scholar
Zhixin Li
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Canlong Zhang.

Additional information

Availability of Data and Material

We all make sure that all data and materials support our published claims and comply with field standards.

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Zhou, D., Zhang, C., Tang, Y. et al. Fine-grained alignment network and local attention network for person re-identification. Multimed Tools Appl 81, 43267–43281 (2022). https://doi.org/10.1007/s11042-022-12638-0

Download citation

Received: 17 April 2021
Revised: 28 July 2021
Accepted: 09 February 2022
Published: 21 May 2022
Issue Date: December 2022
DOI: https://doi.org/10.1007/s11042-022-12638-0

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fine-grained alignment network and local attention network for person re-identification

Abstract

Access this article

Similar content being viewed by others

SSD: Single Shot MultiBox Detector

Object detection using YOLO: challenges, architectural successors, datasets and applications

End-to-End Object Detection with Transformers

References

Funding

Author information

Authors and Affiliations

Corresponding author

Additional information

Availability of Data and Material

Publisher’s note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Fine-grained alignment network and local attention network for person re-identification

Abstract

Access this article

Similar content being viewed by others

SSD: Single Shot MultiBox Detector

Object detection using YOLO: challenges, architectural successors, datasets and applications

End-to-End Object Detection with Transformers

References

Funding

Author information

Authors and Affiliations

Corresponding author

Additional information

Availability of Data and Material

Publisher’s note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation