Two-stage person re-identification scheme using cross-input neighborhood differences

Kim, Hyeonwoo; Kim, Hyungjoon; Ko, Bumyeon; Shim, Jonghwa; Hwang, Eenjun

doi:10.1007/s11227-021-03994-z

Published: 23 July 2021

The Journal of Supercomputing, volume 78, pages 3356–3373 (2022)

Hyeonwoo Kim¹, Hyungjoon Kim¹, Bumyeon Ko¹, Jonghwa Shim¹ & Eenjun Hwang¹ (ORCID: orcid.org/0000-0002-0418-4092)

Abstract

Person re-identification aims to identify images of a particular person captured from different cameras or the same camera under different conditions. Person re-identification is conducted using an identification model that classifies the identity of the selected person or a verification model that discriminates between positive and negative image pairs. To further improve the re-identification performance, various methods have combined identification loss with verification loss. However, because such methods compare identities using one-dimensional embedding features without spatial information, local relationships are not considered. Thus, in this paper, we propose a two-stage person re-identification scheme using feature extraction and feature comparison networks. The former generates feature maps with spatial information, and the latter calculates their neighborhood and global differences. We conducted extensive experiments using well-known person re-identification datasets, and the proposed model achieved rank-1 accuracies of 84% and 88.4% for CUHK03 and Market-1501, respectively.
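The abstract does not spell out how the neighborhood differences are computed. As a rough illustration only, the sketch below follows the cross-input neighborhood difference layer introduced by Ahmed et al. (2015): each value of one feature map is compared with the k × k neighborhood around the same position in the other map. That the proposed feature comparison network uses this exact form is an assumption; the function name `neighborhood_difference`, the single-channel list layout, the zero padding at the borders, and the default k = 5 are illustrative choices, not details taken from the paper.

```python
def neighborhood_difference(f, g, k=5):
    """Return d with d[y][x][i][j] = f[y][x] - g[y + i - k//2][x + j - k//2].

    f and g are equally sized 2-D lists holding one channel of a feature map.
    Positions of g that fall outside the map are treated as zero
    (zero padding is an assumption of this sketch).
    """
    h, w = len(f), len(f[0])
    same_shape = (
        len(g) == h
        and all(len(row) == w for row in f)
        and all(len(row) == w for row in g)
    )
    if not same_shape:
        raise ValueError("f and g must be rectangular and have the same shape")
    r = k // 2  # neighborhood radius

    def g_at(y, x):
        # Value of g at (y, x), or 0.0 outside the feature map.
        return g[y][x] if 0 <= y < h and 0 <= x < w else 0.0

    return [
        [
            [[f[y][x] - g_at(y + i - r, x + j - r) for j in range(k)]
             for i in range(k)]
            for x in range(w)
        ]
        for y in range(h)
    ]


# Toy 2 x 2 feature maps with a 3 x 3 neighborhood.
f = [[1.0, 2.0], [3.0, 4.0]]
g = [[1.0, 0.0], [0.0, 4.0]]
d = neighborhood_difference(f, g, k=3)
print(d[0][0][1][1])  # centre of the window at (0, 0): f[0][0] - g[0][0]
```

Each spatial position yields a k × k grid of differences, so the output keeps the spatial layout that a one-dimensional embedding would discard. A multi-channel version would apply the same operation to every channel independently.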

Acknowledgements

This work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT) (No. NRF-2021R1A4A1031864).

Author information

Authors and Affiliations

School of Electrical Engineering, Korea University, Seoul, Republic of Korea
Hyeonwoo Kim, Hyungjoon Kim, Bumyeon Ko, Jonghwa Shim & Eenjun Hwang

Corresponding author

Correspondence to Eenjun Hwang.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This paper is an extended version of our paper published in the Proceedings of the 2020 International Conference on Artificial Intelligence (ICAI), Las Vegas, USA, 27–30 July 2020.

About this article

Cite this article

Kim, H., Kim, H., Ko, B. et al. Two-stage person re-identification scheme using cross-input neighborhood differences. J Supercomput 78, 3356–3373 (2022). https://doi.org/10.1007/s11227-021-03994-z

Accepted: 12 July 2021
Published: 23 July 2021
Issue Date: February 2022
DOI: https://doi.org/10.1007/s11227-021-03994-z
