Deep pixel regeneration for occlusion reconstruction in person re-identification

Tagore, Nirbhay Kumar; Medi, Prathistith Raj; Chattopadhyay, Pratik

doi:10.1007/s11042-023-15322-z


Published: 25 May 2023

Volume 83, pages 4443–4463, (2024)

Multimedia Tools and Applications

Nirbhay Kumar Tagore¹,
Prathistith Raj Medi² &
Pratik Chattopadhyay³

Abstract

Person re-identification is important for monitoring and tracking crowd movement in public-security applications. However, re-identification in the presence of occlusion remains challenging and has not yet received significant attention. In this work, we propose a solution to this problem by developing techniques for occlusion detection and reconstruction from RGB images and videos using deep neural networks. Specifically, a CNN-based occlusion detection model identifies the occluded frames in an input sequence. The pixels in the occluded regions are then reconstructed by a Conv-LSTM model when the input frames are sequential, or by an autoencoder when they are non-sequential. The quality of the reconstructed RGB frames is further refined using a DCGAN. We evaluate the method on four public datasets in terms of cumulative rank-based accuracy and Dice score, and the qualitative reconstruction results are visually appealing. Re-identification with a Siamese classifier achieves a Rank-1 accuracy of over 70% on each of these datasets after occlusion reconstruction. A comparative study with popular state-of-the-art approaches further demonstrates the effectiveness of our method for use at real-life surveillance sites.
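
The two evaluation measures named in the abstract can be illustrated with a minimal sketch. This is not the authors' evaluation code: it assumes that the Dice score is computed between two flattened binary masks and that rank-k accuracy is computed from a query-by-gallery distance matrix. The function names `dice_score` and `rank_k_accuracy`, and the convention that two empty masks score 1.0, are hypothetical choices made for illustration.

```python
from typing import Hashable, Sequence


def dice_score(pred: Sequence[int], target: Sequence[int]) -> float:
    """Dice coefficient 2|A ∩ B| / (|A| + |B|) for two flattened binary masks."""
    if len(pred) != len(target):
        raise ValueError("masks must have the same number of pixels")
    # Pixels that are foreground in both masks.
    intersection = sum(1 for p, t in zip(pred, target) if p and t)
    # Total foreground pixels across both masks.
    total = sum(1 for p in pred if p) + sum(1 for t in target if t)
    # Assumption: two empty masks are treated as a perfect match.
    return 1.0 if total == 0 else 2.0 * intersection / total


def rank_k_accuracy(
    dist: Sequence[Sequence[float]],
    query_ids: Sequence[Hashable],
    gallery_ids: Sequence[Hashable],
    k: int = 1,
) -> float:
    """Fraction of queries whose identity is among the k nearest gallery items.

    dist[i][j] is the distance between query i and gallery item j
    (smaller means more similar).
    """
    if not query_ids:
        raise ValueError("at least one query is required")
    hits = 0
    for row, qid in zip(dist, query_ids):
        # Gallery indices sorted from nearest to farthest for this query.
        order = sorted(range(len(row)), key=row.__getitem__)
        top_k = [gallery_ids[j] for j in order[:k]]
        if qid in top_k:
            hits += 1
    return hits / len(query_ids)
```

Evaluating `rank_k_accuracy` for k = 1, 2, 3, ... gives the points of a cumulative rank-based accuracy curve, and the k = 1 value corresponds to the Rank-1 accuracy quoted above.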

Acknowledgements

The authors would like to thank SERB, DST for partially supporting this work through project grant CRG/2020/005465.

Author information

Authors and Affiliations

School of Computer Science Engineering and Technology, Bennett University, Greater Noida, PIN 221310, India
Nirbhay Kumar Tagore
Department of Data Science and Artificial Intelligence, International Institute of Information Technology, Naya Raipur, Chhattisgarh, PIN 493661, India
Prathistith Raj Medi
Pattern Recognition Lab, Department of Computer Science and Engineering, Indian Institute of Technology (BHU), Varanasi, Uttar Pradesh, PIN 221005, India
Pratik Chattopadhyay

Corresponding author

Correspondence to Nirbhay Kumar Tagore.

Ethics declarations

Conflict of Interests

The authors declare that they have no conflict of interest.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

About this article

Cite this article

Tagore, N.K., Medi, P.R. & Chattopadhyay, P. Deep pixel regeneration for occlusion reconstruction in person re-identification. Multimed Tools Appl 83, 4443–4463 (2024). https://doi.org/10.1007/s11042-023-15322-z

Received: 20 June 2022
Revised: 22 February 2023
Accepted: 06 April 2023
Published: 25 May 2023
Issue Date: January 2024
DOI: https://doi.org/10.1007/s11042-023-15322-z
