Abstract
Person re-identification is very important for monitoring and tracking crowd movement to provide public security. However, re-identification in the presence of occlusion is a challenging area that has not received significant attention yet. In this work, we propose a plausible solution to this problem by developing effective techniques for occlusion detection and reconstruction from RGB images/videos using Deep Neural Networks. Specifically, a CNN-based occlusion detection model is used to detect the occluded frames in an input sequence, following which a Conv-LSTM model or an Autoencoder is employed to reconstruct the pixels corresponding to the occluded regions depending on whether the input frames are sequential or non-sequential. The quality of the reconstructed RGB frames is further refined using a DCGAN. Our method has been evaluated using four public data sets for cumulative rank-based accuracy and Dice score, and the qualitative reconstruction results are indeed appealing. Quantitative evaluation in terms of re-identification accuracy using a Siamese classifier shows a Rank-1 accuracy of over 70% after reconstructing the occlusion present in each of these datasets. A comparative study with popular state-of-the-art approaches also demonstrates the effectiveness of our work for use in real-life surveillance sites.









Similar content being viewed by others
References
Ahmed E, Jones M, Marks T K (2015) An improved deep learning architecture for person re-identification. In: Proc. of the Conf. on CVPR, pp 3908–3916
Chung D, Tahboub K, Delp E J (2017) A two stream siamese convolutional neural network for person re-identification. In: Proc. of the ICCV, pp 1983–1991
Dalal N, Triggs B (2005) Histograms of oriented gradients for human detection. In: CVPR, 2005. CVPR 2005. IEEE Computer society conf. on, vol 1, pp 886–893
Dalvi C, Rathod M, Patil S, Gite S, Kotecha K (2021) A survey of ai-based facial emotion recognition: Features, ml & dl techniques, age-wise datasets and future directions. IEEE Access 9:165806–165840
De Teyou G K (2020) ConvLSTM for spatio-temporal feature extraction in time-series images. In: Proc. of the NeurIPS: workshop on tackling climate change with machine learning. https://www.climatechange.ai/papers/neurips2020/12/paper.pdf
Ding S, Lin L, Wang G, Chao H (2015) Deep feature learning with relative distance comparison for person re-identification. Pattern Recogn 48(10):2993–3003
Fan X, Luo H, Zhang X, He L, Zhang C, Jiang W (2018) SCPNet: spatial-channel parallelism network for joint holistic and partial person re-identification. In: Proc. of the ACCV, pp 19–34
Fan D-P, Wang W, Cheng M-M, Shen J (2019) Shifting more attention to video salient object detection. In: Proc. of the Conf. on CVPR, pp 8554–8564
Fu J, Zheng H, Mei T (2017) Look closer to see better: recurrent attention convolutional neural network for fine-grained image recognition. In: Proc. of the Conf. on CVPR, pp 4438–4446
Gao S, Wang J, Lu H, Liu Z (2020) Pose-guided visible part matching for occluded person ReID. In: Proc. of the Conf. on CVPR, pp 11744–11752
Ge Y, Li Z, Zhao H, Yin G, Yi S, Wang X, Li H (2018) FD-GAN: pose-guided feature distilling GAN for robust person re-identification. arXiv:1810.02936
Gray D, Tao H (2008) Viewpoint invariant pedestrian recognition with an ensemble of localized features. In: Proc. of the ECCV, pp 262–275
He L, Liang J, Li H, Sun Z (2018) Deep spatial feature reconstruction for partial person re-identification: alignment-free approach. In: Proc. of the Conf. on CVPR, pp 7073–7082
He L, Sun Z, Zhu Y, Wang Y (2018) Recognizing partial biometric patterns. arXiv:1810.07399
He L, Wang Y, Liu W, Zhao H, Sun Z, Feng J (2019) Foreground-aware pyramid reconstruction for alignment-free occluded person re-identification. In: Proc. of the ICCV, pp 8450–8459
He L, Liu W (2020) Guided saliency feature learning for person re-identification in crowded scenes. In: Proc. of the ECCV, pp 357–373
Hermans A, Beyer L, Leibe B (2017) In defense of the triplet loss for person re-identification. arXiv:1703.07737
Hou R, Ma B, Chang H, Gu X, Shan S, Chen X (2019) VRSTC: occlusion-free video person re-identification. In: Proc. of the IEEE/CVF Conf. on CVPR, pp 7183–7192
Javed O, Shafique K, Shah M (2005) Appearance modeling for tracking in multiple non-overlapping cameras. In: Proc. of the Conf. on CVPR, vol 2, pp 26–33
Javed O, Shafique K, Rasheed Z, Shah M (2008) Modeling inter-camera space–time and appearance relationships for tracking across non-overlapping views. Comput Vis Image Underst 109(2):146–162
Jiang K, Zhang T, Zhang Y, Wu F, Rui Y (2020) Self-supervised agent learning for unsupervised cross-domain person re-identification. IEEE Trans Image Process 29:8549–8560
Kalayeh M M, Basaran E, Gökmen M, Kamasak M E, Shah M (2018) Human semantic parsing for person re-identification. In: Proc. of the Conf. on CVPR, pp 1062–1071
Li W, Zhao R, Xiao T, Wang X (2014) DeepReID: deep filter pairing neural network for person re-identification. In: Proc. of the Conf. on CVPR, pp 152–159
Li X, Chen H, Qi X, Dou Q, Fu C-W, Heng P-A (2018) H-DenseUNet: hybrid densely connected Unet for liver and tumor segmentation from CT volumes. IEEE Trans Med Imaging 37(12):2663–2674
Liao S, Hu Y, Zhu X, Li S Z (2015) Person re-identification by local maximal occurrence representation and metric learning. In: Proc. of the Conf. on CVPR, pp 2197–2206
Liu H, Jie Z, Jayashree K, Qi M, Jiang J, Yan S, Feng J (2017) Video-based person re-identification with accumulative motion context. IEEE Trans CSVT 28(10):2788–2802
Liu Y, Yan J, Ouyang W (2017) Quality aware network for set to set recognition. In: Proc. of the Conf. on CVPR, pp 5790–5799
Liu J, Ni B, Yan Y, Zhou P, Cheng S, Hu J (2018) Pose transferrable person re-identification. In: Proc. of the Conf. on CVPR, pp 4099–4108
Liu Y-C, Tan D S, Chen J-C, Cheng W-H, Hua K-L (2019) Segmenting hepatic lesions using residual attention u-net with an adaptive weighted dice loss. In: Proc. of the ICIP, pp 3322–3326
Lu X, Wang W, Ma C, Shen J, Shao L, Porikli F (2019) See more, know more: unsupervised video object segmentation with co-attention siamese networks. In: Proc. of the Conf. on CVPR, pp 3623–3632
McLaughlin N, Martinez del Rincon J, Miller P (2016) Recurrent convolutional network for video-based person re-identification. In: Proc. of the Conf. on CVPR, pp 1325–1334
Miao J, Wu Y, Liu P, Ding Y, Yang Y (2019) Pose-guided feature alignment for occluded person re-identification. In: Proc. of the ICCV, pp 542–551
Miao J, Wu Y, Yang Y (2021) Identifying visible parts via pose estimation for occluded person re-identification. IEEE Transactions on Neural Networks and Learning Systems
Minetto R, Segundo M P, Sarkar S (2019) Hydra: an ensemble of convolutional neural networks for geospatial land classification. IEEE Trans Geosci Remote Sens
Prosser B J, Gong S, Xiang T (2008) Multi-camera matching using bi-directional cumulative brightness transfer functions. In: Proc. of the BMVC, vol 8, 164. Citeseer, p 74
Radford A, Metz L, Chintala S (2015) Unsupervised representation learning with deep convolutional generative adversarial networks. arXiv:1511.06434
Sarfraz M S, Schumann A, Eberle A, Stiefelhagen R (2018) A pose-sensitive embedding for person re-identification with expanded cross neighborhood re-ranking. In: Proc. of the Conf. on CVPR, pp 420–429
Su C, Li J, Zhang S, Xing J, Gao W, Tian Q (2017) Pose-driven deep convolutional model for person re-identification. In: Proc. of the ICCV, pp 3960–3969
Subramaniam A, Chatterjee M, Mittal A (2016) Deep neural networks with inexact matching for person re-identification. In: Proc. of the advances in NIPS, pp 2667–2675
Sun Y, Zheng L, Yang Y, Tian Q, Wang S (2018) Beyond part models: person retrieval with refined part pooling (and a strong convolutional baseline). In: Proc. of the ECCV, pp 480–496
Sun Y, Xu Q, Li Y, Zhang C, Li Y, Wang S, Sun J (2019) Perceive where to focus: learning visibility-aware part-level features for partial person re-identification. In: Proc. of the Conf. on CVPR, pp 393–402
Tagore N K, Chattopadhyay P (2020) SMSNet: a novel multi-scale siamese model for person re-identification. In: Proc. of the ICETE, pp 103–112
Tagore N K, Chattopadhyay P (2022) A bi-network architecture for occlusion handling in person re-identification. SIViP 16(4):1071–1079
Tagore N K, Chattopadhyay P, Wang L (2020) T-MAN: a neural ensemble approach for person re-identification using spatio-temporal information. Multimed Tools Applic 79(37):28393–28409
Tagore N K, Singh A, Manche S, Chattopadhyay P (2021) Person re-identification from appearance cues and deep siamese features. J Vis Commun Image Represent 75:103029
Vulli A, Srinivasu P N, Sashank M S K, Shafi J, Choi J, Ijaz M F (2022) Fine-tuned densenet-169 for breast cancer metastasis prediction using fastai and 1-cycle policy. Sensors 22(8):2988
Wang X, Doretto G, Sebastian T, Rittscher J, Tu P (2007) Shape and appearance context modeling. In: Proc. of the ICCV, pp 1–8
Wang G, Yang S, Liu H, Wang Z, Yang Y, Wang S, Yu G, Zhou E, Sun J (2020) High-order information matters: learning relation and topology for occluded person re-identification. In: Proc. of the IEEE/CVF Conf. on CVPR, pp 6449–6458
Wang Z, Zhu F, Tang S, Zhao R, He L, Song J (2022) Feature erasing and diffusion network for occluded person re-identification. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 4754–4763
Xingjian SHI, Chen Z, Wang H, Yeung D-Y, Wong W-K, Woo W- (2015) Convolutional LSTM network: a machine learning approach for precipitation nowcasting. In: Proc. of the Advances in NIPS, pp 802–810
Xiong F, Gou M, Camps O, Sznaier M (2014) Person re-identification using kernel-based metric learning methods. In: Proc. of the ECCV, pp 1–16
Xu S, Cheng Y, Gu K, Yang Y, Chang S, Zhou P (2017) Jointly attentive spatial-temporal pooling networks for video-based person re-identification. In: Proc. of the ICCV, pp 4733–4742
Xu J, Zhao R, Zhu F, Wang H, Ouyang W (2018) Attention-aware compositional network for person re-identification. In: Proc. of the Conf. on CVPR, pp 2119–2128
Xu F, Ma B, Chang H, Shan S (2021) PRDP: person reidentification with dirty and poor data. IEEE Transactions on Cybernetics
Yan Y, Ni B, Song Z, Ma C, Yan Y, Yang X (2016) Person re-identification via recurrent feature aggregation. In: Proc. of the ECCV, pp 701–716
Yan C, Pang G, Jiao J, Bai X, Feng X, Shen C (2021) Occluded person re-identification with single-scale global representations. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 11875–11884
Yang Y, Yang J, Yan J, Liao S, Yi D, Li S Z (2014) Salient color names for person re-identification. In: Proc. of the ECCV, pp 536–551
Ye M, Li J, Ma A J, Zheng L, Yuen P C (2019) Dynamic graph co-matching for unsupervised video-based person re-identification. IEEE Trans Image Process 28(6):2976–2990
Ye M, Lan X, Leng Q, Shen J (2020) Cross-modality person re-identification via modality-aware collaborative ensemble learning, vol 29
Ye M, Shen J, Lin G, Xiang T, Shao L, Hoi SCH (2021) Deep learning for person re-identification: a survey and outlook. IEEE Transactions on PAMI
Zhao L, Li X, Zhuang Y, Wang J (2017) Deeply-learned part-aligned representations for person re-identification. In: Proc. of the ICCV, pp 3219–3228
Zheng W-S, Gong S, Xiang T (2009) Associating groups of people. In: Proc. of the BMVC, vol 2,6, pp 1–11
Zheng W-S, Gong S, Xiang T (2011) Person re-identification by probabilistic relative distance comparison. In: Proc. of the Conf. on CVPR, pp 649–656
Zheng W-S, Li X, Xiang T, Liao S, Lai J, Gong S (2015) Partial person re-identification. In: Proc. of the ICCV, pp 4678–4686
Zheng L, Yang Y, Hauptmann A G (2016) Person re-identification: past, present and future. arXiv:1610.02984
Zheng H, Fu J, Mei T, Luo J (2017) Learning multi-attention convolutional neural network for fine-grained image recognition. In: Proc. of the ICCV, pp 5209–5217
Zhou Z, Huang Y, Wang W, Wang L, Tan T (2017) See the forest for the trees: joint spatial and temporal recurrent neural networks for video-based person re-identification. In: Proc. of the Conf. on CVPR, pp 4747–4756
Zhuo J, Chen Z, Lai J, Wang G (2018) Occluded person re-identification. In: Proc. of the ICME, pp 1–6
Zhuo J, Lai J, Chen P (2019) A novel teacher-student learning framework for occluded person re-identification. arXiv:1907.03253
Zhou S, Wu J, Zhang F, Sehdev P (2020) Depth occlusion perception feature analysis for person re-identification. Pattern Recogn Lett 138:617–623
Zhou K, Yang Y, Cavallaro A, Xiang T (2021) Learning generalisable omni-scale representations for person re-identification. IEEE Transactions on PAMI
Acknowledgements
The authors would also like to thank SERB, DST for partially supporting this work through project grant (CRG/2020/005465)
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of Interests
The authors declare that they have no conflict of interest.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Tagore, N.K., Medi, P.R. & Chattopadhyay, P. Deep pixel regeneration for occlusion reconstruction in person re-identification. Multimed Tools Appl 83, 4443–4463 (2024). https://doi.org/10.1007/s11042-023-15322-z
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-023-15322-z