Learning discriminative features for semi-supervised person re-identification

Cai, Huanhuan; Huang, Lei; Zhang, Wenfeng; Wei, Zhiqiang

doi:10.1007/s11042-021-11420-y

Learning discriminative features for semi-supervised person re-identification

Published: 13 October 2021

Volume 81, pages 1787–1809, (2022)
Cite this article

Multimedia Tools and Applications Aims and scope Submit manuscript

Huanhuan Cai^1,2,
Lei Huang ORCID: orcid.org/0000-0003-4087-3677¹,
Wenfeng Zhang¹ &
…
Zhiqiang Wei¹

453 Accesses
1 Citation
1 Altmetric
Explore all metrics

Abstract

We focus on the one-example person re-identification (Re-ID) task, where each identity has only one labeled example along with many unlabeled examples. Since each identity has only one labeled example, the number of initialized label examples is small, and the body parts of person are not aligned due to changes in person pose and camera angle under the camera. Therefore, the distinguishing information of learning labeled and unlabeled examples is challenging. To overcome these problems, we propose an end-to-end multi-task training network for semi-supervised Re-ID. First, we impose a part segmentation (PS) constraint on feature maps, forcing a module to predict part labels from the feature maps and enhance alignment. Second, we carefully design the network named Multiple Branch Network (MBN). MBN is a multi-branch deep network architecture, which consisting of one branch for global feature representation and two branches for local feature representation, local feature representation that including horizontal stripes representation and PS representation, respectively. Finally, loss function fusion is designed to learn discriminative features for semi-supervised Re-ID. Specifically, the MBN model is optimized by mining the object classification loss, exclusive loss and PS loss simultaneously. We validate the effectiveness of our approach by demonstrating its superiority over the state-of-the-art methods on the standard benchmark datasets, including Market-1501, DukeMTMC-reID. Notably, the rank-1 accuracy of our method outperforms the state-of-the-art method by 15.9 points (absolute, i.e., 71.7% vs. 55.8%) on Market-1501 and 8.9 points on DukeMTMC-reID.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 4

PASS: Part-Aware Self-Supervised Pre-Training for Person Re-Identification

Horizontal Flipping Assisted Disentangled Feature Learning for Semi-supervised Person Re-identification

TriEP: Expansion-Pool TriHard Loss for Person Re-Identification

Article 15 January 2022

References

Aliyu AL, Aneiba A, Patwary M (2019) Secure communication between network applications and controller in software defined network. In: 2019 IEEE 18th international symposium on network computing and applications (NCA), pp 1–8
Alp Güler R, Neverova N, Kokkinos I (2018) Densepose: dense human pose estimation in the wild. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 7297–7306
Bak S, Carr P (2017) One-shot metric learning for person re-identification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2990–2999
Deng C, Chen Z, Liu X, Gao X, Tao D (2018) Triplet-based deep hashing network for cross-modal retrieval. IEEE Trans Image Process 27:3893–3903
Article MathSciNet Google Scholar
Dong X, Yan Y, Ouyang W, Yang Y (2018) Style aggregated network for facial landmark detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 379–388
Dong X, Yan Y, Tan M, Yang Y, Tsang IW (2018) Late fusion via subspace search with consistency preservation. IEEE Trans Image Process 28:518–528
Article MathSciNet Google Scholar
Dong X, Zheng L, Ma F, Yang Y, Meng D (2018) Few-example object detection with model communication. IEEE Trans Pattern Anal Mach Intell. https://doi.org/10.1109/TPAMI.2018.2844853
Article Google Scholar
Esmaeilpour M, Cardinal P, Koerich AL (2020) Unsupervised feature learning for environmental sound classification using weighted cycle-consistent generative adversarial network. Appl Soft Comput 86:105912
Article Google Scholar
Foerster K, Schmid S (2019) Distributed consistent network updates in SDNs: local verification for global guarantees. In: 2019 IEEE 18th international symposium on network computing and applications (NCA), pp 1–4
Gao F, Jin Y, Ge Y, Lu S, Zhang Y (2020) Occluded person re-identification based on feature fusion and sparse reconstruction. Multimed Tools Appl. https://doi.org/10.1007/s11042-020-09361-z
Article Google Scholar
Ghosh SK, Ghosh SC (2019) $Q$-learning based network selection mechanism for CRNs with secrecy provisioning. In: 2019 IEEE 18th international symposium on network computing and applications (NCA) pp 1–5
Gong K, Liang X, Zhang D, Shen X, Lin L (2017) Look into person: self-supervised structure-sensitive learning and a new benchmark for human parsing. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 932–940
Gray D, Tao H (2008) Viewpoint invariant pedestrian recognition with an ensemble of localized features. In: Computer vision—ECCV 2008, Berlin, Heidelberg, pp 262–275
Han J, Pauwels EJ, de Zeeuw PM, de With PH (2012) Employing a RGB-D sensor for real-time tracking of humans across multiple re-entries in a smart environment. IEEE Trans Consum Electron 58:255–263
Article Google Scholar
He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, Las Vegas, pp 770–778
Hermans A, Beyer L, Leibe B (2017) In defense of the triplet loss for person re-identification. arXiv preprint arXiv:1703.07737
Huang H et al (2018) EANet: enhancing alignment for cross-domain person re-identification. arXiv preprint arXiv:1812.11369
Kingma DP, Mohamed S, Rezende DJ, Welling M (2014) Semi-supervised learning with deep generative models. In: Advances in neural information processing systems, pp 3581–3589
Kipf TN, Welling M (2016) Semi-supervised classification with graph convolutional networks. arXiv preprint arXiv:1609.02907
Li W, Zhao R, Xiao T, Wang X (2014) Deepreid: deep filter pairing neural network for person re-identification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 152–159
Li J, Ma AJ, Yuen PC (2018) Semi-supervised region metric learning for person re-identification. Int J Comput Vis 126:855–874
Article Google Scholar
Lin Y et al (2019) Improving person re-identification by attribute and identity learning. Pattern Recognit 95:151–161
Article Google Scholar
Liu J et al (2019) Identity preserving generative adversarial network for cross-domain person re-identification. IEEE Access 7:114021–114032
Article Google Scholar
Liu Z, Wang D, Lu H (2017) Stepwise metric promotion for unsupervised video person re-identification. In: 2017 IEEE international conference on computer vision (ICCV), pp 2448–2457
Liu X, Liu W, Mei T, Ma H (2018) PROVID: progressive and multimodal vehicle reidentification for large-scale urban surveillance. IEEE Trans Multimed 20:645–658
Article Google Scholar
Luan S, Chen C, Zhang B, Han J, Liu J (2018) Gabor convolutional networks. IEEE Trans Image Process 27:4357–4366
Article MathSciNet Google Scholar
Ma F, Meng D, Xie Q, Li Z, Dong X (2017) Self-paced co-training. In: Proceedings of the 34th international conference on machine learning, vol 70, pp 2275–2284
Ma H, Liu W (2018) A progressive search paradigm for the internet of things. IEEE Multimed 25:76–86
Article MathSciNet Google Scholar
Masoumi A, Ghassem-zadeh S, Hosseini SH, Ghavidel BZ (2020) Application of neural network and weighted improved PSO for uncertainty modeling and optimal allocating of renewable energies along with battery energy storage. Appl Soft Comput 88:105979
Article Google Scholar
Nguyen T-B, Le T-L, Devillaine L, Pham TTT, Ngoc NP (2019) Effective multi-shot person re-identification through representative frames selection and temporal feature pooling. Multimed Tools Appl 78:33939–33967
Article Google Scholar
Nie J, Huang L, Zhang W, Wei G, Wei Z (2019) Deep feature ranking for person re-identification. IEEE Access 7:15007–15017
Article Google Scholar
Noroozi V, Bahaadini S, Zheng L, Xie S, Shao W, Philip SY (2018) Semi-supervised deep representation learning for multi-view problems. In: 2018 IEEE international conference on big data (Big Data), pp 56–64
Rasmus A, Berglund M, Honkala M, Valpola H, Raiko T (2015) Semi-supervised learning with ladder networks. In: Advances in neural information processing systems, pp 3546–3554
Raz O, Avin C, Schmid S (2019) Nap: network-aware data partitions for efficient distributed processing. In: 2019 IEEE 18th international symposium on network computing and applications (NCA), pp 1–9
Ristani E, Solera F, Zou R, Cucchiara R, Tomasi C (2016) Performance measures and a data set for multi-target, multi-camera tracking. In: European conference on computer vision, pp 17–35
Roy A, Singha J, Devi SS, Laskar RH (2016) Impulse noise removal using SVM classification based fuzzy filter from gray scale images. Signal Process 128:262–273
Article Google Scholar
Roy A, Singha J, Laskar RH (2018) Removal of impulse noise from gray images using fuzzy SVM based histogram fuzzy filter. J Circuits Syst Comput 27:1850139
Article Google Scholar
Salimans T, Goodfellow I, Zaremba W, Cheung V, Radford A, Chen X (2016) Improved techniques for training gans. In: Advances in neural information processing systems, pp 2234–2242
Sun Y, Zheng L, Yang Y, Tian Q, Wang S (2018) Beyond part models: person retrieval with refined part pooling (and a strong convolutional baseline). In: Proceedings of the European conference on computer vision (ECCV), Munich, pp 480–496
Tan S, Zheng F, Liu L, Han J, Shao L (2016) Dense invariant feature-based support vector ranking for cross-camera person reidentification. IEEE Trans Circuits Syst Video Technol 28:356–363
Article Google Scholar
Wu Y, Lin Y, Dong X, Yan Y, Bian W, Yang Y (2019) Progressive learning for person re-identification with one example. IEEE Trans Image Process 28:2872–2881
Article MathSciNet Google Scholar
Xiao T, Li H, Ouyang W, Wang X (2016) Learning deep feature representations with domain guided dropout for person re-identification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, Las Vegas, pp 1249–1258
Yang Q, Wu A, Zheng W-S (2019) Deep semi-supervised person re-identification with external memory. In: IEEE international conference on multimedia and expo, ICME 2019, Shanghai, 8–12 July 2019, pp 1096–1101
Ye M, Ma AJ, Zheng L, Li J, Yuen PC (2017) Dynamic label graph matching for unsupervised video re-identification. In: Proceedings of the IEEE international conference on computer vision, pp 5142–5150
Zhang W, Wei Z, Huang L, Nie J, Lv L, Wei G (2019) Person re-identification based on pose-aware segmentation. In: International conference on multimedia modeling, pp 302–314
Zhang W, Wei Z, Huang L, Xie K, Qin Q (2020) Adaptive attention-aware network for unsupervised person re-identification. Neurocomputing. https://doi.org/10.1016/j.neucom.2020.05.094
Article Google Scholar
Zhang X, Jing X-Y, Zhu X, Ma F (2020) Semi-supervised person re-identification by similarity-embedded cycle GANs. Neural Comput Appl. https://doi.org/10.1007/s00521-020-04809-7
Article Google Scholar
Zheng L, Shen L, Tian L, Wang S, Wang J, Tian Q (2015) Scalable person re-identification: a benchmark. In: Proceedings of the IEEE international conference on computer vision, pp 1116–1124
Zheng Z, Zheng L, Yang Y (2017) A discriminatively learned CNN embedding for person reidentification. ACM Trans Multimed Comput Commun Appl (TOMM) 14:1–20
Google Scholar
Zheng Z, Zheng L, Yang Y (2017) Unlabeled samples generated by gan improve the person re-identification baseline in vitro. In: Proceedings of the IEEE international conference on computer vision, pp 3754–3762
Zheng L, Shen L, Tian L, Wang S, Wang J, Tian Q (2015) scalable person re-identification: a benchmark. In: 2015 IEEE international conference on computer vision (ICCV), Santiago, pp 1116–1124
Zhong Z, Zheng L, Cao D, Li S (2017) Re-ranking person re-identification with k-reciprocal encoding. In: Proceedings of the IEEE conference on computer vision and pattern recognition, Honolulu, pp 1318–1327

Download references

Acknowledgements

This work is supported by the National Natural Science Foundation of China (No. 61872326, No.61672475); Shandong Provincial Natural Science Foundation (ZR2019MF044).

Author information

Authors and Affiliations

College of Information Science and Engineering, Ocean University of China, Qingdao, 266000, China
Huanhuan Cai, Lei Huang, Wenfeng Zhang & Zhiqiang Wei
Jining Polytechnic, Jining, 272000, China
Huanhuan Cai

Authors

Huanhuan Cai
View author publications
You can also search for this author in PubMed Google Scholar
Lei Huang
View author publications
You can also search for this author in PubMed Google Scholar
Wenfeng Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Zhiqiang Wei
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Lei Huang.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Cai, H., Huang, L., Zhang, W. et al. Learning discriminative features for semi-supervised person re-identification. Multimed Tools Appl 81, 1787–1809 (2022). https://doi.org/10.1007/s11042-021-11420-y

Download citation

Received: 08 November 2020
Revised: 06 February 2021
Accepted: 10 August 2021
Published: 13 October 2021
Issue Date: January 2022
DOI: https://doi.org/10.1007/s11042-021-11420-y

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Learning discriminative features for semi-supervised person re-identification

Abstract

Access this article

Similar content being viewed by others

PASS: Part-Aware Self-Supervised Pre-Training for Person Re-Identification

Horizontal Flipping Assisted Disentangled Feature Learning for Semi-supervised Person Re-identification

TriEP: Expansion-Pool TriHard Loss for Person Re-Identification

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Learning discriminative features for semi-supervised person re-identification

Abstract

Access this article

Similar content being viewed by others

PASS: Part-Aware Self-Supervised Pre-Training for Person Re-Identification

Horizontal Flipping Assisted Disentangled Feature Learning for Semi-supervised Person Re-identification

TriEP: Expansion-Pool TriHard Loss for Person Re-Identification

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation