Skip to main content
Log in

Learning discriminative features for semi-supervised person re-identification

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

We focus on the one-example person re-identification (Re-ID) task, where each identity has only one labeled example along with many unlabeled examples. Since each identity has only one labeled example, the number of initialized label examples is small, and the body parts of person are not aligned due to changes in person pose and camera angle under the camera. Therefore, the distinguishing information of learning labeled and unlabeled examples is challenging. To overcome these problems, we propose an end-to-end multi-task training network for semi-supervised Re-ID. First, we impose a part segmentation (PS) constraint on feature maps, forcing a module to predict part labels from the feature maps and enhance alignment. Second, we carefully design the network named Multiple Branch Network (MBN). MBN is a multi-branch deep network architecture, which consisting of one branch for global feature representation and two branches for local feature representation, local feature representation that including horizontal stripes representation and PS representation, respectively. Finally, loss function fusion is designed to learn discriminative features for semi-supervised Re-ID. Specifically, the MBN model is optimized by mining the object classification loss, exclusive loss and PS loss simultaneously. We validate the effectiveness of our approach by demonstrating its superiority over the state-of-the-art methods on the standard benchmark datasets, including Market-1501, DukeMTMC-reID. Notably, the rank-1 accuracy of our method outperforms the state-of-the-art method by 15.9 points (absolute, i.e., 71.7% vs. 55.8%) on Market-1501 and 8.9 points on DukeMTMC-reID.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10

Similar content being viewed by others

References

  1. Aliyu AL, Aneiba A, Patwary M (2019) Secure communication between network applications and controller in software defined network. In: 2019 IEEE 18th international symposium on network computing and applications (NCA), pp 1–8

  2. Alp Güler R, Neverova N, Kokkinos I (2018) Densepose: dense human pose estimation in the wild. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 7297–7306

  3. Bak S, Carr P (2017) One-shot metric learning for person re-identification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2990–2999

  4. Deng C, Chen Z, Liu X, Gao X, Tao D (2018) Triplet-based deep hashing network for cross-modal retrieval. IEEE Trans Image Process 27:3893–3903

    Article  MathSciNet  Google Scholar 

  5. Dong X, Yan Y, Ouyang W, Yang Y (2018) Style aggregated network for facial landmark detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 379–388

  6. Dong X, Yan Y, Tan M, Yang Y, Tsang IW (2018) Late fusion via subspace search with consistency preservation. IEEE Trans Image Process 28:518–528

    Article  MathSciNet  Google Scholar 

  7. Dong X, Zheng L, Ma F, Yang Y, Meng D (2018) Few-example object detection with model communication. IEEE Trans Pattern Anal Mach Intell. https://doi.org/10.1109/TPAMI.2018.2844853

    Article  Google Scholar 

  8. Esmaeilpour M, Cardinal P, Koerich AL (2020) Unsupervised feature learning for environmental sound classification using weighted cycle-consistent generative adversarial network. Appl Soft Comput 86:105912

    Article  Google Scholar 

  9. Foerster K, Schmid S (2019) Distributed consistent network updates in SDNs: local verification for global guarantees. In: 2019 IEEE 18th international symposium on network computing and applications (NCA), pp 1–4

  10. Gao F, Jin Y, Ge Y, Lu S, Zhang Y (2020) Occluded person re-identification based on feature fusion and sparse reconstruction. Multimed Tools Appl. https://doi.org/10.1007/s11042-020-09361-z

    Article  Google Scholar 

  11. Ghosh SK, Ghosh SC (2019) $Q$-learning based network selection mechanism for CRNs with secrecy provisioning. In: 2019 IEEE 18th international symposium on network computing and applications (NCA) pp 1–5

  12. Gong K, Liang X, Zhang D, Shen X, Lin L (2017) Look into person: self-supervised structure-sensitive learning and a new benchmark for human parsing. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 932–940

  13. Gray D, Tao H (2008) Viewpoint invariant pedestrian recognition with an ensemble of localized features. In: Computer vision—ECCV 2008, Berlin, Heidelberg, pp 262–275

  14. Han J, Pauwels EJ, de Zeeuw PM, de With PH (2012) Employing a RGB-D sensor for real-time tracking of humans across multiple re-entries in a smart environment. IEEE Trans Consum Electron 58:255–263

    Article  Google Scholar 

  15. He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, Las Vegas, pp 770–778

  16. Hermans A, Beyer L, Leibe B (2017) In defense of the triplet loss for person re-identification. arXiv preprint arXiv:1703.07737

  17. Huang H et al (2018) EANet: enhancing alignment for cross-domain person re-identification. arXiv preprint arXiv:1812.11369

  18. Kingma DP, Mohamed S, Rezende DJ, Welling M (2014) Semi-supervised learning with deep generative models. In: Advances in neural information processing systems, pp 3581–3589

  19. Kipf TN, Welling M (2016) Semi-supervised classification with graph convolutional networks. arXiv preprint arXiv:1609.02907

  20. Li W, Zhao R, Xiao T, Wang X (2014) Deepreid: deep filter pairing neural network for person re-identification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 152–159

  21. Li J, Ma AJ, Yuen PC (2018) Semi-supervised region metric learning for person re-identification. Int J Comput Vis 126:855–874

    Article  Google Scholar 

  22. Lin Y et al (2019) Improving person re-identification by attribute and identity learning. Pattern Recognit 95:151–161

    Article  Google Scholar 

  23. Liu J et al (2019) Identity preserving generative adversarial network for cross-domain person re-identification. IEEE Access 7:114021–114032

    Article  Google Scholar 

  24. Liu Z, Wang D, Lu H (2017) Stepwise metric promotion for unsupervised video person re-identification. In: 2017 IEEE international conference on computer vision (ICCV), pp 2448–2457

  25. Liu X, Liu W, Mei T, Ma H (2018) PROVID: progressive and multimodal vehicle reidentification for large-scale urban surveillance. IEEE Trans Multimed 20:645–658

    Article  Google Scholar 

  26. Luan S, Chen C, Zhang B, Han J, Liu J (2018) Gabor convolutional networks. IEEE Trans Image Process 27:4357–4366

    Article  MathSciNet  Google Scholar 

  27. Ma F, Meng D, Xie Q, Li Z, Dong X (2017) Self-paced co-training. In: Proceedings of the 34th international conference on machine learning, vol 70, pp 2275–2284

  28. Ma H, Liu W (2018) A progressive search paradigm for the internet of things. IEEE Multimed 25:76–86

    Article  MathSciNet  Google Scholar 

  29. Masoumi A, Ghassem-zadeh S, Hosseini SH, Ghavidel BZ (2020) Application of neural network and weighted improved PSO for uncertainty modeling and optimal allocating of renewable energies along with battery energy storage. Appl Soft Comput 88:105979

    Article  Google Scholar 

  30. Nguyen T-B, Le T-L, Devillaine L, Pham TTT, Ngoc NP (2019) Effective multi-shot person re-identification through representative frames selection and temporal feature pooling. Multimed Tools Appl 78:33939–33967

    Article  Google Scholar 

  31. Nie J, Huang L, Zhang W, Wei G, Wei Z (2019) Deep feature ranking for person re-identification. IEEE Access 7:15007–15017

    Article  Google Scholar 

  32. Noroozi V, Bahaadini S, Zheng L, Xie S, Shao W, Philip SY (2018) Semi-supervised deep representation learning for multi-view problems. In: 2018 IEEE international conference on big data (Big Data), pp 56–64

  33. Rasmus A, Berglund M, Honkala M, Valpola H, Raiko T (2015) Semi-supervised learning with ladder networks. In: Advances in neural information processing systems, pp 3546–3554

  34. Raz O, Avin C, Schmid S (2019) Nap: network-aware data partitions for efficient distributed processing. In: 2019 IEEE 18th international symposium on network computing and applications (NCA), pp 1–9

  35. Ristani E, Solera F, Zou R, Cucchiara R, Tomasi C (2016) Performance measures and a data set for multi-target, multi-camera tracking. In: European conference on computer vision, pp 17–35

  36. Roy A, Singha J, Devi SS, Laskar RH (2016) Impulse noise removal using SVM classification based fuzzy filter from gray scale images. Signal Process 128:262–273

    Article  Google Scholar 

  37. Roy A, Singha J, Laskar RH (2018) Removal of impulse noise from gray images using fuzzy SVM based histogram fuzzy filter. J Circuits Syst Comput 27:1850139

    Article  Google Scholar 

  38. Salimans T, Goodfellow I, Zaremba W, Cheung V, Radford A, Chen X (2016) Improved techniques for training gans. In: Advances in neural information processing systems, pp 2234–2242

  39. Sun Y, Zheng L, Yang Y, Tian Q, Wang S (2018) Beyond part models: person retrieval with refined part pooling (and a strong convolutional baseline). In: Proceedings of the European conference on computer vision (ECCV), Munich, pp 480–496

  40. Tan S, Zheng F, Liu L, Han J, Shao L (2016) Dense invariant feature-based support vector ranking for cross-camera person reidentification. IEEE Trans Circuits Syst Video Technol 28:356–363

    Article  Google Scholar 

  41. Wu Y, Lin Y, Dong X, Yan Y, Bian W, Yang Y (2019) Progressive learning for person re-identification with one example. IEEE Trans Image Process 28:2872–2881

    Article  MathSciNet  Google Scholar 

  42. Xiao T, Li H, Ouyang W, Wang X (2016) Learning deep feature representations with domain guided dropout for person re-identification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, Las Vegas, pp 1249–1258

  43. Yang Q, Wu A, Zheng W-S (2019) Deep semi-supervised person re-identification with external memory. In: IEEE international conference on multimedia and expo, ICME 2019, Shanghai, 8–12 July 2019, pp 1096–1101

  44. Ye M, Ma AJ, Zheng L, Li J, Yuen PC (2017) Dynamic label graph matching for unsupervised video re-identification. In: Proceedings of the IEEE international conference on computer vision, pp 5142–5150

  45. Zhang W, Wei Z, Huang L, Nie J, Lv L, Wei G (2019) Person re-identification based on pose-aware segmentation. In: International conference on multimedia modeling, pp 302–314

  46. Zhang W, Wei Z, Huang L, Xie K, Qin Q (2020) Adaptive attention-aware network for unsupervised person re-identification. Neurocomputing. https://doi.org/10.1016/j.neucom.2020.05.094

    Article  Google Scholar 

  47. Zhang X, Jing X-Y, Zhu X, Ma F (2020) Semi-supervised person re-identification by similarity-embedded cycle GANs. Neural Comput Appl. https://doi.org/10.1007/s00521-020-04809-7

    Article  Google Scholar 

  48. Zheng L, Shen L, Tian L, Wang S, Wang J, Tian Q (2015) Scalable person re-identification: a benchmark. In: Proceedings of the IEEE international conference on computer vision, pp 1116–1124

  49. Zheng Z, Zheng L, Yang Y (2017) A discriminatively learned CNN embedding for person reidentification. ACM Trans Multimed Comput Commun Appl (TOMM) 14:1–20

    Google Scholar 

  50. Zheng Z, Zheng L, Yang Y (2017) Unlabeled samples generated by gan improve the person re-identification baseline in vitro. In: Proceedings of the IEEE international conference on computer vision, pp 3754–3762

  51. Zheng L, Shen L, Tian L, Wang S, Wang J, Tian Q (2015) scalable person re-identification: a benchmark. In: 2015 IEEE international conference on computer vision (ICCV), Santiago, pp 1116–1124

  52. Zhong Z, Zheng L, Cao D, Li S (2017) Re-ranking person re-identification with k-reciprocal encoding. In: Proceedings of the IEEE conference on computer vision and pattern recognition, Honolulu, pp 1318–1327

Download references

Acknowledgements

This work is supported by the National Natural Science Foundation of China (No. 61872326, No.61672475); Shandong Provincial Natural Science Foundation (ZR2019MF044).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Lei Huang.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Cai, H., Huang, L., Zhang, W. et al. Learning discriminative features for semi-supervised person re-identification. Multimed Tools Appl 81, 1787–1809 (2022). https://doi.org/10.1007/s11042-021-11420-y

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-021-11420-y

Keywords

Navigation