Abstract
We focus on the one-example person re-identification (Re-ID) task, where each identity has only one labeled example along with many unlabeled examples. Since each identity has only one labeled example, the number of initialized label examples is small, and the body parts of person are not aligned due to changes in person pose and camera angle under the camera. Therefore, the distinguishing information of learning labeled and unlabeled examples is challenging. To overcome these problems, we propose an end-to-end multi-task training network for semi-supervised Re-ID. First, we impose a part segmentation (PS) constraint on feature maps, forcing a module to predict part labels from the feature maps and enhance alignment. Second, we carefully design the network named Multiple Branch Network (MBN). MBN is a multi-branch deep network architecture, which consisting of one branch for global feature representation and two branches for local feature representation, local feature representation that including horizontal stripes representation and PS representation, respectively. Finally, loss function fusion is designed to learn discriminative features for semi-supervised Re-ID. Specifically, the MBN model is optimized by mining the object classification loss, exclusive loss and PS loss simultaneously. We validate the effectiveness of our approach by demonstrating its superiority over the state-of-the-art methods on the standard benchmark datasets, including Market-1501, DukeMTMC-reID. Notably, the rank-1 accuracy of our method outperforms the state-of-the-art method by 15.9 points (absolute, i.e., 71.7% vs. 55.8%) on Market-1501 and 8.9 points on DukeMTMC-reID.
Similar content being viewed by others
References
Aliyu AL, Aneiba A, Patwary M (2019) Secure communication between network applications and controller in software defined network. In: 2019 IEEE 18th international symposium on network computing and applications (NCA), pp 1–8
Alp Güler R, Neverova N, Kokkinos I (2018) Densepose: dense human pose estimation in the wild. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 7297–7306
Bak S, Carr P (2017) One-shot metric learning for person re-identification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2990–2999
Deng C, Chen Z, Liu X, Gao X, Tao D (2018) Triplet-based deep hashing network for cross-modal retrieval. IEEE Trans Image Process 27:3893–3903
Dong X, Yan Y, Ouyang W, Yang Y (2018) Style aggregated network for facial landmark detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 379–388
Dong X, Yan Y, Tan M, Yang Y, Tsang IW (2018) Late fusion via subspace search with consistency preservation. IEEE Trans Image Process 28:518–528
Dong X, Zheng L, Ma F, Yang Y, Meng D (2018) Few-example object detection with model communication. IEEE Trans Pattern Anal Mach Intell. https://doi.org/10.1109/TPAMI.2018.2844853
Esmaeilpour M, Cardinal P, Koerich AL (2020) Unsupervised feature learning for environmental sound classification using weighted cycle-consistent generative adversarial network. Appl Soft Comput 86:105912
Foerster K, Schmid S (2019) Distributed consistent network updates in SDNs: local verification for global guarantees. In: 2019 IEEE 18th international symposium on network computing and applications (NCA), pp 1–4
Gao F, Jin Y, Ge Y, Lu S, Zhang Y (2020) Occluded person re-identification based on feature fusion and sparse reconstruction. Multimed Tools Appl. https://doi.org/10.1007/s11042-020-09361-z
Ghosh SK, Ghosh SC (2019) $Q$-learning based network selection mechanism for CRNs with secrecy provisioning. In: 2019 IEEE 18th international symposium on network computing and applications (NCA) pp 1–5
Gong K, Liang X, Zhang D, Shen X, Lin L (2017) Look into person: self-supervised structure-sensitive learning and a new benchmark for human parsing. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 932–940
Gray D, Tao H (2008) Viewpoint invariant pedestrian recognition with an ensemble of localized features. In: Computer vision—ECCV 2008, Berlin, Heidelberg, pp 262–275
Han J, Pauwels EJ, de Zeeuw PM, de With PH (2012) Employing a RGB-D sensor for real-time tracking of humans across multiple re-entries in a smart environment. IEEE Trans Consum Electron 58:255–263
He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, Las Vegas, pp 770–778
Hermans A, Beyer L, Leibe B (2017) In defense of the triplet loss for person re-identification. arXiv preprint arXiv:1703.07737
Huang H et al (2018) EANet: enhancing alignment for cross-domain person re-identification. arXiv preprint arXiv:1812.11369
Kingma DP, Mohamed S, Rezende DJ, Welling M (2014) Semi-supervised learning with deep generative models. In: Advances in neural information processing systems, pp 3581–3589
Kipf TN, Welling M (2016) Semi-supervised classification with graph convolutional networks. arXiv preprint arXiv:1609.02907
Li W, Zhao R, Xiao T, Wang X (2014) Deepreid: deep filter pairing neural network for person re-identification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 152–159
Li J, Ma AJ, Yuen PC (2018) Semi-supervised region metric learning for person re-identification. Int J Comput Vis 126:855–874
Lin Y et al (2019) Improving person re-identification by attribute and identity learning. Pattern Recognit 95:151–161
Liu J et al (2019) Identity preserving generative adversarial network for cross-domain person re-identification. IEEE Access 7:114021–114032
Liu Z, Wang D, Lu H (2017) Stepwise metric promotion for unsupervised video person re-identification. In: 2017 IEEE international conference on computer vision (ICCV), pp 2448–2457
Liu X, Liu W, Mei T, Ma H (2018) PROVID: progressive and multimodal vehicle reidentification for large-scale urban surveillance. IEEE Trans Multimed 20:645–658
Luan S, Chen C, Zhang B, Han J, Liu J (2018) Gabor convolutional networks. IEEE Trans Image Process 27:4357–4366
Ma F, Meng D, Xie Q, Li Z, Dong X (2017) Self-paced co-training. In: Proceedings of the 34th international conference on machine learning, vol 70, pp 2275–2284
Ma H, Liu W (2018) A progressive search paradigm for the internet of things. IEEE Multimed 25:76–86
Masoumi A, Ghassem-zadeh S, Hosseini SH, Ghavidel BZ (2020) Application of neural network and weighted improved PSO for uncertainty modeling and optimal allocating of renewable energies along with battery energy storage. Appl Soft Comput 88:105979
Nguyen T-B, Le T-L, Devillaine L, Pham TTT, Ngoc NP (2019) Effective multi-shot person re-identification through representative frames selection and temporal feature pooling. Multimed Tools Appl 78:33939–33967
Nie J, Huang L, Zhang W, Wei G, Wei Z (2019) Deep feature ranking for person re-identification. IEEE Access 7:15007–15017
Noroozi V, Bahaadini S, Zheng L, Xie S, Shao W, Philip SY (2018) Semi-supervised deep representation learning for multi-view problems. In: 2018 IEEE international conference on big data (Big Data), pp 56–64
Rasmus A, Berglund M, Honkala M, Valpola H, Raiko T (2015) Semi-supervised learning with ladder networks. In: Advances in neural information processing systems, pp 3546–3554
Raz O, Avin C, Schmid S (2019) Nap: network-aware data partitions for efficient distributed processing. In: 2019 IEEE 18th international symposium on network computing and applications (NCA), pp 1–9
Ristani E, Solera F, Zou R, Cucchiara R, Tomasi C (2016) Performance measures and a data set for multi-target, multi-camera tracking. In: European conference on computer vision, pp 17–35
Roy A, Singha J, Devi SS, Laskar RH (2016) Impulse noise removal using SVM classification based fuzzy filter from gray scale images. Signal Process 128:262–273
Roy A, Singha J, Laskar RH (2018) Removal of impulse noise from gray images using fuzzy SVM based histogram fuzzy filter. J Circuits Syst Comput 27:1850139
Salimans T, Goodfellow I, Zaremba W, Cheung V, Radford A, Chen X (2016) Improved techniques for training gans. In: Advances in neural information processing systems, pp 2234–2242
Sun Y, Zheng L, Yang Y, Tian Q, Wang S (2018) Beyond part models: person retrieval with refined part pooling (and a strong convolutional baseline). In: Proceedings of the European conference on computer vision (ECCV), Munich, pp 480–496
Tan S, Zheng F, Liu L, Han J, Shao L (2016) Dense invariant feature-based support vector ranking for cross-camera person reidentification. IEEE Trans Circuits Syst Video Technol 28:356–363
Wu Y, Lin Y, Dong X, Yan Y, Bian W, Yang Y (2019) Progressive learning for person re-identification with one example. IEEE Trans Image Process 28:2872–2881
Xiao T, Li H, Ouyang W, Wang X (2016) Learning deep feature representations with domain guided dropout for person re-identification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, Las Vegas, pp 1249–1258
Yang Q, Wu A, Zheng W-S (2019) Deep semi-supervised person re-identification with external memory. In: IEEE international conference on multimedia and expo, ICME 2019, Shanghai, 8–12 July 2019, pp 1096–1101
Ye M, Ma AJ, Zheng L, Li J, Yuen PC (2017) Dynamic label graph matching for unsupervised video re-identification. In: Proceedings of the IEEE international conference on computer vision, pp 5142–5150
Zhang W, Wei Z, Huang L, Nie J, Lv L, Wei G (2019) Person re-identification based on pose-aware segmentation. In: International conference on multimedia modeling, pp 302–314
Zhang W, Wei Z, Huang L, Xie K, Qin Q (2020) Adaptive attention-aware network for unsupervised person re-identification. Neurocomputing. https://doi.org/10.1016/j.neucom.2020.05.094
Zhang X, Jing X-Y, Zhu X, Ma F (2020) Semi-supervised person re-identification by similarity-embedded cycle GANs. Neural Comput Appl. https://doi.org/10.1007/s00521-020-04809-7
Zheng L, Shen L, Tian L, Wang S, Wang J, Tian Q (2015) Scalable person re-identification: a benchmark. In: Proceedings of the IEEE international conference on computer vision, pp 1116–1124
Zheng Z, Zheng L, Yang Y (2017) A discriminatively learned CNN embedding for person reidentification. ACM Trans Multimed Comput Commun Appl (TOMM) 14:1–20
Zheng Z, Zheng L, Yang Y (2017) Unlabeled samples generated by gan improve the person re-identification baseline in vitro. In: Proceedings of the IEEE international conference on computer vision, pp 3754–3762
Zheng L, Shen L, Tian L, Wang S, Wang J, Tian Q (2015) scalable person re-identification: a benchmark. In: 2015 IEEE international conference on computer vision (ICCV), Santiago, pp 1116–1124
Zhong Z, Zheng L, Cao D, Li S (2017) Re-ranking person re-identification with k-reciprocal encoding. In: Proceedings of the IEEE conference on computer vision and pattern recognition, Honolulu, pp 1318–1327
Acknowledgements
This work is supported by the National Natural Science Foundation of China (No. 61872326, No.61672475); Shandong Provincial Natural Science Foundation (ZR2019MF044).
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Cai, H., Huang, L., Zhang, W. et al. Learning discriminative features for semi-supervised person re-identification. Multimed Tools Appl 81, 1787–1809 (2022). https://doi.org/10.1007/s11042-021-11420-y
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-021-11420-y