Abstract
Person re-identification (re-ID) aims to match person images under different cameras with disjoint views. Although supervised re-ID has achieved great progress, unsupervised cross-domain re-ID remains a challenging work due to domain bias. In this work, we divide cross-domain re-ID task into two phases: domain-invariant features learning and domain-specific features learning. Our contributions are twofold. (i) To achieve domain-invariant features learning, a novel model called Pedestrian General Similarity (PGS) is proposed, which can eliminate two main factors that cause domain bias: image style and background. Compared with the existing re-ID models, PGS has better generalization ability. (ii) A novel pseudo label assignment method named Mutual Nearest Neighbors Pseudo Labeling (MNNPL) is proposed, which calculates pseudo labels based on the similarity between samples in the target domain, and the resulting pseudo labels are used to guide domain-specific feature learning. Extensive experiments are conducted on several large scale datasets, the results show that our method outperforms most published unsupervised cross-domain methods by a large margin.
Similar content being viewed by others
References
Zheng L, Yang Y, Hauptmann AG (2016) Person re-identification: Past, present and future, arXiv:1610.02984
Sun Y, Zheng L, Yang Y, Tian Q, Wang S (2018) Beyond part models: Person retrieval with refined part pooling and a strong convolutional baseline. Proceedings of the european conference on computer vision, pp 480–496
Zhang X, Luo H, Fan X, Xiang W, Sun Y, Xiao Q, Jiang W, Zhang C, Sun J (2017) Alignedreid: Surpassing human-level performance in person re-identification, arXiv:1711.08184
Wang G, Yuan Y, Chen X, Li J, Zhou X (2018) Learning discriminative features with multiple granularities for person re-identification. Proceedings of ACM multimedia conference on multimedia conference, pp 274–282
Deng W, Zheng L, Ye Q, Kang G, Yang Y, Jiao J (2018) Image-image domain adaptation with preserved self-similarity and domain-dissimilarity for person re-identification. Proceedings of IEEE Conf Comput Vis Pattern Recognit, pp 994– 1003
Wang J, Zhu X, Gong S, Li W (2018) Transferable joint attribute-identity deep learning for unsupervised person re-identification. Proceedings of IEEE Conf Comput Vis Pattern Recognit, pp 2275–2284
Torralba A, Efros AA et al (2011) Unbiased look at dataset bias. Proc IEEE Conf Comput Vis Pattern Recognit 1:7
Pan X, Luo P, Shi J, Tang X (2018) Two at once: Enhancing learning and generalization capacities via ibn-net. Proceedings of european conference on computer vision, pp 464–479
Ioffe S, Szegedy C (2015) Batch normalization: Accelerating deep network training by reducing internal covariate shift. arXiv:1502.03167
Chen D, Zhang S, Ouyang W, Yang J, Tai Y (2018) Person search via a mask-guided two-stream cnn model. Proceedings of european conference on computer vision, pp 734–750
Huang H, Yang W, Chen X, Zhao X, Huang K, Lin J, Huang G, Du D (2018) EANet: Enhancing Alignment for Cross-Domain Person Re-identification. arXiv:1812.11369
Fu Y, Wei Y, wang G, zhou Y, shi H, huang TS (2019) Self-Similarity grouping: a simple unsupervised cross domain adaptation approach for person Re-Identification. Proceedings of the IEEE international conference on computer vision, pp 6112–6121
Li W, Zhao R, Wang X (2012) Human reidentification with transferred metric learning. Asian conference on computer vision, pp 31–44
Gray D, Brennan S, Tao H (2007) Evaluating appearance models for recognition, reacquisition, and tracking. IEEE Int Workshop Perform Eval Tracking Surveill 3:1–7
Ester M, Kriegel HP, Sander J, Xu X (1996) Density-based spatial clustering of applications with noise. Int Conf Knowl Discov Data Min 240:6
Zhang X, Cao J, Shen C, You M (2019) Self-training with progressive augmentation for unsupervised cross-domain person re-identification. Proceedings of the IEEE international conference on computer vision, pp 8222–8231
Zheng L, Shen L, Tian L, Wang S, Wang J, Tian Q (2015) Scalable person re-identification: A benchmark, Proceedings of the IEEE international conference on computer vision, pp 1116–1124
Zheng Z, Zheng L, Yang Y (2017) Unlabeled Samples Generated by GAN Improve the Person Re-identification Baseline in vitro. Proceedings of the IEEE international conference on computer vision, pp 3774–3782
Ristani E, Solera F, Zou R, Cucchiara R, Tomasi C (2016) Performance Measures and a Data Set for Multi-Target, Multi-Camera Tracking. Proceedings of european conference on computer vision, pp 17–35
Varior RR, Haloi M, Wang G (2016) Gated siamese convolutional neural network architecture for human re-identification. Proceedings of European conference on computer vision, pp 791–808
Varior RR, Shuai B, Lu J, Xu D, Wang G (2016) A siamese long short-term memory architecture for human re-identification. Proceedings of European conference on computer vision, pp 135–153
Hermans A, Beyer L, Leibe B (2017) In defense of the triplet loss for person re-identification. arXiv:1703.07737
Chen W, Chen X, Zhang J, Huang K (2017) Beyond triplet loss: a deep quadruplet network for person re-identification. Proceedings of IEEE conference on computer vision and pattern recognition, pp 403–412
Zheng Z, Zheng L, Yang Y (2018) Pedestrian alignment network for large-scale person re-identification. IEEE Trans Circ Syst Video Technol 29:3037–3045
Zheng L, Huang Y, Lu H, Yang Y (2019) Pose invariant embedding for deep person re-identification. IEEE Trans Image Process 28:4500–4509
Zhao H, Tian M, Sun S, Shao J, Yan J, Yi S, Wang X, Tang X (2017) Spindle net: Person re-identification with human body region guided feature decomposition and fusion. Proceedings of IEEE conference on computer vision and pattern recognition, pp 1077–1085
Liao S, Hu Y, Zhu X, Li SZ (2015) Person re-identification by local maximal occurrence representation and metric learning. Proceedings of IEEE conference on computer vision and pattern recognition, pp 2197–2206
Goodfellow I, Pouget-Abadie J, Mirza M, Xu B, Warde-Farley D, Ozair S, Courville A, Bengio Y (2014) Generative adversarial nets. Advances in neural information processing systems, pp 2672–2680
Wei L, Zhang S, Gao W, Tian Q (2018) Person transfer gan to bridge domain gap for person re-identification. Proceedings of IEEE conference on computer vision and pattern recognition, pp 79–88
Zhong Z, Zheng L, Li S, Yang Y (2018) Generalizing a person retrieval model hetero-and homogeneously. Proceedings of european conference on computer vision, pp 172–188
Zhu J-Y, Park T, Isola P, Efros AA (2017) Unpaired image-to-image translation using cycle-consistent adversarial networks. Proceedings of IEEE international conference on computer vision, pp 2223–2232
Zhong Z, Zheng L, Luo Z, Li S, Yang Y (2019) Invariance matters: Exemplar memory for domain adaptive person re-identification. Proceedings of IEEE conference on computer vision and pattern recognition, pp 598–607
Fan H, Zheng L, Yan C, Yang Y (2018) Unsupervised person re-identification: Clustering and fine-tuning. ACM Trans Multimed Comput Commun Appl 14:83
Campello RJGB, Moulavi D, Sander J (2013) Density-based clustering based on hierarchical density estimates. Proceedings of Pacific-Asia conference on knowledge discovery and data mining, pp 160–172
Yu H-X, Zheng W-S, Wu A, Guo X, Gong S, Lai J-H (2019) Unsupervised Person Re-identification by Soft Multilabel Learning. Proceedings of IEEE conference on computer vision and pattern recognition, pp 2148–2157
Yang Q, Yu H-X, Wu A, Zheng W-S (2019) Patch-Based Discriminative feature learning for unsupervised person Re-Identification. Proceedings of IEEE conference on computer vision and pattern recognition, pp 3633–3642
Ulyanov D, Vedaldi A, Lempitsky V (2016) Instance normalization: The missing ingredient for fast stylization. arXiv:1607.08022
Russakovsky O, Deng J, Su H, Krause J, Satheesh S, Ma S, Huang Z, Karpathy A, Khosla A, Bernstein M et al (2015) Imagenet large scale visual recognition challenge. Int J Comput Vis 115:211–252
He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. Proceedings of IEEE conference on computer vision and pattern recognition, pp 770–778
Zhang N, Ding S, Sun T, Liao H, Wang L, Shi Z (2020) Multi-view RBM with posterior consistency and domain adaptation. Inf Sci 516:142–157
Nair V, Hinton GE (2010) Rectified linear units improve restricted boltzmann machines. Proceedings of International conference on machine learning, pp 807–814
Ulyanov D, Vedaldi A, Lempitsky V (2017) Improved texture networks: Maximizing quality and diversity in feed-forward stylization and texture synthesis. Proceedings of IEEE conference on computer vision and pattern recognition, pp 6924–6932
Huang X, Belongie S (2017) Arbitrary style transfer in real-time with adaptive instance normalization. Proceedings of IEEE international conference on computer vision, pp 1501–1510
Dumoulin V, Shlens J, Kudlur M (2016) A learned representation for artistic style. arXiv:1610.0762
Jaderberg M, Simonyan K, Zisserman A et al (2015) Spatial transformer networks. Advances in neural information processing systems, pp 2017–2025
Felzenszwalb P, McAllester D, Ramanan D (2008) A discriminatively trained, multiscale, deformable part model. Proceedings of IEEE conference on computer vision and pattern recognition, pp 1–8
Ren S, He K, Girshick R, Sun J (2015) Faster r-cnn: Towards real-time object detection with region proposal networks. Advances in neural information processing systems, pp 91–99
Paszke A, Gross S, Chintala S, Chanan G, Yang E, DeVito Z, Lin Z, Desmaison A, Antiga L, Lerer A (2017) Automatic differentiation in pytorch. Proceedings of Neural Information Processing Systems
Bottou L (2010) Large-scale machine learning with stochastic gradient descent. Proceedings of COMPSTAT’2010, pp 177–186
Peng P, Xiang T, Wang Y, Pontil M, Gong S, Huang T, Tian Y (2016) Unsupervised cross-dataset transfer learning for person re-identification. Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1306–1315
Zhou K, Yang Y, Cavallaro A, Xiang T (2019) Omni-Scale Feature Learning for Person Re-Identification. arXiv:1905.00953
Yu H-X, Wu A, Zheng W-S (2017) Cross-view asymmetric metric learning for unsupervised person re-identification. Proceedings of the IEEE international conference on computer vision, pp 994–1002
Wei X-S, Luo J-H, Wu J, Zhou Z-H (2017) Selective convolutional descriptor aggregation for fine-grained image retrieval. IEEE Trans Image Process 26:2868–2881
Lloyd S (1982) Least squares quantization in PCM. IEEE Trans Inf Theory 28:129–137
Li W, Zhao R, Xiao T, Wang X (2014) Deepreid: Deep filter pairing neural network for person re-identification. IEEE conference on computer vision and pattern recognition, pp 152–159
Ding S, Zhang N, Zhang X, Wu F (2017) Twin support vector machine: theory, algorithm and applications, vol 28
Zeng K, Ding S, Jia W (2019) Single image super-resolution using a polymorphic parallel CNN, vol 49
Zhang J, Ding S, Zhang N, Jia W (2019) Adversarial training methods for boltzmann machines. IEEE Access 8:4594–4604
Zhong Z, Zheng L, Cao D, Li S (2017) Re-ranking person re-identification with k-reciprocal encoding. IEEE conference on computer vision and pattern recognition, pp 1318–1327
MacQueen J et al (1967) Some methods for classification and analysis of multivariate observations, vol 14
Luxburg Von (2007) Ulrike, a tutorial on spectral clustering. Stat Comput 17:395–416
Cong L, Ding S, Wang L, Zhang A, Jia W (2018) Image segmentation algorithm based on superpixel clustering, vol 12
Zhu Y, Deng C, Cao H, Wang H (2020) Object and Background Disentanglement for Unsupervised Cross-Domain Person Re-Identification. Neurocomputing
Zheng F, Deng C, Sun X, Jiang X, Guo X, Yu Z, Huang F, Ji R (2019) Pyramidal person re-identification via multi-loss dynamic training. Proc IEEE Conf Comput Vis Pattern Recognit, pp 8514–8522
Pu C, Xu X, Deng C (2018) Deep View-Aware metric learning for person Re-Identification. IJCAI, pp 620–626
Zhong Z., Zheng L., Zheng Z., Li S., Yang Y. (2019) Camstyle: A Novel Data Augmentation Method for Person Re-Identification. IEEE Trans Image Process 28:1176–1190
Song L, Wang C, Zhang L, Du B, Zhang Q, Huang C, Wang X (2020) Unsupervised domain adaptive re-identification: Theory and practice. Pattern Recogn 102:107173
Tang Y, Xi Y, Wang N, Song B, Gao X (2020) CGAN-TM: A Novel Domain-to-Domain Transferring Method for Person Re-Identification, vol 29
Zhao Y, Lu H (2020) Neighbor similarity and soft-label adaptation for unsupervised cross-dataset person re-identification. Neurocomputing
Jiang K, Zhang T, Zhang Y, Wu F, Rui Y (2020) Self-Supervised Agent learning for unsupervised Cross-Domain person Re-Identification. IEEE Trans Image Process 29:8549–8560
Lin Y, Wu Y, Yan C, Xu M, Yang Y (2020) Unsupervised Person Re-identification via Cross-Camera Similarity Exploration. IEEE Trans Image Process 29:5481–5490
Acknowledgements
This work was supported by the National Natural Science Foundation of China (Grant Nos. 62072345, 41671382), LIESMARS Special Research Funding. The numerical calculations in this paper have been done on the supercomputing system in the Supercomputing Center of Wuhan University.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Chong, Y., Peng, C., Zhang, C. et al. Learning domain invariant and specific representation for cross-domain person re-identification. Appl Intell 51, 5219–5232 (2021). https://doi.org/10.1007/s10489-020-02107-2
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10489-020-02107-2