Skip to main content
Log in

Learning domain invariant and specific representation for cross-domain person re-identification

  • Published:
Applied Intelligence Aims and scope Submit manuscript

Abstract

Person re-identification (re-ID) aims to match person images under different cameras with disjoint views. Although supervised re-ID has achieved great progress, unsupervised cross-domain re-ID remains a challenging work due to domain bias. In this work, we divide cross-domain re-ID task into two phases: domain-invariant features learning and domain-specific features learning. Our contributions are twofold. (i) To achieve domain-invariant features learning, a novel model called Pedestrian General Similarity (PGS) is proposed, which can eliminate two main factors that cause domain bias: image style and background. Compared with the existing re-ID models, PGS has better generalization ability. (ii) A novel pseudo label assignment method named Mutual Nearest Neighbors Pseudo Labeling (MNNPL) is proposed, which calculates pseudo labels based on the similarity between samples in the target domain, and the resulting pseudo labels are used to guide domain-specific feature learning. Extensive experiments are conducted on several large scale datasets, the results show that our method outperforms most published unsupervised cross-domain methods by a large margin.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

References

  1. Zheng L, Yang Y, Hauptmann AG (2016) Person re-identification: Past, present and future, arXiv:1610.02984

  2. Sun Y, Zheng L, Yang Y, Tian Q, Wang S (2018) Beyond part models: Person retrieval with refined part pooling and a strong convolutional baseline. Proceedings of the european conference on computer vision, pp 480–496

  3. Zhang X, Luo H, Fan X, Xiang W, Sun Y, Xiao Q, Jiang W, Zhang C, Sun J (2017) Alignedreid: Surpassing human-level performance in person re-identification, arXiv:1711.08184

  4. Wang G, Yuan Y, Chen X, Li J, Zhou X (2018) Learning discriminative features with multiple granularities for person re-identification. Proceedings of ACM multimedia conference on multimedia conference, pp 274–282

  5. Deng W, Zheng L, Ye Q, Kang G, Yang Y, Jiao J (2018) Image-image domain adaptation with preserved self-similarity and domain-dissimilarity for person re-identification. Proceedings of IEEE Conf Comput Vis Pattern Recognit, pp 994– 1003

  6. Wang J, Zhu X, Gong S, Li W (2018) Transferable joint attribute-identity deep learning for unsupervised person re-identification. Proceedings of IEEE Conf Comput Vis Pattern Recognit, pp 2275–2284

  7. Torralba A, Efros AA et al (2011) Unbiased look at dataset bias. Proc IEEE Conf Comput Vis Pattern Recognit 1:7

    Google Scholar 

  8. Pan X, Luo P, Shi J, Tang X (2018) Two at once: Enhancing learning and generalization capacities via ibn-net. Proceedings of european conference on computer vision, pp 464–479

  9. Ioffe S, Szegedy C (2015) Batch normalization: Accelerating deep network training by reducing internal covariate shift. arXiv:1502.03167

  10. Chen D, Zhang S, Ouyang W, Yang J, Tai Y (2018) Person search via a mask-guided two-stream cnn model. Proceedings of european conference on computer vision, pp 734–750

  11. Huang H, Yang W, Chen X, Zhao X, Huang K, Lin J, Huang G, Du D (2018) EANet: Enhancing Alignment for Cross-Domain Person Re-identification. arXiv:1812.11369

  12. Fu Y, Wei Y, wang G, zhou Y, shi H, huang TS (2019) Self-Similarity grouping: a simple unsupervised cross domain adaptation approach for person Re-Identification. Proceedings of the IEEE international conference on computer vision, pp 6112–6121

  13. Li W, Zhao R, Wang X (2012) Human reidentification with transferred metric learning. Asian conference on computer vision, pp 31–44

  14. Gray D, Brennan S, Tao H (2007) Evaluating appearance models for recognition, reacquisition, and tracking. IEEE Int Workshop Perform Eval Tracking Surveill 3:1–7

    Google Scholar 

  15. Ester M, Kriegel HP, Sander J, Xu X (1996) Density-based spatial clustering of applications with noise. Int Conf Knowl Discov Data Min 240:6

    Google Scholar 

  16. Zhang X, Cao J, Shen C, You M (2019) Self-training with progressive augmentation for unsupervised cross-domain person re-identification. Proceedings of the IEEE international conference on computer vision, pp 8222–8231

  17. Zheng L, Shen L, Tian L, Wang S, Wang J, Tian Q (2015) Scalable person re-identification: A benchmark, Proceedings of the IEEE international conference on computer vision, pp 1116–1124

  18. Zheng Z, Zheng L, Yang Y (2017) Unlabeled Samples Generated by GAN Improve the Person Re-identification Baseline in vitro. Proceedings of the IEEE international conference on computer vision, pp 3774–3782

  19. Ristani E, Solera F, Zou R, Cucchiara R, Tomasi C (2016) Performance Measures and a Data Set for Multi-Target, Multi-Camera Tracking. Proceedings of european conference on computer vision, pp 17–35

  20. Varior RR, Haloi M, Wang G (2016) Gated siamese convolutional neural network architecture for human re-identification. Proceedings of European conference on computer vision, pp 791–808

  21. Varior RR, Shuai B, Lu J, Xu D, Wang G (2016) A siamese long short-term memory architecture for human re-identification. Proceedings of European conference on computer vision, pp 135–153

  22. Hermans A, Beyer L, Leibe B (2017) In defense of the triplet loss for person re-identification. arXiv:1703.07737

  23. Chen W, Chen X, Zhang J, Huang K (2017) Beyond triplet loss: a deep quadruplet network for person re-identification. Proceedings of IEEE conference on computer vision and pattern recognition, pp 403–412

  24. Zheng Z, Zheng L, Yang Y (2018) Pedestrian alignment network for large-scale person re-identification. IEEE Trans Circ Syst Video Technol 29:3037–3045

    Article  Google Scholar 

  25. Zheng L, Huang Y, Lu H, Yang Y (2019) Pose invariant embedding for deep person re-identification. IEEE Trans Image Process 28:4500–4509

    Article  MathSciNet  Google Scholar 

  26. Zhao H, Tian M, Sun S, Shao J, Yan J, Yi S, Wang X, Tang X (2017) Spindle net: Person re-identification with human body region guided feature decomposition and fusion. Proceedings of IEEE conference on computer vision and pattern recognition, pp 1077–1085

  27. Liao S, Hu Y, Zhu X, Li SZ (2015) Person re-identification by local maximal occurrence representation and metric learning. Proceedings of IEEE conference on computer vision and pattern recognition, pp 2197–2206

  28. Goodfellow I, Pouget-Abadie J, Mirza M, Xu B, Warde-Farley D, Ozair S, Courville A, Bengio Y (2014) Generative adversarial nets. Advances in neural information processing systems, pp 2672–2680

  29. Wei L, Zhang S, Gao W, Tian Q (2018) Person transfer gan to bridge domain gap for person re-identification. Proceedings of IEEE conference on computer vision and pattern recognition, pp 79–88

  30. Zhong Z, Zheng L, Li S, Yang Y (2018) Generalizing a person retrieval model hetero-and homogeneously. Proceedings of european conference on computer vision, pp 172–188

  31. Zhu J-Y, Park T, Isola P, Efros AA (2017) Unpaired image-to-image translation using cycle-consistent adversarial networks. Proceedings of IEEE international conference on computer vision, pp 2223–2232

  32. Zhong Z, Zheng L, Luo Z, Li S, Yang Y (2019) Invariance matters: Exemplar memory for domain adaptive person re-identification. Proceedings of IEEE conference on computer vision and pattern recognition, pp 598–607

  33. Fan H, Zheng L, Yan C, Yang Y (2018) Unsupervised person re-identification: Clustering and fine-tuning. ACM Trans Multimed Comput Commun Appl 14:83

    Article  Google Scholar 

  34. Campello RJGB, Moulavi D, Sander J (2013) Density-based clustering based on hierarchical density estimates. Proceedings of Pacific-Asia conference on knowledge discovery and data mining, pp 160–172

  35. Yu H-X, Zheng W-S, Wu A, Guo X, Gong S, Lai J-H (2019) Unsupervised Person Re-identification by Soft Multilabel Learning. Proceedings of IEEE conference on computer vision and pattern recognition, pp 2148–2157

  36. Yang Q, Yu H-X, Wu A, Zheng W-S (2019) Patch-Based Discriminative feature learning for unsupervised person Re-Identification. Proceedings of IEEE conference on computer vision and pattern recognition, pp 3633–3642

  37. Ulyanov D, Vedaldi A, Lempitsky V (2016) Instance normalization: The missing ingredient for fast stylization. arXiv:1607.08022

  38. Russakovsky O, Deng J, Su H, Krause J, Satheesh S, Ma S, Huang Z, Karpathy A, Khosla A, Bernstein M et al (2015) Imagenet large scale visual recognition challenge. Int J Comput Vis 115:211–252

    Article  MathSciNet  Google Scholar 

  39. He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. Proceedings of IEEE conference on computer vision and pattern recognition, pp 770–778

  40. Zhang N, Ding S, Sun T, Liao H, Wang L, Shi Z (2020) Multi-view RBM with posterior consistency and domain adaptation. Inf Sci 516:142–157

    Article  Google Scholar 

  41. Nair V, Hinton GE (2010) Rectified linear units improve restricted boltzmann machines. Proceedings of International conference on machine learning, pp 807–814

  42. Ulyanov D, Vedaldi A, Lempitsky V (2017) Improved texture networks: Maximizing quality and diversity in feed-forward stylization and texture synthesis. Proceedings of IEEE conference on computer vision and pattern recognition, pp 6924–6932

  43. Huang X, Belongie S (2017) Arbitrary style transfer in real-time with adaptive instance normalization. Proceedings of IEEE international conference on computer vision, pp 1501–1510

  44. Dumoulin V, Shlens J, Kudlur M (2016) A learned representation for artistic style. arXiv:1610.0762

  45. Jaderberg M, Simonyan K, Zisserman A et al (2015) Spatial transformer networks. Advances in neural information processing systems, pp 2017–2025

  46. Felzenszwalb P, McAllester D, Ramanan D (2008) A discriminatively trained, multiscale, deformable part model. Proceedings of IEEE conference on computer vision and pattern recognition, pp 1–8

  47. Ren S, He K, Girshick R, Sun J (2015) Faster r-cnn: Towards real-time object detection with region proposal networks. Advances in neural information processing systems, pp 91–99

  48. Paszke A, Gross S, Chintala S, Chanan G, Yang E, DeVito Z, Lin Z, Desmaison A, Antiga L, Lerer A (2017) Automatic differentiation in pytorch. Proceedings of Neural Information Processing Systems

  49. Bottou L (2010) Large-scale machine learning with stochastic gradient descent. Proceedings of COMPSTAT’2010, pp 177–186

  50. Peng P, Xiang T, Wang Y, Pontil M, Gong S, Huang T, Tian Y (2016) Unsupervised cross-dataset transfer learning for person re-identification. Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1306–1315

  51. Zhou K, Yang Y, Cavallaro A, Xiang T (2019) Omni-Scale Feature Learning for Person Re-Identification. arXiv:1905.00953

  52. Yu H-X, Wu A, Zheng W-S (2017) Cross-view asymmetric metric learning for unsupervised person re-identification. Proceedings of the IEEE international conference on computer vision, pp 994–1002

  53. Wei X-S, Luo J-H, Wu J, Zhou Z-H (2017) Selective convolutional descriptor aggregation for fine-grained image retrieval. IEEE Trans Image Process 26:2868–2881

    Article  MathSciNet  Google Scholar 

  54. Lloyd S (1982) Least squares quantization in PCM. IEEE Trans Inf Theory 28:129–137

    Article  MathSciNet  Google Scholar 

  55. Li W, Zhao R, Xiao T, Wang X (2014) Deepreid: Deep filter pairing neural network for person re-identification. IEEE conference on computer vision and pattern recognition, pp 152–159

  56. Ding S, Zhang N, Zhang X, Wu F (2017) Twin support vector machine: theory, algorithm and applications, vol 28

  57. Zeng K, Ding S, Jia W (2019) Single image super-resolution using a polymorphic parallel CNN, vol 49

  58. Zhang J, Ding S, Zhang N, Jia W (2019) Adversarial training methods for boltzmann machines. IEEE Access 8:4594–4604

    Article  Google Scholar 

  59. Zhong Z, Zheng L, Cao D, Li S (2017) Re-ranking person re-identification with k-reciprocal encoding. IEEE conference on computer vision and pattern recognition, pp 1318–1327

  60. MacQueen J et al (1967) Some methods for classification and analysis of multivariate observations, vol 14

  61. Luxburg Von (2007) Ulrike, a tutorial on spectral clustering. Stat Comput 17:395–416

    Article  MathSciNet  Google Scholar 

  62. Cong L, Ding S, Wang L, Zhang A, Jia W (2018) Image segmentation algorithm based on superpixel clustering, vol 12

  63. Zhu Y, Deng C, Cao H, Wang H (2020) Object and Background Disentanglement for Unsupervised Cross-Domain Person Re-Identification. Neurocomputing

  64. Zheng F, Deng C, Sun X, Jiang X, Guo X, Yu Z, Huang F, Ji R (2019) Pyramidal person re-identification via multi-loss dynamic training. Proc IEEE Conf Comput Vis Pattern Recognit, pp 8514–8522

  65. Pu C, Xu X, Deng C (2018) Deep View-Aware metric learning for person Re-Identification. IJCAI, pp 620–626

  66. Zhong Z., Zheng L., Zheng Z., Li S., Yang Y. (2019) Camstyle: A Novel Data Augmentation Method for Person Re-Identification. IEEE Trans Image Process 28:1176–1190

    Article  MathSciNet  Google Scholar 

  67. Song L, Wang C, Zhang L, Du B, Zhang Q, Huang C, Wang X (2020) Unsupervised domain adaptive re-identification: Theory and practice. Pattern Recogn 102:107173

  68. Tang Y, Xi Y, Wang N, Song B, Gao X (2020) CGAN-TM: A Novel Domain-to-Domain Transferring Method for Person Re-Identification, vol 29

  69. Zhao Y, Lu H (2020) Neighbor similarity and soft-label adaptation for unsupervised cross-dataset person re-identification. Neurocomputing

  70. Jiang K, Zhang T, Zhang Y, Wu F, Rui Y (2020) Self-Supervised Agent learning for unsupervised Cross-Domain person Re-Identification. IEEE Trans Image Process 29:8549–8560

    Article  Google Scholar 

  71. Lin Y, Wu Y, Yan C, Xu M, Yang Y (2020) Unsupervised Person Re-identification via Cross-Camera Similarity Exploration. IEEE Trans Image Process 29:5481–5490

    Article  Google Scholar 

Download references

Acknowledgements

This work was supported by the National Natural Science Foundation of China (Grant Nos. 62072345, 41671382), LIESMARS Special Research Funding. The numerical calculations in this paper have been done on the supercomputing system in the Supercomputing Center of Wuhan University.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Shaoming Pan.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Chong, Y., Peng, C., Zhang, C. et al. Learning domain invariant and specific representation for cross-domain person re-identification. Appl Intell 51, 5219–5232 (2021). https://doi.org/10.1007/s10489-020-02107-2

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10489-020-02107-2

Keywords

Navigation