skip to main content
10.1145/3571600.3571655acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicvgipConference Proceedingsconference-collections
research-article

G-PReDICT: Generalizable Person Re-ID using Domain Invariant Contrastive Techniques✱

Published:12 May 2023Publication History

ABSTRACT

Learning identity-aware, domain-invariant representations is crucial in solving domain generalizable person ReID (DG-ReID). Existing methods commonly use augmentation techniques either in feature space by mixing instance and batch normalization layers or in pixel space by adversarially generating pseudo domains. However, neither of these techniques guarantee identity preservation. Apart from increasing training data diversity, the augmented positive pairs also encode rich semantic relations which have not been fully explored. To address the above issues, we propose a novel framework for Generalizable Person Re-identification using Domain Invariant Contrastive Techniques (G-PReDICT). Specifically, we use simple yet effective perturbation strategies to hallucinate positive samples across domains by realistically modelling domain variations while preserving the target identities. We harness rich sample-sample relations between the hallucinated positive-negative pairs to learn domain-invariant representations using supervised contrastive learning. We also use a domain independent auxiliary task, i.e. attribute prediction to learn robust representations and introduce attribute annotations for two large scale public benchmarks i.e. CUHK-03 and MSMT17. Extensive experiments on standard benchmarks demonstrate the effectiveness of the proposed method.

Skip Supplemental Material Section

Supplemental Material

References

  1. Alexey Abramov, Christopher Bayer, and Claudio Heller. 2020. Keep it simple: Image statistics matching for domain adaptation. arXiv preprint arXiv:2005.12551(2020).Google ScholarGoogle Scholar
  2. Slawomir Bak, Peter Carr, and Jean-Francois Lalonde. 2018. Domain adaptation through synthesis for unsupervised person re-identification. In Proceedings of the European conference on computer vision (ECCV). 189–205.Google ScholarGoogle Scholar
  3. Mathilde Caron, Ishan Misra, Julien Mairal, Priya Goyal, Piotr Bojanowski, and Armand Joulin. 2020. Unsupervised learning of visual features by contrasting cluster assignments. Advances in Neural Information Processing Systems 33 (2020), 9912–9924.Google ScholarGoogle Scholar
  4. Hao Chen, Yaohui Wang, Benoit Lagadec, Antitza Dantcheva, and Francois Bremond. 2021. Joint generative and contrastive learning for unsupervised person re-identification. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2004–2013.Google ScholarGoogle ScholarCross RefCross Ref
  5. Liang-Chieh Chen, George Papandreou, Florian Schroff, and Hartwig Adam. 2017. Rethinking Atrous Convolution for Semantic Image Segmentation. ArXiv abs/1706.05587(2017).Google ScholarGoogle Scholar
  6. Ting Chen, Simon Kornblith, Mohammad Norouzi, and Geoffrey Hinton. 2020. A simple framework for contrastive learning of visual representations. In International conference on machine learning. PMLR, 1597–1607.Google ScholarGoogle Scholar
  7. Seokeon Choi, Taekyung Kim, Minki Jeong, Hyoungseob Park, and Changick Kim. 2021. Meta batch-instance normalization for generalizable person re-identification. In Proceedings of the IEEE/CVF conference on Computer Vision and Pattern Recognition. 3425–3435.Google ScholarGoogle ScholarCross RefCross Ref
  8. Zuozhuo Dai, Guangyuan Wang, Weihao Yuan, Xiaoli Liu, Siyu Zhu, and Ping Tan. 2021. Cluster contrast for unsupervised person re-identification. arXiv preprint arXiv:2103.11568(2021).Google ScholarGoogle Scholar
  9. Weijian Deng, L. Zheng, Guoliang Kang, Yezhou Yang, Qixiang Ye, and Jianbin Jiao. 2018. Image-Image Domain Adaptation with Preserved Self-Similarity and Domain-Dissimilarity for Person Re-identification. 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition (2018), 994–1003.Google ScholarGoogle Scholar
  10. Ian J. Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, and Yoshua Bengio. 2014. Generative Adversarial Networks. https://doi.org/10.48550/ARXIV.1406.2661Google ScholarGoogle ScholarCross RefCross Ref
  11. Kaiming He, Haoqi Fan, Yuxin Wu, Saining Xie, and Ross Girshick. 2020. Momentum contrast for unsupervised visual representation learning. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 9729–9738.Google ScholarGoogle ScholarCross RefCross Ref
  12. Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition. 770–778.Google ScholarGoogle ScholarCross RefCross Ref
  13. Alexander Hermans, Lucas Beyer, and Bastian Leibe. 2017. In defense of the triplet loss for person re-identification. arXiv 2017. arXiv preprint arXiv:1703.07737 4 (2017).Google ScholarGoogle Scholar
  14. Weiquan Huang, Yan Bai, Qiuyu Ren, Xinbo Zhao, Ming Feng, and Yin Wang. 2021. Large-Scale Unsupervised Person Re-Identification with Contrastive Learning. arXiv preprint arXiv:2105.07914(2021).Google ScholarGoogle Scholar
  15. Sergey Ioffe and Christian Szegedy. 2015. Batch normalization: Accelerating deep network training by reducing internal covariate shift. In International conference on machine learning. PMLR, 448–456.Google ScholarGoogle Scholar
  16. Jieru Jia, Qiuqi Ruan, and Timothy M Hospedales. 2019. Frustratingly easy person re-identification: Generalizing person re-id in practice. arXiv preprint arXiv:1905.03422(2019).Google ScholarGoogle Scholar
  17. Xin Jin, Cuiling Lan, Wenjun Zeng, Zhibo Chen, and Li Zhang. 2020. Style normalization and restitution for generalizable person re-identification. In proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 3143–3152.Google ScholarGoogle ScholarCross RefCross Ref
  18. Prannay Khosla, Piotr Teterwak, Chen Wang, Aaron Sarna, Yonglong Tian, Phillip Isola, Aaron Maschinot, Ce Liu, and Dilip Krishnan. 2020. Supervised contrastive learning. Advances in Neural Information Processing Systems 33 (2020), 18661–18673.Google ScholarGoogle Scholar
  19. Vikash Kumar, Sarthak Srivastava, Rohit Lal, and Anirban Chakraborty. 2021. CAFT: Class Aware Frequency Transform for Reducing Domain Gap. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 2525–2534.Google ScholarGoogle ScholarCross RefCross Ref
  20. Pan Li, Da Li, Wei Li, Shaogang Gong, Yanwei Fu, and Timothy M Hospedales. 2021. A simple feature augmentation for domain generalization. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 8886–8895.Google ScholarGoogle ScholarCross RefCross Ref
  21. Wei Li, Rui Zhao, Tong Xiao, and Xiaogang Wang. 2014. Deepreid: Deep filter pairing neural network for person re-identification. In Proceedings of the IEEE conference on computer vision and pattern recognition. 152–159.Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Shengcai Liao and Ling Shao. 2020. Interpretable and generalizable person re-identification with query-adaptive convolution and temporal lifting. In European Conference on Computer Vision. Springer, 456–474.Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Shan Lin, Haoliang Li, Chang-Tsun Li, and Alex Chichung Kot. 2018. Multi-task mid-level feature alignment network for unsupervised cross-dataset person re-identification. arXiv preprint arXiv:1807.01440(2018).Google ScholarGoogle Scholar
  24. Yutian Lin, Liang Zheng, Zhedong Zheng, Yu Wu, Zhilan Hu, Chenggang Yan, and Yi Yang. 2019. Improving Person Re-identification by Attribute and Identity Learning. Pattern Recognition (2019). https://doi.org/10.1016/j.patcog.2019.06.006Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Lei Qi, Lei Wang, Jing Huo, Luping Zhou, Yinghuan Shi, and Yang Gao. 2019. A novel unsupervised camera-aware domain adaptation framework for person re-identification. In Proceedings of the IEEE/CVF international conference on computer vision. 8080–8089.Google ScholarGoogle ScholarCross RefCross Ref
  26. Ergys Ristani, Francesco Solera, Roger Zou, Rita Cucchiara, and Carlo Tomasi. 2016. Performance measures and a data set for multi-target, multi-camera tracking. In European conference on computer vision. Springer, 17–35.Google ScholarGoogle ScholarCross RefCross Ref
  27. Shiv Shankar, Vihari Piratla, Soumen Chakrabarti, Siddhartha Chaudhuri, Preethi Jyothi, and Sunita Sarawagi. 2018. Generalizing across domains via cross-gradient training. arXiv preprint arXiv:1804.10745(2018).Google ScholarGoogle Scholar
  28. Jifei Song, Yongxin Yang, Yi-Zhe Song, Tao Xiang, and Timothy M Hospedales. 2019. Generalizable person re-identification by domain-invariant mapping network. In Proceedings of the IEEE/CVF conference on Computer Vision and Pattern Recognition. 719–728.Google ScholarGoogle ScholarCross RefCross Ref
  29. Yifan Sun, Liang Zheng, Yi Yang, Qi Tian, and Shengjin Wang. 2018. Beyond part models: Person retrieval with refined part pooling (and a strong convolutional baseline). In Proceedings of the European conference on computer vision (ECCV). 480–496.Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. Yonglong Tian, Dilip Krishnan, and Phillip Isola. 2020. Contrastive multiview coding. In European conference on computer vision. Springer, 776–794.Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. Dmitry Ulyanov, Andrea Vedaldi, and Victor Lempitsky. 2016. Instance normalization: The missing ingredient for fast stylization. arXiv preprint arXiv:1607.08022(2016).Google ScholarGoogle Scholar
  32. Laurens Van der Maaten and Geoffrey Hinton. 2008. Visualizing data using t-SNE.Journal of machine learning research 9, 11 (2008).Google ScholarGoogle Scholar
  33. Riccardo Volpi, Hongseok Namkoong, Ozan Sener, John C Duchi, Vittorio Murino, and Silvio Savarese. 2018. Generalizing to unseen domains via adversarial data augmentation. Advances in neural information processing systems 31 (2018).Google ScholarGoogle Scholar
  34. Zheng Wang, Ruimin Hu, Chen Chen, Yi Yu, Junjun Jiang, Chao Liang, and Shin’ichi Satoh. 2018. Person Reidentification via Discrepancy Matrix and Matrix Metric. IEEE Transactions on Cybernetics 48, 10 (2018), 3006–3020. https://doi.org/10.1109/TCYB.2017.2755044Google ScholarGoogle ScholarCross RefCross Ref
  35. Longhui Wei, Shiliang Zhang, Wen Gao, and Qi Tian. 2018. Person transfer gan to bridge domain gap for person re-identification. In Proceedings of the IEEE conference on computer vision and pattern recognition. 79–88.Google ScholarGoogle ScholarCross RefCross Ref
  36. Yanchao Yang and Stefano Soatto. 2020. Fda: Fourier domain adaptation for semantic segmentation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 4085–4095.Google ScholarGoogle ScholarCross RefCross Ref
  37. Yabin Zhang, Minghan Li, Ruihuang Li, Kui Jia, and Lei Zhang. 2022. Exact feature distribution matching for arbitrary style transfer and domain generalization. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 8035–8045.Google ScholarGoogle ScholarCross RefCross Ref
  38. Yuyang Zhao, Zhun Zhong, Fengxiang Yang, Zhiming Luo, Yaojin Lin, Shaozi Li, and Nicu Sebe. 2021. Learning to generalize unseen domains via memory-based multi-source meta-learning for person re-identification. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 6277–6286.Google ScholarGoogle ScholarCross RefCross Ref
  39. Liang Zheng, Liyue Shen, Lu Tian, Shengjin Wang, Jingdong Wang, and Qi Tian. 2015. Scalable Person Re-identification: A Benchmark. In 2015 IEEE International Conference on Computer Vision (ICCV). 1116–1124. https://doi.org/10.1109/ICCV.2015.133Google ScholarGoogle ScholarCross RefCross Ref
  40. Zhedong Zheng, Liang Zheng, and Yi Yang. 2017. A discriminatively learned cnn embedding for person reidentification. ACM transactions on multimedia computing, communications, and applications (TOMM) 14, 1 (2017), 1–20.Google ScholarGoogle Scholar
  41. Zhun Zhong, Liang Zheng, Donglin Cao, and Shaozi Li. 2017. Re-ranking person re-identification with k-reciprocal encoding. In Proceedings of the IEEE conference on computer vision and pattern recognition. 1318–1327.Google ScholarGoogle ScholarCross RefCross Ref
  42. Zhun Zhong, Liang Zheng, Shaozi Li, and Yi Yang. 2018. Generalizing a person retrieval model hetero-and homogeneously. In Proceedings of the European conference on computer vision (ECCV). 172–188.Google ScholarGoogle ScholarDigital LibraryDigital Library
  43. Zhun Zhong, Liang Zheng, Zhiming Luo, Shaozi Li, and Yi Yang. 2019. Invariance matters: Exemplar memory for domain adaptive person re-identification. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 598–607.Google ScholarGoogle ScholarCross RefCross Ref
  44. Kaiyang Zhou, Yongxin Yang, Andrea Cavallaro, and Tao Xiang. 2019. Omni-scale feature learning for person re-identification. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 3702–3712.Google ScholarGoogle ScholarCross RefCross Ref
  45. Kaiyang Zhou, Yongxin Yang, Andrea Cavallaro, and Tao Xiang. 2021. Learning generalisable omni-scale representations for person re-identification. IEEE Transactions on Pattern Analysis and Machine Intelligence (2021).Google ScholarGoogle ScholarCross RefCross Ref
  46. Kaiyang Zhou, Yongxin Yang, Timothy Hospedales, and Tao Xiang. 2020. Deep domain-adversarial image generation for domain generalisation. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 34. 13025–13032.Google ScholarGoogle ScholarCross RefCross Ref
  47. Kaiyang Zhou, Yongxin Yang, Timothy Hospedales, and Tao Xiang. 2020. Learning to generate novel domains for domain generalization. In European conference on computer vision. Springer, 561–578.Google ScholarGoogle ScholarDigital LibraryDigital Library
  48. Kaiyang Zhou, Yongxin Yang, Yu Qiao, and Tao Xiang. 2021. Domain generalization with mixstyle. arXiv preprint arXiv:2104.02008(2021).Google ScholarGoogle Scholar

Index Terms

  1. G-PReDICT: Generalizable Person Re-ID using Domain Invariant Contrastive Techniques✱

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Other conferences
        ICVGIP '22: Proceedings of the Thirteenth Indian Conference on Computer Vision, Graphics and Image Processing
        December 2022
        506 pages
        ISBN:9781450398220
        DOI:10.1145/3571600

        Copyright © 2022 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 12 May 2023

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article
        • Research
        • Refereed limited

        Acceptance Rates

        Overall Acceptance Rate95of286submissions,33%
      • Article Metrics

        • Downloads (Last 12 months)20
        • Downloads (Last 6 weeks)6

        Other Metrics

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      HTML Format

      View this article in HTML Format .

      View HTML Format