skip to main content
10.1145/3628797.3628961acmotherconferencesArticle/Chapter ViewAbstractPublication PagessoictConference Proceedingsconference-collections
research-article

IQAGA: Image Quality Assessment-Driven Learning with GAN-Based Dataset Augmentation for Cross-Domain Person Re-Identification

Published: 07 December 2023 Publication History

Abstract

Person re-identification (reID) is the task of matching images of the same person across different cameras or domains. It has many applications in security, surveillance, and biometrics. However, supervised learning-based person reID faces the challenge of domain shift, which means that the performance of a model trained on a specific domain (source domain) may degrade when testing on another domain (target domain) with different distributions, backgrounds, and lighting conditions. To enhance the generalization of person reID models, we propose a new approach consisting of three components: GAN-based data augmentation, cross-domain learning, and evaluation modules. Particularly, Generative Adversarial Network (GAN) approaches are used first to generate synthetic data from real source data by diversifying the environmental condition of the dataset. We then propose a cross-domain learning approach powered by image quality assessment (IQA) to reduce the impact of low-quality images in the combined source data, including synthetic and real source data. The extensive experiments evaluate the superiority of our proposed method over state-of-the-art methods on two famous person reID benchmarks, namely DukeMTMC-reID and Market-1501.

References

[1]
Fadi Boutros, Marco Huber, Patrick Siebke, Tim Rieber, and Naser Damer. 2022. SFace: Privacy-friendly and Accurate Face Recognition using Synthetic Data. In IEEE International Joint Conference on Biometrics, IJCB 2022, Abu Dhabi, United Arab Emirates, October 10-13, 2022. IEEE, USA, 1–11. https://doi.org/10.1109/IJCB54206.2022.10007961
[2]
H. Chen, Y. Wang, B. Lagadec, A. Dantcheva, and F. Bremond. 2023. Learning Invariance From Generated Variance for Unsupervised Person Re-Identification. IEEE Transactions on Pattern Analysis and Machine Intelligence 45, 06 (jun 2023), 7494–7508. https://doi.org/10.1109/TPAMI.2022.3226866
[3]
Yunjey Choi, Minje Choi, Munyoung Kim, Jung-Woo Ha, Sunghun Kim, and Jaegul Choo. 2018. StarGAN: Unified Generative Adversarial Networks for Multi-domain Image-to-Image Translation. In 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. IEEE, New York City, USA, 8789–8797. https://doi.org/10.1109/CVPR.2018.00916
[4]
Weijian Deng, Liang Zheng, Qixiang Ye, Guoliang Kang, Yi Yang, and Jianbin Jiao. 2018. Image-Image Domain Adaptation with Preserved Self-Similarity and Domain-Dissimilarity for Person Re-identification. In 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. IEEE, New York City, USA, 994–1003. https://doi.org/10.1109/CVPR.2018.00110
[5]
Hanh P. Du, Anh D. Nguyen, Dat T. Nguyen, and Hoa N. Nguyen. 2023. μ PEWFace: Parallel ensemble of weighted deep convolutional neural networks with novel loss functions for face-based authentication. Image and Vision Computing 139 (2023), 104819. https://doi.org/10.1016/j.imavis.2023.104819
[6]
Hanh P. Du, Anh D. Nguyen, Dat T. Nguyen, and Hoa N. Nguyen. 2023. A Novel Deep Ensemble Learning to Enhance User Authentication in Autonomous Vehicles. IEEE Transactions on Automation Science and Engineering (T-ASE) (2023), 1–14. https://doi.org/10.1109/TASE.2023.3270764
[7]
Yixiao Ge, Zhuowan Li, Haiyu Zhao, Guojun Yin, Shuai Yi, Xiaogang Wang, and Hongsheng Li. 2018. FD-GAN: Pose-Guided Feature Distilling GAN for Robust Person Re-Identification. In Proceedings of the 32nd International Conference on Neural Information Processing Systems (Montréal, Canada) (NIPS’18). Curran Associates Inc., Red Hook, NY, USA, 1230–1241. https://dl.acm.org/doi/10.5555/3326943.3327056
[8]
Ian Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, and Yoshua Bengio. 2014. Generative Adversarial Nets. In Advances in Neural Information Processing Systems, Z. Ghahramani, M. Welling, C. Cortes, N. Lawrence, and K.Q. Weinberger (Eds.). Vol. 27. Curran Associates, Inc., Red Hook, NY, USA. https://proceedings.neurips.cc/paper_files/paper/2014/file/5ca3e9b122f61f8f06494c97b1afccf3-Paper.pdf
[9]
Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep Residual Learning for Image Recognition. In 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). IEEE, New York City, USA, 770–778. https://doi.org/10.1109/CVPR.2016.90
[10]
Y. Huang, Q. Wu, J. Xu, and Y. Zhong. 2019. SBSGAN: Suppression of Inter-Domain Background Shift for Person Re-Identification. In 2019 IEEE/CVF International Conference on Computer Vision (ICCV). IEEE, New York City, USA, 9526–9535. https://doi.org/10.1109/ICCV.2019.00962
[11]
Glenn Jocher, Ayush Chaurasia, and Jing Qiu. 2023. YOLO by Ultralytics. Ultralytics. https://github.com/ultralytics/ultralytics
[12]
Mahesh Joshi, Mark Dredze, William W. Cohen, and Carolyn Rosé. 2012. Multi-Domain Learning: When Do Domains Matter?. In Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning. Association for Computational Linguistics, Jeju Island, Korea, 1302–1312. https://aclanthology.org/D12-1119
[13]
Amena Khatun, Simon Denman, Sridha Sridharan, and Clinton Fookes. 2021. End-to-End Domain Adaptive Attention Network for Cross-Domain Person Re-Identification. IEEE Transactions on Information Forensics and Security 16 (2021), 3803–3813. https://doi.org/10.1109/TIFS.2021.3088012
[14]
Minchul Kim, Anil K. Jain, and Xiaoming Liu. 2022. AdaFace: Quality Adaptive Margin for Face Recognition. In 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE, New York City, USA, 18729–18738. https://doi.org/10.1109/CVPR52688.2022.01819
[15]
Minchul Kim, Anil K. Jain, and Xiaoming Liu. 2022. AdaFace: Quality Adaptive Margin for Face Recognition. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE, New York City, USA, 18750–18759.
[16]
Ha V. Le, Tu N. Nguyen, Hoa N. Nguyen, and Linh Le. 2023. An Efficient Hybrid Webshell Detection Method for Webserver of Marine Transportation Systems. IEEE Transactions on Intelligent Transportation Systems 24, 2 (2023), 2630–2642. https://doi.org/10.1109/TITS.2021.3122979
[17]
W. Li, R. Zhao, T. Xiao, and X. Wang. 2014. DeepReID: Deep Filter Pairing Neural Network for Person Re-identification. In 2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). IEEE, New York City, USA, 152–159. https://doi.org/10.1109/CVPR.2014.27
[18]
Wei Li, Xiatian Zhu, and Shaogang Gong. 2017. Person Re-Identification by Deep Joint Learning of Multi-Loss Classification. In Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, IJCAI-17. AAAI Press, Washington D.C, USA, 2194–2200. https://doi.org/10.24963/ijcai.2017/305
[19]
Minghui Liao, Zhaoyi Wan, Cong Yao, Kai Chen, and Xiang Bai. 2020. Real-Time Scene Text Detection with Differentiable Binarization. Proceedings of the AAAI Conference on Artificial Intelligence 34, 07 (Apr. 2020), 11474–11481. https://doi.org/10.1609/aaai.v34i07.6812
[20]
Jiawei Liu, Zheng-Jun Zha, Di Chen, Richang Hong, and Meng Wang. 2019. Adaptive Transfer Network for Cross-Domain Person Re-Identification. In 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE, New York City, USA, 7195–7204. https://doi.org/10.1109/CVPR.2019.00737
[21]
H. Luo, Y. Gu, X. Liao, S. Lai, and W. Jiang. 2019. Bag of Tricks and a Strong Baseline for Deep Person Re-Identification. In 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW). IEEE, New York City, USA, 1487–1495. https://doi.org/10.1109/CVPRW.2019.00190
[22]
Hao Luo, Wei Jiang, Youzhi Gu, Fuxu Liu, Xingyu Liao, Shenqi Lai, and Jianyang Gu. 2020. A Strong Baseline and Batch Normalization Neck for Deep Person Re-Identification. IEEE Transactions on Multimedia 22, 10 (2020), 2597–2609. https://doi.org/10.1109/TMM.2019.2958756
[23]
Jiaxu Miao, Yu Wu, Ping Liu, Yuhang Ding, and Yi Yang. 2019. Pose-Guided Feature Alignment for Occluded Person Re-Identification. In 2019 IEEE/CVF International Conference on Computer Vision (ICCV). IEEE, New York City, USA, 542–551. https://doi.org/10.1109/ICCV.2019.00063
[24]
Anh D. Nguyen, Dat T. Nguyen, Hai N. Dao, Hai H. Le, and Nam Q. Tran. 2022. Impact Analysis of Different Effective Loss Functions by Using Deep Convolutional Neural Network for Face Recognition. In From Born-Physical to Born-Virtual: Augmenting Intelligence in Digital Libraries, Yuen-Hsien Tseng, Marie Katsurai, and Hoa N. Nguyen (Eds.). Springer International Publishing, Cham, 101–111. https://doi.org/10.1007/978-3-031-21756-2_8
[25]
Anh D. Nguyen, Dat T. Nguyen, Hanh P. Du, Hai N. Dao, and Hoa N. Nguyen. 2022. EnsFace: An Ensemble Method of Deep Convolutional Neural Networks with Novel Effective Loss Functions for Face Recognition. In The 11th International Symposium on Information and Communication Technology(SoICT 2022). ACM, New York, NY, USA, 231–238. https://doi.org/10.1145/3568562.3568638
[26]
Anh D. Nguyen, Dang H. Pham, and Hoa N. Nguyen. 2023. GAN-Based Data Augmentation and Pseudo-label Refinement for Unsupervised Domain Adaptation Person Re-identification. In Computational Collective Intelligence, Ngoc Thanh Nguyen, János Botzheim, László Gulyás, Manuel Núñez, Jan Treur, Gottfried Vossen, and Adrianna Kozierkiewicz (Eds.). Springer Nature Switzerland, Cham, 591–605. https://doi.org/10.1007/978-3-031-41456-5_45
[27]
Xuelin Qian, Yanwei Fu, Tao Xiang, Wenxuan Wang, Jie Qiu, Yang Wu, Yu-Gang Jiang, and Xiangyang Xue. 2018. Pose-Normalized Image Generation for Person Re-identification. In Computer Vision – ECCV 2018, Vittorio Ferrari, Martial Hebert, Cristian Sminchisescu, and Yair Weiss (Eds.). Springer International Publishing, Cham, 661–678. https://doi.org/10.1007/978-3-030-01240-3_40
[28]
Ergys Ristani, Francesco Solera, Roger Zou, Rita Cucchiara, and Carlo Tomasi. 2016. Performance Measures and a Data Set for Multi-target, Multi-camera Tracking. In Computer Vision – ECCV 2016 Workshops, Gang Hua and Hervé Jégou (Eds.). Springer International Publishing, Cham, 17–35. https://doi.org/10.1007/978-3-319-48881-3_2
[29]
Vladimir Somers, Christophe De Vleeschouwer, and Alexandre Alahi. 2023. Body Part-Based Representation Learning for Occluded Person Re-Identification. In 2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV). IEEE, New York City, USA, 1613–1623. https://doi.org/10.1109/WACV56688.2023.00166
[30]
Yuanpeng Tu. 2022. Domain Camera Adaptation and Collaborative Multiple Feature Clustering for Unsupervised Person Re-ID. In Proceedings of the 3rd International Workshop on Human-Centric Multimedia Analysis. ACM, New York, NY, USA, 51–59. https://doi.org/10.1145/3552458.3556446
[31]
Longhui Wei, Shiliang Zhang, Wen Gao, and Qi Tian. 2018. Person Transfer GAN to Bridge Domain Gap for Person Re-identification. In 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. IEEE, New York City, USA, 79–88. https://doi.org/10.1109/CVPR.2018.00016
[32]
Jiali Xi, Jianqiang Huang, Shibao Zheng, Qin Zhou, Bernt Schiele, Xian-Sheng Hua, and Qianru Sun. 2023. Learning comprehensive global features in person re-identification: Ensuring discriminativeness of more local regions. Pattern Recognition 134 (2023), 109068. https://doi.org/10.1016/j.patcog.2022.109068
[33]
Jing Xu, Rui Zhao, Feng Zhu, Huaming Wang, and Wanli Ouyang. 2018. Attention-Aware Compositional Network for Person Re-identification. In 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. IEEE, New York City, USA, 2119–2128. https://doi.org/10.1109/CVPR.2018.00226
[34]
Fengxiang Yang, Zhun Zhong, Zhiming Luo, Sheng Lian, and Shaozi Li. 2020. Leveraging Virtual and Real Person for Unsupervised Person Re-Identification. IEEE Transactions on Multimedia 22, 9 (2020), 2444–2453. https://doi.org/10.1109/TMM.2019.2957928
[35]
Dong Yi, Zhen Lei, Shengcai Liao, and Stan Z. Li. 2014. Deep Metric Learning for Person Re-identification. In 2014 22nd International Conference on Pattern Recognition. IEEE, New York City, USA, 34–39. https://doi.org/10.1109/ICPR.2014.16
[36]
D. Yi, Z. Lei, S. Liao, and S. Z. Li. 2014. Deep Metric Learning for Person Re-identification. In 2014 22nd International Conference on Pattern Recognition (ICPR). IEEE, New York City, USA, 34–39. https://doi.org/10.1109/ICPR.2014.16
[37]
Zelong Zeng, Zhixiang Wang, Zheng Wang, Yinqiang Zheng, Yung-Yu Chuang, and Shin’ichi Satoh. 2020. Illumination-Adaptive Person Re-Identification. IEEE Transactions on Multimedia 22, 12 (2020), 3064–3074. https://doi.org/10.1109/TMM.2020.2969782
[38]
Liang Zheng, Liyue Shen, Lu Tian, Shengjin Wang, Jingdong Wang, and Qi Tian. 2015. Scalable Person Re-identification: A Benchmark. In 2015 IEEE International Conference on Computer Vision (ICCV). IEEE, New York City, USA, 1116–1124. https://doi.org/10.1109/ICCV.2015.133
[39]
Zhedong Zheng, Xiaodong Yang, Zhiding Yu, Liang Zheng, Yi Yang, and Jan Kautz. 2019. Joint Discriminative and Generative Learning for Person Re-Identification. In 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE, New York City, USA, 2133–2142. https://doi.org/10.1109/CVPR.2019.00224
[40]
Zhun Zhong, Liang Zheng, Zhedong Zheng, Shaozi Li, and Yi Yang. 2018. Camera Style Adaptation for Person Re-identification. In 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. IEEE, New York City, USA, 5157–5166. https://doi.org/10.1109/CVPR.2018.00541
[41]
Jun-Yan Zhu, Taesung Park, Phillip Isola, and Alexei A. Efros. 2017. Unpaired Image-to-Image Translation Using Cycle-Consistent Adversarial Networks. In 2017 IEEE International Conference on Computer Vision (ICCV). IEEE, Piscataway, NJ, USA, 2242–2251. https://doi.org/10.1109/ICCV.2017.244
[42]
Yang Zou, Xiaodong Yang, Zhiding Yu, B. V. K. Vijaya Kumar, and Jan Kautz. 2020. Joint Disentangling and Adaptation for Cross-Domain Person Re-Identification. In Computer Vision – ECCV 2020, Andrea Vedaldi, Horst Bischof, Thomas Brox, and Jan-Michael Frahm (Eds.). Springer International Publishing, Cham, 87–104.

Cited By

View all
  • (2024)GAN-based data augmentation and pseudo-label refinement with holistic features for unsupervised domain adaptation person re-identificationKnowledge-Based Systems10.1016/j.knosys.2024.111471288:COnline publication date: 25-Jun-2024
  • (2024)Context-Preserved Spatial Normalization Based Person Image GenerationAdvanced Intelligent Computing Technology and Applications10.1007/978-981-97-5678-0_27(312-323)Online publication date: 1-Aug-2024
  • (2024)cMDTPS: Comprehensive Masked Modality Modeling with Improved Similarity Distribution Matching Loss for Text-based Person SearchThe 13th Conference on Information Technology and Its Applications10.1007/978-3-031-74127-2_16(184-196)Online publication date: 8-Nov-2024

Index Terms

  1. IQAGA: Image Quality Assessment-Driven Learning with GAN-Based Dataset Augmentation for Cross-Domain Person Re-Identification

          Recommendations

          Comments

          Information & Contributors

          Information

          Published In

          cover image ACM Other conferences
          SOICT '23: Proceedings of the 12th International Symposium on Information and Communication Technology
          December 2023
          1058 pages
          ISBN:9798400708916
          DOI:10.1145/3628797
          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          Published: 07 December 2023

          Permissions

          Request permissions for this article.

          Check for updates

          Author Tags

          1. Deep Convolutional Neural Network
          2. Generative Adversarial Network
          3. Image Quality Assessment.
          4. Person Re-Identification

          Qualifiers

          • Research-article
          • Research
          • Refereed limited

          Conference

          SOICT 2023

          Acceptance Rates

          Overall Acceptance Rate 147 of 318 submissions, 46%

          Contributors

          Other Metrics

          Bibliometrics & Citations

          Bibliometrics

          Article Metrics

          • Downloads (Last 12 months)30
          • Downloads (Last 6 weeks)1
          Reflects downloads up to 01 Mar 2025

          Other Metrics

          Citations

          Cited By

          View all
          • (2024)GAN-based data augmentation and pseudo-label refinement with holistic features for unsupervised domain adaptation person re-identificationKnowledge-Based Systems10.1016/j.knosys.2024.111471288:COnline publication date: 25-Jun-2024
          • (2024)Context-Preserved Spatial Normalization Based Person Image GenerationAdvanced Intelligent Computing Technology and Applications10.1007/978-981-97-5678-0_27(312-323)Online publication date: 1-Aug-2024
          • (2024)cMDTPS: Comprehensive Masked Modality Modeling with Improved Similarity Distribution Matching Loss for Text-based Person SearchThe 13th Conference on Information Technology and Its Applications10.1007/978-3-031-74127-2_16(184-196)Online publication date: 8-Nov-2024

          View Options

          Login options

          View options

          PDF

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader

          HTML Format

          View this article in HTML Format.

          HTML Format

          Figures

          Tables

          Media

          Share

          Share

          Share this Publication link

          Share on social media