research-article

IQAGA: Image Quality Assessment-Driven Learning with GAN-Based Dataset Augmentation for Cross-Domain Person Re-Identification

Authors:

Hoa N. NguyenAuthors Info & Claims

SOICT '23: Proceedings of the 12th International Symposium on Information and Communication Technology

Pages 63 - 70

https://doi.org/10.1145/3628797.3628961

Published: 07 December 2023 Publication History

Abstract

Person re-identification (reID) is the task of matching images of the same person across different cameras or domains. It has many applications in security, surveillance, and biometrics. However, supervised learning-based person reID faces the challenge of domain shift, which means that the performance of a model trained on a specific domain (source domain) may degrade when testing on another domain (target domain) with different distributions, backgrounds, and lighting conditions. To enhance the generalization of person reID models, we propose a new approach consisting of three components: GAN-based data augmentation, cross-domain learning, and evaluation modules. Particularly, Generative Adversarial Network (GAN) approaches are used first to generate synthetic data from real source data by diversifying the environmental condition of the dataset. We then propose a cross-domain learning approach powered by image quality assessment (IQA) to reduce the impact of low-quality images in the combined source data, including synthetic and real source data. The extensive experiments evaluate the superiority of our proposed method over state-of-the-art methods on two famous person reID benchmarks, namely DukeMTMC-reID and Market-1501.

References

[1]

Fadi Boutros, Marco Huber, Patrick Siebke, Tim Rieber, and Naser Damer. 2022. SFace: Privacy-friendly and Accurate Face Recognition using Synthetic Data. In IEEE International Joint Conference on Biometrics, IJCB 2022, Abu Dhabi, United Arab Emirates, October 10-13, 2022. IEEE, USA, 1–11. https://doi.org/10.1109/IJCB54206.2022.10007961

[2]

H. Chen, Y. Wang, B. Lagadec, A. Dantcheva, and F. Bremond. 2023. Learning Invariance From Generated Variance for Unsupervised Person Re-Identification. IEEE Transactions on Pattern Analysis and Machine Intelligence 45, 06 (jun 2023), 7494–7508. https://doi.org/10.1109/TPAMI.2022.3226866

Digital Library

[3]

Yunjey Choi, Minje Choi, Munyoung Kim, Jung-Woo Ha, Sunghun Kim, and Jaegul Choo. 2018. StarGAN: Unified Generative Adversarial Networks for Multi-domain Image-to-Image Translation. In 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. IEEE, New York City, USA, 8789–8797. https://doi.org/10.1109/CVPR.2018.00916

[4]

Weijian Deng, Liang Zheng, Qixiang Ye, Guoliang Kang, Yi Yang, and Jianbin Jiao. 2018. Image-Image Domain Adaptation with Preserved Self-Similarity and Domain-Dissimilarity for Person Re-identification. In 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. IEEE, New York City, USA, 994–1003. https://doi.org/10.1109/CVPR.2018.00110

[5]

Hanh P. Du, Anh D. Nguyen, Dat T. Nguyen, and Hoa N. Nguyen. 2023. μ PEWFace: Parallel ensemble of weighted deep convolutional neural networks with novel loss functions for face-based authentication. Image and Vision Computing 139 (2023), 104819. https://doi.org/10.1016/j.imavis.2023.104819

Digital Library

[6]

Hanh P. Du, Anh D. Nguyen, Dat T. Nguyen, and Hoa N. Nguyen. 2023. A Novel Deep Ensemble Learning to Enhance User Authentication in Autonomous Vehicles. IEEE Transactions on Automation Science and Engineering (T-ASE) (2023), 1–14. https://doi.org/10.1109/TASE.2023.3270764

[7]

Yixiao Ge, Zhuowan Li, Haiyu Zhao, Guojun Yin, Shuai Yi, Xiaogang Wang, and Hongsheng Li. 2018. FD-GAN: Pose-Guided Feature Distilling GAN for Robust Person Re-Identification. In Proceedings of the 32nd International Conference on Neural Information Processing Systems (Montréal, Canada) (NIPS’18). Curran Associates Inc., Red Hook, NY, USA, 1230–1241. https://dl.acm.org/doi/10.5555/3326943.3327056

[8]

Ian Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, and Yoshua Bengio. 2014. Generative Adversarial Nets. In Advances in Neural Information Processing Systems, Z. Ghahramani, M. Welling, C. Cortes, N. Lawrence, and K.Q. Weinberger (Eds.). Vol. 27. Curran Associates, Inc., Red Hook, NY, USA. https://proceedings.neurips.cc/paper_files/paper/2014/file/5ca3e9b122f61f8f06494c97b1afccf3-Paper.pdf

[9]

Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep Residual Learning for Image Recognition. In 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). IEEE, New York City, USA, 770–778. https://doi.org/10.1109/CVPR.2016.90

[10]

Y. Huang, Q. Wu, J. Xu, and Y. Zhong. 2019. SBSGAN: Suppression of Inter-Domain Background Shift for Person Re-Identification. In 2019 IEEE/CVF International Conference on Computer Vision (ICCV). IEEE, New York City, USA, 9526–9535. https://doi.org/10.1109/ICCV.2019.00962

[11]

Glenn Jocher, Ayush Chaurasia, and Jing Qiu. 2023. YOLO by Ultralytics. Ultralytics. https://github.com/ultralytics/ultralytics

[12]

Mahesh Joshi, Mark Dredze, William W. Cohen, and Carolyn Rosé. 2012. Multi-Domain Learning: When Do Domains Matter?. In Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning. Association for Computational Linguistics, Jeju Island, Korea, 1302–1312. https://aclanthology.org/D12-1119

[13]

Amena Khatun, Simon Denman, Sridha Sridharan, and Clinton Fookes. 2021. End-to-End Domain Adaptive Attention Network for Cross-Domain Person Re-Identification. IEEE Transactions on Information Forensics and Security 16 (2021), 3803–3813. https://doi.org/10.1109/TIFS.2021.3088012

[14]

Minchul Kim, Anil K. Jain, and Xiaoming Liu. 2022. AdaFace: Quality Adaptive Margin for Face Recognition. In 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE, New York City, USA, 18729–18738. https://doi.org/10.1109/CVPR52688.2022.01819

[15]

Minchul Kim, Anil K. Jain, and Xiaoming Liu. 2022. AdaFace: Quality Adaptive Margin for Face Recognition. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE, New York City, USA, 18750–18759.

[16]

Ha V. Le, Tu N. Nguyen, Hoa N. Nguyen, and Linh Le. 2023. An Efficient Hybrid Webshell Detection Method for Webserver of Marine Transportation Systems. IEEE Transactions on Intelligent Transportation Systems 24, 2 (2023), 2630–2642. https://doi.org/10.1109/TITS.2021.3122979

[17]

W. Li, R. Zhao, T. Xiao, and X. Wang. 2014. DeepReID: Deep Filter Pairing Neural Network for Person Re-identification. In 2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). IEEE, New York City, USA, 152–159. https://doi.org/10.1109/CVPR.2014.27

Digital Library

[18]

Wei Li, Xiatian Zhu, and Shaogang Gong. 2017. Person Re-Identification by Deep Joint Learning of Multi-Loss Classification. In Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, IJCAI-17. AAAI Press, Washington D.C, USA, 2194–2200. https://doi.org/10.24963/ijcai.2017/305

[19]

Minghui Liao, Zhaoyi Wan, Cong Yao, Kai Chen, and Xiang Bai. 2020. Real-Time Scene Text Detection with Differentiable Binarization. Proceedings of the AAAI Conference on Artificial Intelligence 34, 07 (Apr. 2020), 11474–11481. https://doi.org/10.1609/aaai.v34i07.6812

[20]

Jiawei Liu, Zheng-Jun Zha, Di Chen, Richang Hong, and Meng Wang. 2019. Adaptive Transfer Network for Cross-Domain Person Re-Identification. In 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE, New York City, USA, 7195–7204. https://doi.org/10.1109/CVPR.2019.00737

[21]

H. Luo, Y. Gu, X. Liao, S. Lai, and W. Jiang. 2019. Bag of Tricks and a Strong Baseline for Deep Person Re-Identification. In 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW). IEEE, New York City, USA, 1487–1495. https://doi.org/10.1109/CVPRW.2019.00190

[22]

Hao Luo, Wei Jiang, Youzhi Gu, Fuxu Liu, Xingyu Liao, Shenqi Lai, and Jianyang Gu. 2020. A Strong Baseline and Batch Normalization Neck for Deep Person Re-Identification. IEEE Transactions on Multimedia 22, 10 (2020), 2597–2609. https://doi.org/10.1109/TMM.2019.2958756

[23]

Jiaxu Miao, Yu Wu, Ping Liu, Yuhang Ding, and Yi Yang. 2019. Pose-Guided Feature Alignment for Occluded Person Re-Identification. In 2019 IEEE/CVF International Conference on Computer Vision (ICCV). IEEE, New York City, USA, 542–551. https://doi.org/10.1109/ICCV.2019.00063

[24]

Anh D. Nguyen, Dat T. Nguyen, Hai N. Dao, Hai H. Le, and Nam Q. Tran. 2022. Impact Analysis of Different Effective Loss Functions by Using Deep Convolutional Neural Network for Face Recognition. In From Born-Physical to Born-Virtual: Augmenting Intelligence in Digital Libraries, Yuen-Hsien Tseng, Marie Katsurai, and Hoa N. Nguyen (Eds.). Springer International Publishing, Cham, 101–111. https://doi.org/10.1007/978-3-031-21756-2_8

Digital Library

[25]

Anh D. Nguyen, Dat T. Nguyen, Hanh P. Du, Hai N. Dao, and Hoa N. Nguyen. 2022. EnsFace: An Ensemble Method of Deep Convolutional Neural Networks with Novel Effective Loss Functions for Face Recognition. In The 11th International Symposium on Information and Communication Technology(SoICT 2022). ACM, New York, NY, USA, 231–238. https://doi.org/10.1145/3568562.3568638

Digital Library

[26]

Anh D. Nguyen, Dang H. Pham, and Hoa N. Nguyen. 2023. GAN-Based Data Augmentation and Pseudo-label Refinement for Unsupervised Domain Adaptation Person Re-identification. In Computational Collective Intelligence, Ngoc Thanh Nguyen, János Botzheim, László Gulyás, Manuel Núñez, Jan Treur, Gottfried Vossen, and Adrianna Kozierkiewicz (Eds.). Springer Nature Switzerland, Cham, 591–605. https://doi.org/10.1007/978-3-031-41456-5_45

Digital Library

[27]

Xuelin Qian, Yanwei Fu, Tao Xiang, Wenxuan Wang, Jie Qiu, Yang Wu, Yu-Gang Jiang, and Xiangyang Xue. 2018. Pose-Normalized Image Generation for Person Re-identification. In Computer Vision – ECCV 2018, Vittorio Ferrari, Martial Hebert, Cristian Sminchisescu, and Yair Weiss (Eds.). Springer International Publishing, Cham, 661–678. https://doi.org/10.1007/978-3-030-01240-3_40

Digital Library

[28]

Ergys Ristani, Francesco Solera, Roger Zou, Rita Cucchiara, and Carlo Tomasi. 2016. Performance Measures and a Data Set for Multi-target, Multi-camera Tracking. In Computer Vision – ECCV 2016 Workshops, Gang Hua and Hervé Jégou (Eds.). Springer International Publishing, Cham, 17–35. https://doi.org/10.1007/978-3-319-48881-3_2

[29]

Vladimir Somers, Christophe De Vleeschouwer, and Alexandre Alahi. 2023. Body Part-Based Representation Learning for Occluded Person Re-Identification. In 2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV). IEEE, New York City, USA, 1613–1623. https://doi.org/10.1109/WACV56688.2023.00166

[30]

Yuanpeng Tu. 2022. Domain Camera Adaptation and Collaborative Multiple Feature Clustering for Unsupervised Person Re-ID. In Proceedings of the 3rd International Workshop on Human-Centric Multimedia Analysis. ACM, New York, NY, USA, 51–59. https://doi.org/10.1145/3552458.3556446

Digital Library

[31]

Longhui Wei, Shiliang Zhang, Wen Gao, and Qi Tian. 2018. Person Transfer GAN to Bridge Domain Gap for Person Re-identification. In 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. IEEE, New York City, USA, 79–88. https://doi.org/10.1109/CVPR.2018.00016

[32]

Jiali Xi, Jianqiang Huang, Shibao Zheng, Qin Zhou, Bernt Schiele, Xian-Sheng Hua, and Qianru Sun. 2023. Learning comprehensive global features in person re-identification: Ensuring discriminativeness of more local regions. Pattern Recognition 134 (2023), 109068. https://doi.org/10.1016/j.patcog.2022.109068

Digital Library

[33]

Jing Xu, Rui Zhao, Feng Zhu, Huaming Wang, and Wanli Ouyang. 2018. Attention-Aware Compositional Network for Person Re-identification. In 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. IEEE, New York City, USA, 2119–2128. https://doi.org/10.1109/CVPR.2018.00226

[34]

Fengxiang Yang, Zhun Zhong, Zhiming Luo, Sheng Lian, and Shaozi Li. 2020. Leveraging Virtual and Real Person for Unsupervised Person Re-Identification. IEEE Transactions on Multimedia 22, 9 (2020), 2444–2453. https://doi.org/10.1109/TMM.2019.2957928

[35]

Dong Yi, Zhen Lei, Shengcai Liao, and Stan Z. Li. 2014. Deep Metric Learning for Person Re-identification. In 2014 22nd International Conference on Pattern Recognition. IEEE, New York City, USA, 34–39. https://doi.org/10.1109/ICPR.2014.16

Digital Library

[36]

D. Yi, Z. Lei, S. Liao, and S. Z. Li. 2014. Deep Metric Learning for Person Re-identification. In 2014 22nd International Conference on Pattern Recognition (ICPR). IEEE, New York City, USA, 34–39. https://doi.org/10.1109/ICPR.2014.16

Digital Library

[37]

Zelong Zeng, Zhixiang Wang, Zheng Wang, Yinqiang Zheng, Yung-Yu Chuang, and Shin’ichi Satoh. 2020. Illumination-Adaptive Person Re-Identification. IEEE Transactions on Multimedia 22, 12 (2020), 3064–3074. https://doi.org/10.1109/TMM.2020.2969782

Digital Library

[38]

Liang Zheng, Liyue Shen, Lu Tian, Shengjin Wang, Jingdong Wang, and Qi Tian. 2015. Scalable Person Re-identification: A Benchmark. In 2015 IEEE International Conference on Computer Vision (ICCV). IEEE, New York City, USA, 1116–1124. https://doi.org/10.1109/ICCV.2015.133

[39]

Zhedong Zheng, Xiaodong Yang, Zhiding Yu, Liang Zheng, Yi Yang, and Jan Kautz. 2019. Joint Discriminative and Generative Learning for Person Re-Identification. In 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE, New York City, USA, 2133–2142. https://doi.org/10.1109/CVPR.2019.00224

[40]

Zhun Zhong, Liang Zheng, Zhedong Zheng, Shaozi Li, and Yi Yang. 2018. Camera Style Adaptation for Person Re-identification. In 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. IEEE, New York City, USA, 5157–5166. https://doi.org/10.1109/CVPR.2018.00541

[41]

Jun-Yan Zhu, Taesung Park, Phillip Isola, and Alexei A. Efros. 2017. Unpaired Image-to-Image Translation Using Cycle-Consistent Adversarial Networks. In 2017 IEEE International Conference on Computer Vision (ICCV). IEEE, Piscataway, NJ, USA, 2242–2251. https://doi.org/10.1109/ICCV.2017.244

[42]

Yang Zou, Xiaodong Yang, Zhiding Yu, B. V. K. Vijaya Kumar, and Jan Kautz. 2020. Joint Disentangling and Adaptation for Cross-Domain Person Re-Identification. In Computer Vision – ECCV 2020, Andrea Vedaldi, Horst Bischof, Thomas Brox, and Jan-Michael Frahm (Eds.). Springer International Publishing, Cham, 87–104.

Digital Library

Cited By

Pham DNguyen ANguyen H(2024)GAN-based data augmentation and pseudo-label refinement with holistic features for unsupervised domain adaptation person re-identificationKnowledge-Based Systems10.1016/j.knosys.2024.111471288:COnline publication date: 25-Jun-2024
https://dl.acm.org/doi/10.1016/j.knosys.2024.111471
Wei WLiang MDuan X(2024)Context-Preserved Spatial Normalization Based Person Image GenerationAdvanced Intelligent Computing Technology and Applications10.1007/978-981-97-5678-0_27(312-323)Online publication date: 1-Aug-2024
https://doi.org/10.1007/978-981-97-5678-0_27
Nguyen APham DNguyen DNguyen H(2024)cMDTPS: Comprehensive Masked Modality Modeling with Improved Similarity Distribution Matching Loss for Text-based Person SearchThe 13th Conference on Information Technology and Its Applications10.1007/978-3-031-74127-2_16(184-196)Online publication date: 8-Nov-2024
https://doi.org/10.1007/978-3-031-74127-2_16

Index Terms

IQAGA: Image Quality Assessment-Driven Learning with GAN-Based Dataset Augmentation for Cross-Domain Person Re-Identification
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision problems
        Matching
        Object identification
        Tracking
  2. Machine learning
    1. Machine learning approaches
      1. Neural networks

Recommendations

JoT-GAN: A Framework for Jointly Training GAN and Person Re-Identification Model
To cope with the problem caused by inadequate training data, many person re-identification (re-id) methods exploit generative adversarial networks (GAN) for data augmentation, where the training of GAN is typically independent of that of the re-id model. ...
Cross-dataset person re-identification using deep convolutional neural networks: effects of context and domain adaptation

Over the past years, the impact of surveillance systems on public safety increases dramatically. One significant challenge in this domain is person re-identification, which aims to detect whether a person has already been captured by another camera in ...
Study of cross-domain person re-identification based on DCGAN
Abstract
Person re-identification(re-ID) techniques have been rapidly improving with the development of deep neural networks, and the accuracy of fully supervised re-ID models is already very high. However, when person re-identification models with ...

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences

SOICT '23: Proceedings of the 12th International Symposium on Information and Communication Technology

December 2023

1058 pages

ISBN:9798400708916

DOI:10.1145/3628797

Copyright © 2023 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 07 December 2023

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article
Research
Refereed limited

Conference

SOICT 2023

SOICT 2023: The 12th International Symposium on Information and Communication Technology

December 7 - 8, 2023

Ho Chi Minh, Vietnam

Acceptance Rates

Overall Acceptance Rate 147 of 318 submissions, 46%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

3
Total Citations
View Citations
53
Total Downloads

Downloads (Last 12 months)30
Downloads (Last 6 weeks)1

Reflects downloads up to 01 Mar 2025

Other Metrics

View Author Metrics

Citations

Cited By

Pham DNguyen ANguyen H(2024)GAN-based data augmentation and pseudo-label refinement with holistic features for unsupervised domain adaptation person re-identificationKnowledge-Based Systems10.1016/j.knosys.2024.111471288:COnline publication date: 25-Jun-2024
https://dl.acm.org/doi/10.1016/j.knosys.2024.111471
Wei WLiang MDuan X(2024)Context-Preserved Spatial Normalization Based Person Image GenerationAdvanced Intelligent Computing Technology and Applications10.1007/978-981-97-5678-0_27(312-323)Online publication date: 1-Aug-2024
https://doi.org/10.1007/978-981-97-5678-0_27
Nguyen APham DNguyen DNguyen H(2024)cMDTPS: Comprehensive Masked Modality Modeling with Improved Similarity Distribution Matching Loss for Text-based Person SearchThe 13th Conference on Information Technology and Its Applications10.1007/978-3-031-74127-2_16(184-196)Online publication date: 8-Nov-2024
https://doi.org/10.1007/978-3-031-74127-2_16

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

HTML Format

View this article in HTML Format.

Figures

Tables

Media

View Table of Conten