Debiasing vision-language models for vision tasks: a survey

Zhu, Beier; Zhang, Hanwang

doi:10.1007/s11704-024-40051-3

Debiasing vision-language models for vision tasks: a survey

Letter
Published: 12 November 2024

Volume 19, article number 191321, (2025)
Cite this article

Frontiers of Computer Science Aims and scope Submit manuscript

Beier Zhu¹ &
Hanwang Zhang¹

128 Accesses
2 Citations
Explore all metrics

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+

from $39.99 /Month

Starting from 10 chapters or articles per month
Access and download chapters and articles from more than 300k books and 2,500 journals
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

Radford A, Kim J W, Hallacy C, Ramesh A, Goh G, Agarwal S, Sastry G, Askell A, Mishkin P, Clark J, Krueger G, Sutskever I. Learning transferable visual models from natural language supervision. In: Proceedings of the 38th International Conference on Machine Learning. 2021, 8748–8763
Google Scholar
Seth A, Hemani M, Agarwal C. DeAR: debiasing vision-language models with additive residuals. In: Proceedings of 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2023, 6820–6829
Google Scholar
Zhu B, Tang K, Sun Q, Zhang H. Generalized logit adjustment: Calibrating fine-tuned models by removing label bias in foundation models. In: Proceedings of the 37th Conference on Neural Information Processing Systems. 2023, 64663–64680
Google Scholar
Allingham J U, Ren J, Dusenberry M W, Gu X, Cui Y, Tran D, Liu J Z, Lakshminarayanan B. A simple zero-shot prompt weighting technique to improve prompt ensembling in text-image models. In: Proceedings of the 40th International Conference on Machine Learning. 2023, 26
Google Scholar
Wang J, Liu Y, Wang X. Are gender-neutral queries really gender-neutral? Mitigating gender bias in image search. In: Proceedings of 2021 Conference on Empirical Methods in Natural Language Processing. 2021, 1995–2008
Chapter Google Scholar
Wang X, Wu Z, Lian L, Yu S X. Debiased learning from naturally imbalanced pseudo-labels. In: Proceedings of 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2022, 14627–14637
Google Scholar
Cui J, Zhu B, Wen X, Qi X, Yu B, Zhang H. Classes are not equal: an empirical study on image recognition fairness. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition. 2024, 23283–23292
Google Scholar
Zhu B, Niu Y, Lee S, Hur M, Zhang H. Debiased fine-tuning for vision-language models by prompt regularization. In: Proceedings of the 37th AAAI Conference on Artificial Intelligence. 2023, 3834–3842
Google Scholar
Zhang M, Ré C. Contrastive adapters for foundation model group robustness. In: Proceedings of the 36th International Conference on Neural Information Processing Systems. 2022, 1576
Google Scholar
Chuang C Y, Jampani V, Li Y, Torralba A, Jegelka S. Debiasing vision-language models via biased prompts. 2023, arXiv preprint arXiv: 2302.00070
Google Scholar
Parashar S, Lin Z, Liu T, Dong X, Li Y, Ramanan D, Caverlee J, Kong S. The neglected tails in vision-language models. In: Proceedings of IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2024, 12988–12997
Google Scholar
Berg H, Hall S, Bhalgat Y, Kirk H, Shtedritski A, Bain M. A prompt array keeps the bias away: Debiasing vision-language models with adversarial learning. In: Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing. 2022, 806–822
Google Scholar

Download references

Author information

Authors and Affiliations

School of Computer Science and Engineering, Nanyang Technological University, Singapore, 639798, Singapore
Beier Zhu & Hanwang Zhang

Authors

Beier Zhu
View author publications
Search author on:PubMed Google Scholar
Hanwang Zhang
View author publications
Search author on:PubMed Google Scholar

Corresponding author

Correspondence to Beier Zhu.

Ethics declarations

Competing interests The authors declare that they have no competing interests or financial conflicts to disclose.

Electronic Supplementary Material

Debiasing Vision-Language Models for Vision Tasks: A Survey

Rights and permissions

Reprints and permissions

About this article

Cite this article

Zhu, B., Zhang, H. Debiasing vision-language models for vision tasks: a survey. Front. Comput. Sci. 19, 191321 (2025). https://doi.org/10.1007/s11704-024-40051-3

Download citation

Received: 11 January 2024
Accepted: 08 July 2024
Published: 12 November 2024
DOI: https://doi.org/10.1007/s11704-024-40051-3

Part of a collection:

Excellent Young Computer Scientists Vision on Foundation Models

Access this article

Log in via an institution

Subscribe and save

Springer+

from $39.99 /Month

Starting from 10 chapters or articles per month
Access and download chapters and articles from more than 300k books and 2,500 journals
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Debiasing vision-language models for vision tasks: a survey

Access this article

Subscribe and save

Buy Now

References

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Electronic Supplementary Material

Debiasing Vision-Language Models for Vision Tasks: A Survey

Rights and permissions

About this article

Cite this article

Share this article

Subscribe and save

Buy Now