Where Are Biases? Adversarial Debiasing with Spurious Feature Visualization

Chen, Chi-Yu; Ching, Pu; Huang, Pei-Hsin; Hu, Min-Chun

doi:10.1007/978-3-031-53305-1_1

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 14554)

Included in the following conference series:

International Conference on Multimedia Modeling

Abstract

To keep deep learning models from exploiting shortcuts in a training dataset, many debiasing methods have been developed that encourage models to learn from accurate correlations. One line of research builds robust models via adversarial training. Although these methods show promising debiasing performance, it remains unclear precisely which spurious features are discarded during adversarial training. To address this lack of explainability, which matters especially in scenarios with low error tolerance, we design AdvExp, which not only visualizes the spurious features underlying adversarial training but also maintains good debiasing performance with the assistance of a robust optimization algorithm. We show promising performance of AdvExp on BiasCheXpert, a dataset subsampled from CheXpert, and uncover potential regions in radiographs that deep neural networks recognize as gender- or race-related features.
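The abstract does not say which robust optimization algorithm AdvExp uses. As a rough illustration only, the sketch below assumes a worst-group reweighting scheme in the style of distributionally robust optimization: groups with higher loss receive exponentially more weight, so the training objective shifts toward the hardest group. The function names, the four groups, the loss values, and the step size are all hypothetical and are not taken from the paper.

```python
import math

def update_group_weights(weights, group_losses, step_size=0.1):
    """Exponentiated-gradient step: up-weight groups with higher loss.

    Assumption: this reweighting scheme is illustrative, not AdvExp's
    confirmed algorithm.
    """
    scaled = [w * math.exp(step_size * loss)
              for w, loss in zip(weights, group_losses)]
    total = sum(scaled)
    # Renormalize so the weights remain a probability distribution.
    return [s / total for s in scaled]

def robust_loss(weights, group_losses):
    """Weighted sum of per-group losses used as the training objective."""
    return sum(w * loss for w, loss in zip(weights, group_losses))

# Hypothetical per-group losses, e.g. (label, attribute) combinations.
group_losses = [0.2, 0.9, 0.3, 1.4]
weights = [0.25] * 4  # start from uniform weights

# Losses are held fixed here to isolate the weight dynamics; in real
# training they would change as the model is updated.
for _ in range(50):
    weights = update_group_weights(weights, group_losses)

average_loss = sum(group_losses) / len(group_losses)
worst_group = max(range(len(weights)), key=lambda g: weights[g])
```

With fixed losses, the weights concentrate on the group with the largest loss, so `robust_loss(weights, group_losses)` ends up above the plain average and approaches the worst-group loss.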

Author information

Authors and Affiliations

National Yang Ming Chiao Tung University, Taipei, Taiwan
Chi-Yu Chen
National Tsing Hua University, Hsinchu, Taiwan
Pu Ching, Pei-Hsin Huang & Min-Chun Hu

Corresponding author

Correspondence to Chi-Yu Chen.

Editor information

Editors and Affiliations

University of Amsterdam, Amsterdam, The Netherlands
Stevan Rudinac
Delft University of Technology, Delft, The Netherlands
Alan Hanjalic
Delft University of Technology, Delft, The Netherlands
Cynthia Liem
University of Amsterdam, Amsterdam, The Netherlands
Marcel Worring
Reykjavik University, Reykjavik, Iceland
Björn Þór Jónsson
Microsoft Research Lab – Asia, Beijing, China
Bei Liu
The University of Tokyo, Tokyo, Japan
Yoko Yamakata

Cite this paper

Chen, CY., Ching, P., Huang, PH., Hu, MC. (2024). Where Are Biases? Adversarial Debiasing with Spurious Feature Visualization. In: Rudinac, S., et al. MultiMedia Modeling. MMM 2024. Lecture Notes in Computer Science, vol 14554. Springer, Cham. https://doi.org/10.1007/978-3-031-53305-1_1

DOI: https://doi.org/10.1007/978-3-031-53305-1_1
Published: 28 January 2024
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-53304-4
Online ISBN: 978-3-031-53305-1
eBook Packages: Computer Science, Computer Science (R0)
