DOI: 10.1145/3366423.3380063
Research article

Crowdsourcing Detection of Sampling Biases in Image Datasets

Published: 20 April 2020

Abstract

Despite many exciting innovations in computer vision, recent studies reveal a number of risks in existing computer vision systems, suggesting that their results may be unfair and untrustworthy. Many of these risks can be partly attributed to training on image datasets that exhibit sampling biases and thus do not accurately reflect the real visual world. Detecting potential sampling biases in a visual dataset prior to model development is therefore essential for mitigating fairness and trustworthiness concerns in computer vision. In this paper, we propose a three-step crowdsourcing workflow that brings humans into the loop to facilitate bias discovery in image datasets. Through two sets of evaluation studies, we find that the proposed workflow can effectively organize the crowd to detect sampling biases both in datasets artificially created with designed biases and in real-world image datasets that are widely used in computer vision research and system development.




Published In

WWW '20: Proceedings of The Web Conference 2020
April 2020, 3143 pages
ISBN: 9781450370233
DOI: 10.1145/3366423

Publisher

Association for Computing Machinery, New York, NY, United States


        Author Tags

        1. crowdsourcing
        2. image dataset
        3. sampling bias
        4. workflow design

        Qualifiers

        • Research-article
        • Research
        • Refereed limited

        Conference

WWW '20: The Web Conference 2020
        April 20 - 24, 2020
        Taipei, Taiwan

        Acceptance Rates

        Overall Acceptance Rate 1,899 of 8,196 submissions, 23%


Cited By

• (2024) To Err Is AI! Debugging as an Intervention to Facilitate Appropriate Reliance on AI Systems. Proceedings of the 35th ACM Conference on Hypertext and Social Media, 98-105. DOI: 10.1145/3648188.3675130. Online publication date: 10-Sep-2024.
• (2024) The State of Pilot Study Reporting in Crowdsourcing: A Reflection on Best Practices and Guidelines. Proceedings of the ACM on Human-Computer Interaction 8, CSCW1, 1-45. DOI: 10.1145/3641023. Online publication date: 26-Apr-2024.
• (2024) SymLearn: A Symbiotic Crowd-AI Collective Learning Framework to Web-based Healthcare Policy Adherence Assessment. Proceedings of the ACM Web Conference 2024, 2497-2508. DOI: 10.1145/3589334.3645519. Online publication date: 13-May-2024.
• (2024) MindSet: A Bias-Detection Interface Using a Visual Human-in-the-Loop Workflow. Artificial Intelligence. ECAI 2023 International Workshops, 93-105. DOI: 10.1007/978-3-031-50485-3_8. Online publication date: 25-Jan-2024.
• (2023) The effects of AI biases and explanations on human decision fairness. Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 3076-3084. DOI: 10.24963/ijcai.2023/343. Online publication date: 19-Aug-2023.
• (2023) Representation Bias in Data: A Survey on Identification and Resolution Techniques. ACM Computing Surveys 55, 13s, 1-39. DOI: 10.1145/3588433. Online publication date: 13-Jul-2023.
• (2022) An Edge Detection Method Based on Local Gradient Estimation: Application to High-Temperature Metallic Droplet Images. Applied Sciences 12, 14, 6976. DOI: 10.3390/app12146976. Online publication date: 9-Jul-2022.
• (2022) A Survey on Task Assignment in Crowdsourcing. ACM Computing Surveys 55, 3, 1-35. DOI: 10.1145/3494522. Online publication date: 3-Feb-2022.
• (2022) What Should You Know? A Human-In-the-Loop Approach to Unknown Unknowns Characterization in Image Recognition. Proceedings of the ACM Web Conference 2022, 882-892. DOI: 10.1145/3485447.3512040. Online publication date: 25-Apr-2022.
• (2022) A survey on bias in visual datasets. Computer Vision and Image Understanding 223, C. DOI: 10.1016/j.cviu.2022.103552. Online publication date: 1-Oct-2022.
