ABSTRACT
AI for good (AI4G) projects involve developing and applying artificial intelligence (AI) based solutions to further goals in areas such as sustainability, health, humanitarian aid, and social justice. Developing and deploying such solutions must be done in collaboration with partners who are experts in the domain in question and who already have experience in making progress towards such goals. Based on our experiences, we detail the different aspects of this type of collaboration broken down into four high-level categories: communication, data, modeling, and impact, and distill eleven takeaways to guide such projects in the future. We briefly describe two case studies to illustrate how some of these takeaways were applied in practice during our past collaborations.
- Naheed R Abbasi, Helen M Shaw, Darrell S Rigel, Robert J Friedman, William H McCarthy, Iman Osman, Alfred W Kopf, and David Polsky. 2004. Early diagnosis of cutaneous melanoma: revisiting the ABCD criteria. JAMA, Vol. 292, 22 (2004), 2771--2776.Google ScholarCross Ref
- Raja Abdulrahim. 2021. AI Emerges as Crucial Tool for Groups Seeking Justice for Syria War Crimes. The Wall Street Journal (2021). https://www.wsj.com/articles/ai-emerges-as-crucial-tool-for-groups-seeking-justice-for-syria-war-crimes-11613228401Google Scholar
- Jorge A Ahumada, Eric Fegraus, Tanya Birch, Nicole Flores, Roland Kays, Timothy G O'Brien, Jonathan Palmer, Stephanie Schuttler, Jennifer Y Zhao, Walter Jetz, et al. 2020. Wildlife insights: A platform to maximize the potential of camera trap and other passive sensor wildlife data for the planet. Environmental Conservation, Vol. 47, 1 (2020), 1--6.Google ScholarCross Ref
- Michele Avanzo, Joseph Stancanello, and Issam El Naqa. 2017. Beyond imaging: the promise of radiomics. Physica Medica, Vol. 38 (2017), 122--139.Google ScholarCross Ref
- N. Baker, H. Lu, Gennady Erlikhman, and P. Kellman. 2020. Local features and global shape information in object classification by deep convolutional neural networks. Vision Research, Vol. 172 (2020), 46--61.Google ScholarCross Ref
- Andriy I Bandos, Howard E Rockette, and David Gur. 2013. Subject-centered free-response ROC (FROC) analysis. Medical physics, Vol. 40, 5 (2013), 051706.Google Scholar
- Shimaa Baraka, Benjamin Akera, Bibek Aryal, Tenzing Sherpa, Finu Shresta, Anthony Ortiz, Kris Sankaran, Juan Lavista Ferres, Mir Matin, and Yoshua Bengio. 2020. Machine Learning for Glacier Monitoring in the Hindu Kush Himalaya. arXiv preprint arXiv:2012.05013 (2020).Google Scholar
- Sara Beery, Yang Liu, Dan Morris, Jim Piavis, Ashish Kapoor, Neel Joshi, Markus Meister, and Pietro Perona. 2020. Synthetic examples improve generalization for rare classes. In The IEEE Winter Conference on Applications of Computer Vision. 863--873.Google ScholarCross Ref
- Sara Beery, Grant Van Horn, and Pietro Perona. 2018. Recognition in Terra Incognita. In Proceedings of the European Conference on Computer Vision. Munich, Germany.Google ScholarCross Ref
- Bettina Berendt. 2019. AI for the Common Good?! Pitfalls, challenges, and ethics pen-testing. Paladyn, Journal of Behavioral Robotics, Vol. 10, 1 (2019), 44--65.Google ScholarCross Ref
- Roderic Broadhurst. 2020. Child sex abuse images and exploitation materials. In The Human Factor of Cybercrime, Rutger Leukfeldt and Thomas J. Holt (Eds.). Routledge, 310--336.Google Scholar
- Mauro Castelli, Leonardo Vanneschi, and Alevs Popovivc. 2015. Predicting burned areas of forest fires: an artificial intelligence approach. Fire ecology, Vol. 11, 1 (2015), 106--118.Google Scholar
- Lowik Chanussot, Abhishek Das, Siddharth Goyal, Thibaut Lavril, Muhammed Shuaibi, Morgane Riviere, Kevin Tran, Javier Heras-Domingo, Caleb Ho, Weihua Hu, et al. 2020. The Open Catalyst 2020 (OC20) Dataset and Community Challenges. arXiv preprint arXiv:2010.09990 (2020).Google Scholar
- Nan-Chen Chen, Margaret Drouhard, Rafal Kocielnik, Jina Suh, and Cecilia R Aragon. 2018. Using machine learning to support qualitative coding in social science: Shifting the focus to ambiguity. ACM Transactions on Interactive Intelligent Systems (TiiS), Vol. 8, 2 (2018), 1--20.Google ScholarDigital Library
- Michael Chui, James Manyika, and Mehdi Miremadi. 2018. What AI can and can't do (yet) for your business. McKinsey Quarterly, Vol. 1 (2018), 97--108.Google Scholar
- Jim Collins. 2019. Turning the flywheel: a monograph to accompany good to great .Random House.Google Scholar
- Josh Cowls, Thomas King, Mariarosaria Taddeo, and Luciano Floridi. 2019. Designing AI for social good: Seven essential factors. SSRN 3388669 (2019).Google Scholar
- Fei Fang, Thanh Hong Nguyen, Rob Pickles, Wai Y Lam, Gopalasamy R Clements, Bo An, Amandeep Singh, Milind Tambe, Andrew Lemieux, et al. 2016. Deploying PAWS: Field Optimization of the Protection Assistant for Wildlife Security.. In AAAI, Vol. 16. 3966--3973.Google ScholarCross Ref
- Fei Fang, Milind Tambe, Bistra Dilkina, and Andrew J Plumptre. 2019. Artificial intelligence and conservation .Cambridge University Press.Google Scholar
- Salvador Garc'ia, Julián Luengo, and Francisco Herrera. 2015. Data preprocessing in data mining .Springer.Google Scholar
- Robert Geirhos, Patricia Rubisch, Claudio Michaelis, Matthias Bethge, Felix A. Wichmann, and Wieland Brendel. 2019. ImageNet-trained CNNs are biased towards texture; increasing shape bias improves accuracy and robustness. In International Conference on Learning Representations .Google Scholar
- Shahrzad Gholami, Narendran Kodandapani, Jane Wang, and Juan M. Lavista Ferres. 2021. Where there's Smoke, there's Fire: Wildfire Risk Predictive Modeling via Historical Climate Data. In Annual Conference on Innovative Applications of Artificial Intelligence (IAAI) .Google Scholar
- Shahrzad Gholami, Sara Mc Carthy, Bistra Dilkina, Andrew J Plumptre, Milind Tambe, Margaret Driciru, Fred Wanyama, Aggrey Rwetsiba, Mustapha Nsubaga, Joshua Mabonga, et al. 2018. Adversary Models Account for Imperfect Crime Data: Forecasting and Planning against Real-world Poachers. In AAMAS. 823--831.Google Scholar
- Carla Gomes, Thomas Dietterich, Christopher Barrett, Jon Conrad, Bistra Dilkina, Stefano Ermon, Fei Fang, Andrew Farnsworth, Alan Fern, Xiaoli Fern, et al. 2019. Computational sustainability: Computing for a better world and a sustainable future. Commun. ACM, Vol. 62, 9 (2019), 56--65.Google ScholarDigital Library
- Noel Gorelick, Matt Hancher, Mike Dixon, Simon Ilyushchenko, David Thau, and Rebecca Moore. 2017. Google Earth Engine: Planetary-scale geospatial analysis for everyone. Remote sensing of Environment, Vol. 202 (2017), 18--27.Google Scholar
- Ben Green. 2019. "Good" isn't good enough. In Proceedings of the AI for Social Good workshop at NeurIPS .Google Scholar
- Saul Greenberg. 2020. Automated Image Recognition for Wildlife Camera Traps: Making it Work for You. Technical Report. Science.Google Scholar
- Ritwik Gupta, Bryce Goodman, Nirav Patel, Ricky Hosfelt, Sandra Sajeev, Eric Heim, Jigar Doshi, Keane Lucas, Howie Choset, and Matthew Gaston. 2019. Creating xBD: A Dataset for Assessing Building Damage from Satellite Imagery. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops .Google Scholar
- Hanxiang Hao, Sriram Baireddy, Emily R Bartusiak, Latisha Konz, Kevin LaTourette, Michael Gribbons, Moses Chan, Mary L Comer, and Edward J Delp. 2020. An Attention-Based System for Damage Assessment Using Satellite Imagery. arXiv preprint arXiv:2004.06643 (2020).Google Scholar
- Jamie Hayes, Luca Melis, George Danezis, and Emiliano De Cristofaro. 2019. LOGAN: Membership inference attacks against generative models. Proceedings on Privacy Enhancing Technologies, Vol. 2019, 1 (2019), 133--152.Google ScholarCross Ref
- Fred Hohman, Kanit Wongsuphasawat, Mary Beth Kery, and Kayur Patel. 2020. Understanding and Visualizing Data Iteration in Machine Learning. In Proceedings of the 2020 CHI Conference on Human Factors in Computing Systems. 1--13.Google ScholarDigital Library
- Anjali Jaiswal and Madhura Joshi. 2020. Climate Action: All Eyes on India. https://www.nrdc.org/experts/anjali-jaiswal/climate-action-all-eyes-indiaGoogle Scholar
- Bargav Jayaraman, Lingxiao Wang, David Evans, and Quanquan Gu. 2020. Revisiting Membership Inference Under Realistic Assumptions. arXiv preprint arXiv:2005.10881 (2020).Google Scholar
- Neal Jean, Marshall Burke, Michael Xie, W Matthew Davis, David B Lobell, and Stefano Ermon. 2016. Combining satellite imagery and machine learning to predict poverty. Science, Vol. 353, 6301 (2016), 790--794.Google ScholarCross Ref
- Alexandre Lacoste, Alexandra Luccioni, Victor Schmidt, and Thomas Dandres. 2019. Quantifying the Carbon Emissions of Machine Learning. arXiv preprint arXiv:1910.09700 (2019).Google Scholar
- Maxime Lenormand, Sylvie Huet, Floriana Gargiulo, and Guillaume Deffuant. 2012. A Universal Model of Commuting Networks. PLoS ONE, Vol. 7, 10 (2012).Google ScholarCross Ref
- Xiyang Liu, Yixi Xu, Sumit Mukherjee, and Juan Lavista Ferres. 2020. MACE: A Flexible Framework for Membership Privacy Estimation in Generative Models. arXiv preprint arXiv:2009.05683 (2020).Google Scholar
- Gary Marcus. 2019. An Epidemic of AI Misinformation. The Gradient (2019). https://thegradient.pub/an-epidemic-of-ai-misinformation/Google Scholar
- D Douglas Miller and Eric W Brown. 2018. Artificial intelligence in medical practice: the question to the answer? The American journal of medicine, Vol. 131, 2 (2018), 129--133.Google Scholar
- Jared Moore. 2019. AI for not bad. Frontiers in Big Data, Vol. 2 (2019), 32.Google ScholarCross Ref
- Sumit Mukherjee, Yixi Xu, Anusua Trivedi, and Juan Lavista Ferres. 2019. Protecting GANs against privacy attacks by preventing overfitting. arXiv preprint arXiv:2001.00071 (2019).Google Scholar
- Md Nasir, Brian Baucom, Panayiotis Georgiou, and Shrikanth Narayanan. 2015. Redundancy analysis of behavioral coding for couples therapy and improved estimation of behavior from noisy annotations. In 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 1886--1890.Google ScholarCross Ref
- Md Nasir, Brian R Baucom, Craig J Bryan, Shrikanth S Narayanan, and Panayiotis G Georgiou. 2017. Complexity in Speech and its Relation to Emotional Bond in Therapist-Patient Interactions During Suicide Risk Assessment Interviews.. In INTERSPEECH. 3296--3300.Google Scholar
- Mohammad Sadegh Norouzzadeh, Anh Nguyen, Margaret Kosmala, Alexandra Swanson, Meredith S. Palmer, Craig Packer, and Jeff Clune. 2018. Automatically identifying, counting, and describing wild animals in camera-trap images with deep learning. Proceedings of the National Academy of Sciences (2018).Google ScholarCross Ref
- Felipe Oviedo, Zekun Ren, Xue Hansong, Siyu Isaac Parker Tian, Kaicheng Zhang, Mariya Layurova, Thomas Heumueller, Ning Li, Erik Birgersson, Shijing Sun, et al. 2020. Bridging the gap between photovoltaics R&D and manufacturing with data-driven optimization. arXiv preprint arXiv:2004.13599 (2020).Google Scholar
- Kasey Panetta. 2020. 5 Trends Drive the Gartner Hype Cycle for Emerging Technologies, 2020. https://www.gartner.com/smarterwithgartner/5-trends-drive-the-gartner-hype-cycle-for-emerging-technologies-2020/Google Scholar
- Clionadh Raleigh, Andrew Linke, Håvard Hegre, and Joakim Karlsen. 2010. Introducing ACLED: an armed conflict location and event dataset: special data feature. Journal of peace research, Vol. 47, 5 (2010), 651--660.Google ScholarCross Ref
- Vikas C Raykar, Shipeng Yu, Linda H Zhao, Gerardo Hermosillo Valadez, Charles Florin, Luca Bogoni, and Linda Moy. 2010. Learning from crowds. Journal of Machine Learning Research, Vol. 11, 4 (2010).Google Scholar
- Zekun Ren, Felipe Oviedo, Maung Thway, Siyu IP Tian, Yue Wang, Hansong Xue, Jose Dario Perea, Mariya Layurova, Thomas Heumueller, Erik Birgersson, et al. 2020. Embedding physics domain knowledge into a Bayesian network enables layer-by-layer process innovation for photovoltaics. npj Computational Materials, Vol. 6, 1 (2020), 1--9.Google Scholar
- Caleb Robinson and Bistra Dilkina. 2018. A machine learning approach to modeling human migration. In Proceedings of the 1st ACM SIGCAS Conference on Computing and Sustainable Societies. 1--8.Google ScholarDigital Library
- Caleb Robinson, Le Hou, Kolya Malkin, Rachel Soobitsky, Jacob Czawlytko, Bistra Dilkina, and Nebojsa Jojic. 2019 a. Large scale high-resolution land cover mapping with multi-resolution data. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 12726--12735.Google ScholarCross Ref
- Caleb Robinson, Anthony Ortiz, Kolya Malkin, Blake Elias, Andi Peng, Dan Morris, Bistra Dilkina, and Nebojsa Jojic. 2019 b. Human-Machine Collaboration for Fast Land Cover Mapping. arXiv preprint arXiv:1906.04176 (2019).Google Scholar
- Marcos Rodrigues and Juan de la Riva. 2014. An insight into machine-learning algorithms to model human-caused wildfire occurrence. Environmental Modelling & Software, Vol. 57 (2014), 192--201.Google ScholarCross Ref
- David Rolnick, Priya L Donti, Lynn H Kaack, Kelly Kochanski, Alexandre Lacoste, Kris Sankaran, Andrew Slavin Ross, Nikola Milojevic-Dupont, Natasha Jaques, Anna Waldman-Brown, et al. 2019. Tackling climate change with machine learning. arXiv preprint arXiv:1906.05433 (2019).Google Scholar
- Mohammad Sadegh Norouzzadeh, Dan Morris, Sara Beery, Neel Joshi, Nebojsa Jojic, and Jeff Clune. 2019. A deep active learning system for species identification and counting in camera trap images. Methods in Ecology and Evolution (2019).Google Scholar
- Youssef Safi and Abdelaziz Bouroumi. 2013. Prediction of forest fires using artificial neural networks. Applied Mathematical Sciences, Vol. 7, 6 (2013), 271--286.Google ScholarCross Ref
- Nithya Sambasivan, Shivani Kapania, Hannah Highfill, Diana Akrong, Praveen Kumar Paritosh, and Lora Mois Aroyo. 2021. "Everyone wants to do the model work, not the data work": Data Cascades in High-Stakes AI.Google Scholar
- Stefan Schneider, Saul Greenberg, Graham W Taylor, and Stefan C Kremer. 2020. Three critical factors affecting automated image species recognition performance for camera traps. Ecology and evolution, Vol. 10, 7 (2020), 3503--3517.Google Scholar
- Daniel Sheldon, Andrew Farnsworth, Jed W Irvine, Benjamin Van Doren, Kevin F Webb, Thomas G Dietterich, and Steve Kelling. 2013. Approximate Bayesian inference for reconstructing velocities of migrating birds from weather radar. In Proceedings of the Twenty-Seventh AAAI Conference on Artificial Intelligence .Google ScholarDigital Library
- Tao Sheng, Chen Feng, Shaojie Zhuo, Xiaopeng Zhang, Liang Shen, and Mickey Aleksic. 2018. A quantization-friendly separable convolution for mobilenets. In 2018 1st Workshop on Energy Efficient Machine Learning and Cognitive Computing for Embedded Applications (EMC2). IEEE, 14--18.Google ScholarCross Ref
- Zheyuan Ryan Shi, Claire Wang, and Fei Fang. 2020. Artificial intelligence for social good: A survey. arXiv preprint arXiv:2001.01818 (2020).Google Scholar
- Reza Shokri, Marco Stronati, Congzheng Song, and Vitaly Shmatikov. 2017. Membership inference attacks against machine learning models. In 2017 IEEE Symposium on Security and Privacy (SP). IEEE, 3--18.Google ScholarCross Ref
- Emma Strubell, Ananya Ganesh, and Andrew McCallum. 2019. Energy and policy considerations for deep learning in NLP. In 57th Annual Meeting of the Association for Computational Linguistics (ACL) .Google ScholarCross Ref
- Shijing Sun, Armi Tiihonen, Felipe Oviedo, Zhe Liu, Janak Thapa, Noor Titan Putri Hartono, Anuj Goyal, Clio Batali, Alex Encinas, Jason Yoo, et al. 2021. A Physical Data Fusion Approach to Optimize Compositional Stability of Halide Perovskites. Matter (2021).Google Scholar
- Saeid Asgari Taghanaki, Yefeng Zheng, S Kevin Zhou, Bogdan Georgescu, Puneet Sharma, Daguang Xu, Dorin Comaniciu, and Ghassan Hamarneh. 2019. Combo loss: Handling input and output imbalance in multi-organ segmentation. Computerized Medical Imaging and Graphics, Vol. 75 (2019), 24--33.Google ScholarCross Ref
- Erik Trautman. 2018. The Virtuous Cycle of AI Products. https://www.eriktrautman.com/posts/the-virtuous-cycle-of-ai-productsGoogle Scholar
- Anusua Trivedi, Sumit Mukherjee, Edmund Tse, Anne Ewing, and Juan Lavista Ferres. 2019. Risks of Using Non-verified Open Data: A case study on using Machine Learning techniques for predicting Pregnancy Outcomes in India. arXiv preprint arXiv:1910.02136 (2019).Google Scholar
- Tinka Valentijn, Jacopo Margutti, Marc van den Homberg, and Jorma Laaksonen. 2020. Multi-Hazard and Spatial Transferability of a CNN for Automated Building Damage Assessment. Remote Sensing, Vol. 12, 17 (2020), 2839.Google ScholarCross Ref
- Adam Van Etten, Dave Lindenbaum, and Todd M Bacastow. 2018. Spacenet: A remote sensing dataset and challenge series. arXiv preprint arXiv:1807.01232 (2018).Google Scholar
- Ricardo Vinuesa, Hossein Azizpour, Iolanda Leite, Madeline Balaam, Virginia Dignum, Sami Domisch, Anna Fell"ander, Simone Daniela Langhans, Max Tegmark, and Francesco Fuso Nerini. 2020. The role of artificial intelligence in achieving the Sustainable Development Goals. Nature communications, Vol. 11, 1 (2020), 1--10.Google Scholar
- Randi Vita, Swapnil Mahajan, James A Overton, Sandeep Kumar Dhanda, Sheridan Martini, Jason R Cantrell, Daniel K Wheeler, Alessandro Sette, and Bjoern Peters. 2019. The immune epitope database (IEDB): 2018 update. Nucleic acids research, Vol. 47, D1 (2019), D339--D343.Google Scholar
- Kiri Wagstaff. 2012. Machine learning that matters. arXiv preprint arXiv:1206.4656 (2012).Google ScholarDigital Library
- Sherrie Wang, William Chen, Sang Michael Xie, George Azzari, and David B Lobell. 2020. Weakly supervised deep learning for segmentation of remote sensing imagery. Remote Sensing, Vol. 12, 2 (2020), 207.Google ScholarCross Ref
- Ben G Weinstein. 2018. A computer vision for animal ecology. Journal of Animal Ecology, Vol. 87, 3 (2018), 533--545.Google ScholarCross Ref
- Christine T Wolf. 2020. Democratizing AI? experience and accessibility in the age of artificial intelligence. XRDS: Crossroads, The ACM Magazine for Students, Vol. 26, 4 (2020), 12--15.Google ScholarDigital Library
- Carole-Jean Wu, David Brooks, Kevin Chen, Douglas Chen, Sy Choudhury, Marat Dukhan, Kim Hazelwood, Eldad Isaac, Yangqing Jia, Bill Jia, et al. 2019. Machine learning at facebook: Understanding inference at the edge. In 2019 IEEE International Symposium on High Performance Computer Architecture (HPCA). IEEE, 331--344.Google ScholarCross Ref
- Liyang Xie, Kaixiang Lin, Shu Wang, Fei Wang, and Jiayu Zhou. 2018. Differentially private generative adversarial network. arXiv preprint arXiv:1802.06739 (2018).Google Scholar
- Lily Xu, Shahrzad Gholami, Sara Mc Carthy, Bistra Dilkina, Andrew Plumptre, Milind Tambe, Rohit Singh, Mustapha Nsubuga, Joshua Mabonga, Margaret Driciru, et al. 2020. Stay Ahead of Poachers: Illegal Wildlife Poaching Prediction and Patrol Planning Under Uncertainty with Field Test Evaluations (Short Version). In 2020 IEEE 36th International Conference on Data Engineering (ICDE). IEEE, 1898--1901.Google ScholarCross Ref
- Amir Hossein Yazdavar, Hussein S Al-Olimat, Monireh Ebrahimi, Goonmeet Bajaj, Tanvi Banerjee, Krishnaprasad Thirunarayan, Jyotishman Pathak, and Amit Sheth. 2017. Semi-supervised approach to monitoring clinical depressive symptoms in social media. In Proceedings of the 2017 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining. 1191--1198.Google ScholarDigital Library
- John R Zech, Marcus A Badgeley, Manway Liu, Anthony B Costa, Joseph J Titano, and Eric Karl Oermann. 2018. Variable generalization performance of a deep learning model to detect pneumonia in chest radiographs: a cross-sectional study. PLoS medicine, Vol. 15, 11 (2018), e1002683.Google Scholar
- Zhi-Hua Zhou. 2018. A brief introduction to weakly supervised learning. National Science Review, Vol. 5, 1 (2018), 44--53.Google ScholarCross Ref
Index Terms
- Becoming Good at AI for Good
Recommendations
Show me a good time: using content to provide activity awareness to collaborators with activityspotter
GROUP '10: Proceedings of the 2010 ACM International Conference on Supporting Group WorkIn order to study the effect supporting awareness of a colleague's activity on a collaborator's communication intentions, we developed ActivitySpotter. It is a research tool and awareness display that determines a user's current activity through a ...
Can AI be for Good in the Midst of Cyber Attacks and Privacy Violations?: A Position Paper
CODASPY '20: Proceedings of the Tenth ACM Conference on Data and Application Security and PrivacyArtificial Intelligence (AI) is affecting every aspect of our lives from healthcare to finance to driving to managing the home. Sophisticated machine learning techniques with a focus on deep learning are being applied successfully to detect cancer, to ...
Collaboard: a remote collaboration groupware device featuring an embodiment-enriched shared workspace
GROUP '10: Proceedings of the 2010 ACM International Conference on Supporting Group WorkIn this paper we present a mixed presence groupware device called "CollaBoard". The device improves collaboration between co-located and remote partners by providing a high level of workspace awareness. This is achieved by superimposing a life-size ...
Comments