ABSTRACT
The cross-cutting and interdisciplinary nature of data work has created an opportunity to engage more students from diverse backgrounds in data science and has expanded entry pathways for future data professionals. However, without greater representation of Black, Indigenous, and other marginalized people of color in data science, we risk reinforcing existing systems of differentiated power that oppress rather than empower these groups. In this paper, we coin the term emancipatory data science to highlight the unique contributions of individuals who use their expertise to mitigate data harms for minoritized and marginalized populations, and to suggest a way forward for the data science workforce and research community in an increasingly algorithmic society.
Index Terms
- Emancipatory Data Science: A Liberatory Framework for Mitigating Data Harms and Fostering Social Transformation