Abstract
Systems based on Artificial Intelligence, namely Data-driven decision systems have been used in the private sector in areas such as retail, finance, and telecommunications. More recently, data-driven decision systems started to be applied in different areas of public interest, such as health, urban planning, education, criminal justice, and public administration. Several countries have been defining their own Artificial Intelligence (AI) Policies, with respective national strategies. Data-driven decision systems are, therefore, becoming an essential part of the operations of different companies and public services, on a daily basis, while creating new challenges for society. Part of those challenges is related to the risks of those systems, namely the dimensions: Bias, Explainability, and Accuracy. The ethical problems that emerge from these risks, in particular, in the public domain, make them a concern and a new challenge for Public Policy decision-makers. The goal of this work is to understand how are Bias, Explainability, and Accuracy addressed in the Cross-Industry Standard Process for Data Mining (CRISP-DM) and the Public Policy (PP) processes, and establish the parallel between these processes. In order to do that, the documents related to these topics that are listed in the “Law and Policy Reading”, published by the “AI Now Research Institute” from New York University are analyzed. In this way, Public Policy decision-makers and developers are able to identify which phases should be looked at, in both processes, when identifying, using, evaluating, and comparing these risks in tools based on Data-driven Decision Models.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
- 1.
- 2.
- 3.
- 4.
- 5.
- 6.
References
AI Now Law and Policy Reading List—AI Now Institute—Medium. https://medium.com/@AINowInstitute/ai-now-law-and-policy-reading-list-641368f09228
AI Now Research Institute: AI Now Institute (2017) https://ainowinstitute.org/
AI Now Research Institute: AI Now Law and Policy Reading List—Google Sheets (2017) https://docs.google.com/spreadsheets/d/1qtNGMB46GOwbkJUdVV9RPUNHbNkK9XW8rln48lAeEks/edit#gid=0
Anderson JE (2003) Public policymaking: an introduction. Mifflin Company, Boston, pp 1–34
Aydemir FBBBB, Dalpiaz F (2018) A roadmap for ethics-aware software engineering. In: Proceedings of the international workshop on software fairness—FairWare ’18. ACM Press, New York, New York, USA, pp 15–21. https://doi.org/10.1145/3194770.3194778, http://dl.acm.org/citation.cfm?doid=3194770.3194778
Azevedo A, Santos MF (2008) KDD, SEMMA and CRISP-DM: a parallel overview. http://recipp.ipp.pt/bitstream/10400.22/136/3/KDD-CRISP-SEMMA.pdf
Bellamy RK, Dey K, Hind M, Hoffman SC, Houde S, Kannan K, Lohia P, Mehta S, Mojsilovic A, Nagar S, Ramamurthy KN, Richards J, Saha D, Sattigeri P, Singh M, Varshney KR, Zhang Y (2019) Think your artificial intelligence software is fair? Think again. IEEE Softw 36(4):76–80. https://doi.org/10.1109/MS.2019.2908514
Brauneis R, Goodman EP (2017) Algorithmic transparency for the smart city. SSRN Electron J 103:103–176
Burnap P, Williams ML (2015) Cyber hate speech on twitter: an application of machine classification and statistical modeling for policy and decision making. Policy Internet 7(2):223–242
Center for Data Science and Public Policy: Our Open Source Mission: Code, Data, and other Resources. https://dsapp.uchicago.edu/home/resources/opensource/
Chapman P, Clinton J, Kerber R, Khabaza T, Reinartz T, Shearer C, Wirth R (2000) CRISP-DM 1.0: Step-by-step data mining guide. https://www.semanticscholar.org/paper/CRISP-DM-1.0%3A-Step-by-step-data-mining-guide-Chapman-Clinton/54bad20bbc7938991bf34f86dde0babfbd2d5a72
Chouldechova A (2017) Fair prediction with disparate impact: a study of bias in recidivism prediction instruments, pp 1–6. https://doi.org/10.1089/big.2016.0047, http://arxiv.org/abs/1703.00056, http://arxiv.org/abs/1610.07524
Citron DK (2007) Technological due process. Washington Univ Law Rev 85(6):1249–1313
Collingwood L, Wilkerson J (2012) Tradeoffs in accuracy and efficiency in supervised learning methods. J Inf Technol Polit 9(3):298–318
Danks D, London AJ, Dignum V (2017) Algorithmic bias in autonomous systems. In: Proceedings of the 26th international joint conference on artificial intelligence. IJCAI—international joint conferences on artificial intelligence, pp 4691–4697. http://dl.acm.org/citation.cfm?id=3171944
Davies J (2012) Word cloud generator—Jason Davies. https://www.jasondavies.com/wordcloud/
De Marchi G, Lucertini G, Tsoukiàs A (2016) From evidence-based policy making to policy analytics. Ann Oper Res 236(1):15–38
Flick U, Kardoff EV, Steinke I (2004) A companion to qualitative research. Sage Publications UK. https://books.google.pt/books?hl=pt-PT&lr=&id=6lwPkSo2XW8C&oi=fnd&pg=PP282&dq=content+analysis&ots=Zq_U0nQpMi&sig=BsGhLeZ91wSRsCiaVH2vDDEuSW0&redir_esc=y#v=onepage&q=contentanalysis&f=false
IBM Research Institute: AI Fairness 360 (2018) http://aif360.mybluemix.net/?cm_mc_uid=42643015043715552375663&cm_mc_sid_50200000=24341351557649842548
International Risk Governance Center: The Governance of Decision-Making Algorithms. Tech rep (2018) https://irgc.epfl.ch/issues/projects-cybersecurity/the-governance-of-decision-making-algorithms/
Lecher C (2018) What happens when an algorithm cuts your health care. https://www.theverge.com/2018/3/21/17144260/healthcare-medicaid-algorithm-arkansas-cerebral-palsy
Lipton ZC, Chouldechova A, McAuley J (2017) Does mitigating ML’s impact disparity require treatment disparity? Adv Neural Inf Process Syst 2018-Decem:8125–8135. https://arxiv.org/abs/1711.07076
Mathison S (2005) Cross-case analysis. In: Encyclopedia of evaluation. Sage Publications, Inc., 2455 Teller Road, Thousand Oaks California 91320 United States of America. https://doi.org/10.4135/9781412950558.n129, http://methods.sagepub.com/reference/encyclopedia-of-evaluation/n129.xml
Ncr PC, Spss JC, Ncr RK, Spss TK, Daimlerchrysler TR, Spss CS, Daimlerchrysler RW, Chapman P, Clinton JJ, Kerber R, Khabaza TT, Daimlerchrysler TR, Shearer CRH, Daimlerchrysler RW, Reinartz T, Shearer CRH, Wirth R (2000) CRISP-DM 1.0: step-by-step data mining guide. SPSS inc 78:1–78. https://doi.org/10.1017/CBO9781107415324.004, https://www.semanticscholar.org/paper/CRISP-DM-1.0
Perry J, Kingdon JW (1985) Agendas, alternatives, and public policies. J Policy Anal Manage 4(4):621. https://doi.org/10.2307/3323801
Prabhakaran S, Gensim topic modeling—a guide to building best LDA models. https://www.machinelearningplus.com/nlp/topic-modeling-gensim-python/#9createbigramandtrigrammodels
Saldana J (2009) The coding manual for qualitative researchers
Wirth R, Hipp J (2000) CRISP-DM: towards a standard process model for data mining. In: Proceedings of the 4th international conference on the practical application of knowledge discovery and data mining, pp 29–39. https://doi.org/10.1.1.198.5133, http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.198.5133
Young J, Mendizabel E (2009) Helping researchers become policy entrepreneurs
Zou C, Hou D (2014) LDA analyzer: a tool for exploring topic models. In: Proceedings—30th international conference on software maintenance and evolution, ICSME 2014. Institute of Electrical and Electronics Engineers Inc., pp 593–596. https://doi.org/10.1109/ICSME.2014.103
Acknowledgements
This work is a result of the project Operation NORTE-08-5369-FSE-000045 supported by Norte Portugal Regional Operational Programme (NORTE 2020), under the PORTUGAL 2020 Partnership Agreement, through the European Social Fund (ESF). Project “Network Science for urban engineering” under the FCT Arrangement/Agreement—Scientific and technological cooperation FCT/ INDIA-2017/2019 Ref: FCT/4755/3/5/2017/S.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this chapter
Cite this chapter
Teixeira, S., Rodrigues, J.C., Veloso, B., Gama, J. (2022). Challenges of Data-Driven Decision Models: Implications for Developers and for Public Policy Decision-Makers. In: Banerji, P., Jana, A. (eds) Advances in Urban Design and Engineering. Design Science and Innovation. Springer, Singapore. https://doi.org/10.1007/978-981-19-0412-7_7
Download citation
DOI: https://doi.org/10.1007/978-981-19-0412-7_7
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-19-0411-0
Online ISBN: 978-981-19-0412-7
eBook Packages: EngineeringEngineering (R0)