Challenges of Data-Driven Decision Models: Implications for Developers and for Public Policy Decision-Makers

Teixeira, Sónia; Rodrigues, José Coelho; Veloso, Bruno; Gama, João

doi:10.1007/978-981-19-0412-7_7

Part of the book series: Design Science and Innovation (DSI)

Abstract

Systems based on Artificial Intelligence, namely data-driven decision systems, have been used in the private sector in areas such as retail, finance, and telecommunications. More recently, data-driven decision systems have started to be applied in different areas of public interest, such as health, urban planning, education, criminal justice, and public administration. Several countries have been defining their own Artificial Intelligence (AI) policies, with respective national strategies. Data-driven decision systems are therefore becoming an essential part of the daily operations of different companies and public services, while creating new challenges for society. Part of those challenges relates to the risks of those systems, namely along the dimensions of Bias, Explainability, and Accuracy. The ethical problems that emerge from these risks, particularly in the public domain, make them a concern and a new challenge for Public Policy decision-makers. The goal of this work is to understand how Bias, Explainability, and Accuracy are addressed in the Cross-Industry Standard Process for Data Mining (CRISP-DM) and the Public Policy (PP) processes, and to establish the parallel between these processes. To do that, the documents related to these topics that are listed in the “Law and Policy Reading”, published by the “AI Now Research Institute” at New York University, are analyzed. In this way, Public Policy decision-makers and developers can identify which phases should be looked at, in both processes, when identifying, using, evaluating, and comparing these risks in tools based on Data-driven Decision Models.

Acknowledgements

This work is a result of the project Operation NORTE-08-5369-FSE-000045 supported by Norte Portugal Regional Operational Programme (NORTE 2020), under the PORTUGAL 2020 Partnership Agreement, through the European Social Fund (ESF). Project “Network Science for urban engineering” under the FCT Arrangement/Agreement—Scientific and technological cooperation FCT/INDIA-2017/2019 Ref: FCT/4755/3/5/2017/S.

Author information

Authors and Affiliations

INESC TEC, Porto, Portugal
Sónia Teixeira, José Coelho Rodrigues, Bruno Veloso & João Gama
Faculty of Engineering, University of Porto, Porto, Portugal
Sónia Teixeira & José Coelho Rodrigues
Faculty of Economics, University of Porto, Porto, Portugal
João Gama
Portucalense University, Porto, Portugal
Bruno Veloso

Corresponding author

Correspondence to João Gama.

Editor information

Editors and Affiliations

Department of Civil Engineering, Indian Institute of Technology Bombay, Mumbai, Maharashtra, India
Pradipta Banerji
Centre for Urban Science and Engineering, Indian Institute of Technology Bombay, Mumbai, Maharashtra, India
Arnab Jana

About this chapter

Cite this chapter

Teixeira, S., Rodrigues, J.C., Veloso, B., Gama, J. (2022). Challenges of Data-Driven Decision Models: Implications for Developers and for Public Policy Decision-Makers. In: Banerji, P., Jana, A. (eds) Advances in Urban Design and Engineering. Design Science and Innovation. Springer, Singapore. https://doi.org/10.1007/978-981-19-0412-7_7

DOI: https://doi.org/10.1007/978-981-19-0412-7_7
Published: 11 April 2022
Publisher Name: Springer, Singapore
Print ISBN: 978-981-19-0411-0
Online ISBN: 978-981-19-0412-7
eBook Packages: Engineering, Engineering (R0)
