skip to main content
10.1145/3209281.3209366acmotherconferencesArticle/Chapter ViewAbstractPublication Pagesdg-oConference Proceedingsconference-collections
short-paper

Exploring data science for public good in South Africa: evaluating factors that lead to success

Published: 30 May 2018 Publication History

Abstract

In the pursuit of public service, governments have to oversee many complex systems. In recent years, data-driven methodologies have been adopted as tools to oversee and enhance service delivery. In this paper we discuss the ways that the government of South Africa, and its agencies, use data tools as well as the policies and investments that have been put into place; some of which have created a more enabling ecosystem while others have created difficulties and obstacles on the road towards realising the powers of analytics for social good. We discuss the current data landscape from the lens of Open Data policies, data readiness policies, human capital development and ethics. The paper is a summary of our observations, our successes, aspirations and challenges as we endeavour to contribute to a more data-driven governance.

References

[1]
Chris Bizer, Tom Heath, Danny Ayers, and Yves Raimond. 2007. Interlinking open data on the web. In Demonstrations track, 4th european semantic web conference, innsbruck, austria.
[2]
Heinrich R Bohlmann, Jan Horn Van Heerden, PB Dixon, and MT Rimmer. 2015. The impact of the 2014 platinum mining strike in South Africa: An economy-wide analysis. Economic Modelling 51 (2015), 403--411.
[3]
Rick Bonney, Tina B Phillips, Heidi L Ballard, and Jody W Enck. 2016. Can citizen science enhance public understanding of science? Public Understanding of Science 25, 1 (2016), 2--16.
[4]
Joshua Brustein. 2016. This Guy Trains Computers to Find Future Criminals. https://www.bloomberg.com/features/2016-richard-berk-future-crime/. (2016). {Online; accessed 29-January-2018}.
[5]
Gennaro Castelli, Michael Hackett, Michael Quinn Howard, Lawrence Edmund Jones, HulChu Su Shung, and Heath Daniel Brand. 2016. Energy management system that provides a real time assessment of a potentially compromising situation that can affect a utility company. (June 14 2016). US Patent 9,367,935.
[6]
Charlie Catlett and Rayid Ghani. 2015. Big Data for social good. (2015).
[7]
City of Cape Town. 2014. Open Data Policy. Technical Report. City of Cape Town.
[8]
Andreiwid Sheffer Corrêa and Pär-Ola Zander. 2017. Unleashing Tabular Content to Open Data: A Survey on PDF Table Extraction Methods and Tools. In Proceedings of the 18th Annual International Conference on Digital Government Research. 54--63.
[9]
Kate Crawford. 2016. Can an algorithm be agonistic? Ten scenes from life in calculated publics. Science, Technology, & Human Values 41, 1 (2016), 77--92.
[10]
Kate Crawford and Jason Schultz. 2014. Big data and due process: Toward a framework to redress predictive privacy harms. BCL Rev. 55 (2014), 93.
[11]
Ben Daniel. 2015. Big Data and analytics in higher education: Opportunities and challenges. British journal of educational technology 46, 5 (2015), 904--920.
[12]
Alta de Waal, Hildegarde Koen, Pieter de Villiers, Henk Roodt, Nyalleng Moorosi, and Gregor Pavlin. 2016. Construction and evaluation of Bayesian networks with expert-defined latent variables. In Information Fusion (FUSION), 2016 19th International Conference on. 774--781.
[13]
Vasant Dhar. 2013. Data science and prediction. Commun. ACM 56, 12 (2013), 64--73.
[14]
Julia Dressel and Hany Farid. 2018. The accuracy, fairness, and limits of predicting recidivism. Science Advances 4, 1 (2018), eaao5580.
[15]
Harrison Edwards and Amos Storkey. 2015. Censoring Representations with an Adversary. arXiv preprint arXiv:1511.05897 (2015).
[16]
Bryce Goodman and Seth Flaxman. 2016. European Union regulations on algorithmic decision-making and a" right to explanation". ICML Workshop on Human Interpretability in Machine Learning (2016).
[17]
Ying Han, Lisa B Signorello, Sara S Strom, Rick A Kittles, Benjamin A Rybicki, Janet L Stanford, Phyllis J Goodman, Sonja I Berndt, John Carpten, Graham Casey, et al. 2015. Generalizability of established prostate cancer risk variants in men of African ancestry. International journal of cancer 136, 5 (2015), 1210--1217.
[18]
Dirk Helbing, Bruno S Frey, Gerd Gigerenzer, Ernst Hafen, Michael Hagner, Yvonne Hofstetter, Jeroen van den Hoven, Roberto V Zicari, and Andrej Zwitter. 2017. Will democracy survive big data and artificial intelligence. Scientific American 25 (2017).
[19]
Martin Hilbert. 2016. Big Data for Development: A Review of Promises and Challenges. Development Policy Review 34, 1 (2016), 135--174.
[20]
Godwin Kaisara and Shaun Pather. 2011. The e-Government evaluation challenge: A South African Batho Pele-aligned service quality approach. Government information quarterly 28, 2 (2011), 211--221.
[21]
Joshua A Kroll, Solon Barocas, Edward W Felten, Joel R Reidenberg, David G Robinson, and Harlan Yu. 2016. Accountable algorithms. University of Pennsylvania Law Review (2016), 633.
[22]
Vukosi Marivate and Pelonomi Moiloa. 2016. Catching crime: Detection of public safety incidents using social media. In Pattern Recognition Association of South Africa and Robotics and Mechatronics International Conference (PRASA-RobMech), 2016. IEEE, 1--5.
[23]
Simnikiwe Mzekandaba. 2015. Lack of data scientists poses SKA problem. (August 2015).
[24]
Simnikiwe Mzekandaba. 2015. Universities unite to boost SKA. (September 2015).
[25]
SABC News. 2016. New tool scrutinises SA political twittersphere. http://elections.sabc.co.za/Article?id=26782. (2016). {Online; accessed 29-January-2018}.
[26]
Eric Karl Oermann, Alex Rubinsteyn, Dale Ding, Justin Mascitelli, Robert M Starke, Joshua B Bederson, Hideyuki Kano, L Dade Lunsford, Jason P Sheehan, Jeffrey Hammerbacher, et al. 2016. Using a machine learning approach to predict outcomes after radiosurgery for cerebral arteriovenous malformations. Scientific reports 6 (2016), 21161.
[27]
Ferda Ofli, Patrick Meier, Muhammad Imran, Carlos Castillo, Devis Tuia, Nicolas Rey, Julien Briant, Pauline Millet, Friedrich Reinhard, Matthew Parkan, et al. 2016. Combining human computing and machine learning to make sense of big (aerial) data for disaster response. Big data 4, 1 (2016), 47--59.
[28]
Adegboyega Ojo, Lukasz Porwol, Mohammad Waqar, Arkadiusz Stasiewicz, Edobor Osagie, Michael Hogan, Owen Harney, and Fatemeh Ahmadi Zeleti. 2016. Realizing the Innovation Potentials from Open Data: Stakeholders' Perspectives on the Desired Affordances of Open Data Environment. In Working Conference on Virtual Enterprises. 48--59.
[29]
Meenal J Patel, Alexander Khalaf, and Howard J Aizenstein. 2016. Studying depression using imaging and machine learning methods. NeuroImage: Clinical 10 (2016), 115--123.
[30]
Alice B Popejoy and Stephanie M Fullerton. 2016. Genomics is failing on diversity. Nature 538, 7624 (2016), 161.
[31]
Praekelt. 2017. Praekelt.org Pilots Whatsapp for Social Impact on Behalf of MomConnect. https://medium.com/mobileforgood/praekelt-org-pilots-whatsapp-for-social-impact-19a336f5b04e. (2017). {Online; accessed 29-January-2018}.
[32]
Foster Provost and Tom Fawcett. 2013. Data science and its relationship to big data and data-driven decision making. Big data (2013), 51--59.
[33]
Abel Ramoelo, Moses Azong Cho, Renaud Mathieu, Sabelo Madonsela, Ruben Van De Kerchove, Zaneta Kaszta, and Eléonore Wolff. 2015. Monitoring grass nutrients and biomass as indicators of rangeland quality and quantity using random forest modelling and WorldView-2 data. International Journal of Applied Earth Observation and Geoinformation 43 (2015), 43--54.
[34]
A Tversky and D Kahneman. 1981. The framing of decisions and the psychology of choice. Science 211, 4481 (1981), 453--458. arXiv:http://science.sciencemag.org/content/211/4481/453.full.pdf
[35]
Blake Valenta. 2018. How to Open Data While Protecting Privacy. http://www.govtech.com/data/How-to-Open-Data-While-Protecting-Privacy.html. (2018). {Online; accessed 29-January-2018}.
[36]
Ciira wa Maina, David Muchiri, and Peter Njoroge. 2016. A Bioacoustic Record of a Conservancy in the Mount Kenya Ecosystem. Biodiversity data journal 4 (2016).

Cited By

View all
  • (2024)A collaborative approach to advancing research and training in Public Health Data Science—challenges, opportunities, and lessons learntFrontiers in Public Health10.3389/fpubh.2024.147494712Online publication date: 11-Dec-2024
  • (2023)Adapting to the digital age: Investigating the frameworks for financial services in modern communitiesBusiness Strategy & Development10.1002/bsd2.3037:1Online publication date: 26-Sep-2023
  • (2022)Data science for empowerment: understanding the data science training landscape for women and girls in AfricaGender, Technology and Development10.1080/09718524.2022.213756226:3(437-462)Online publication date: 25-Nov-2022
  • Show More Cited By

Index Terms

  1. Exploring data science for public good in South Africa: evaluating factors that lead to success

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    dg.o '18: Proceedings of the 19th Annual International Conference on Digital Government Research: Governance in the Data Age
    May 2018
    889 pages
    ISBN:9781450365260
    DOI:10.1145/3209281
    Publication rights licensed to ACM. ACM acknowledges that this contribution was authored or co-authored by an employee, contractor or affiliate of a national government. As such, the Government retains a nonexclusive, royalty-free right to publish or reproduce this article, or to allow others to do so, for Government purposes only.

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 30 May 2018

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. data readiness
    2. data science for social good
    3. e-government
    4. government data
    5. open data
    6. representation and inclusion

    Qualifiers

    • Short-paper

    Conference

    dg.o '18

    Acceptance Rates

    Overall Acceptance Rate 150 of 271 submissions, 55%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)17
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 20 Jan 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)A collaborative approach to advancing research and training in Public Health Data Science—challenges, opportunities, and lessons learntFrontiers in Public Health10.3389/fpubh.2024.147494712Online publication date: 11-Dec-2024
    • (2023)Adapting to the digital age: Investigating the frameworks for financial services in modern communitiesBusiness Strategy & Development10.1002/bsd2.3037:1Online publication date: 26-Sep-2023
    • (2022)Data science for empowerment: understanding the data science training landscape for women and girls in AfricaGender, Technology and Development10.1080/09718524.2022.213756226:3(437-462)Online publication date: 25-Nov-2022
    • (undefined)The Ghost in the Machine: The Ethical Risks of AISSRN Electronic Journal10.2139/ssrn.3719745

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media