Abstract
As part of an ongoing design science research project, this paper presents a systematic literature review and the classification of 214 papers scoping the work on Data Science (DS) in the fields of Information Systems and Human-Computer Interaction. The overall search was conducted on Web of Science, Science Direct and ACM Digital Library, for papers about the design of IT artefacts for Data Science, over the period of 1997 until 2017. The work identifies promising research clusters in the crossroads of IS, HCI and Design, but few studies were found with concrete guidance on how to design a system for DS, when targeting for broader technical and business user profiles and multi-domain applications. In this paper, we propose a DS lifecycle process and a set of design principles to guide the design of such a system to support the whole creative DS lifecycle process.
A prior version of this paper has been published in the ISD2018 Proceedings (http://aisel.aisnet.org/isd2014/proceedings2018).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Chen, H., Chiang, R.H.L., Storey, V.C.: Business intelligence and analytics: from big data to big impact. MIS Q. 36, 1165–1188 (2012)
Manyika, J., Chui, M., Bughin, J., et al.: Big Data: The Next Frontier for Innovation, Competition, and Productivity, pp. 1–22. McKinsey Company (2011)
McAfee, A., Brynjolfsson, E., Davenport, T.H., et al.: Big data: the management revolution. Harv. Bus. Rev. 90, 61–67 (2012). https://doi.org/00475394
Mockus, A.: Operational data are missing, incorrect, and decontextualized. In: Perspectives on Data Science for Software Engineering, pp. 317–322 (2016)
Hevner, A.R., March, S.T., Park, J., Ram, S.: Design science in information. Syst. Res. 28, 75–105 (2004). https://doi.org/10.2307/25148869
Peffers, K., Tuunanen, T., Rothenberger, M.A., Chatterjee, S.: A design science research methodology for information systems research. J. Manage. Inf. Syst. 24, 45–77 (2008). https://doi.org/10.2753/MIS0742-1222240302
Nagle, T., Sammon, D.: The development of a design research canvas for data practitioners. J. Decis. Syst. 25, 369–380 (2016). https://doi.org/10.1080/12460125.2016.1187386
Webster, J., Watson, R.T.: Analyzing the past to prepare for the future: writing a literature review. MIS Q. 26, xii–xxiii (2002). https://doi.org/1210112213
Levy, Y., Ellis, T.J.: A systems approach to conduct an effective literature review in support of information systems research. Inf. Sci. J. 9, 181–212 (2006). https://doi.org/10.1049/cp.2009.0961
Tranfield, D., Denyer, D., Smart, P.: Towards a methodology for developing evidence-informed management knowledge by means of systematic review*. Br. J. Manage. 14, 207–222 (2003). https://doi.org/10.1111/1467-8551.00375
Lee, I.: Big data: dimensions, evolution, impacts, and challenges. Bus. Horiz. 60, 293–303 (2017). https://doi.org/10.1016/j.bushor.2017.01.004
Storey, V.C., Song, I.-Y.: Big data technologies and management: what conceptual modeling can do. Data Knowl. Eng. 108, 50–67 (2017). https://doi.org/10.1016/j.datak.2017.01.001
Cheng, S., Liu, B., Shi, Y., et al.: Evolutionary computation and big data: key challenges and future directions. In: Tan, Y., Shi, Y. (eds.) Data Mining and Big Data 2016, pp. 3–14. Springer International Publishing, AG, GEWERBESTRASSE 11, CHAM, CH-6330, Switzerland (2016)
Philip Chen, C.L., Zhang, C.-Y., Chen, C.L.P., Zhang, C.-Y.: Data-intensive applications, challenges, techniques and technologies: a survey on Big Data. Inf. Sci. (Ny) 275, 314–347 (2014). https://doi.org/10.1016/j.ins.2014.01.015
Rabl, T., Sadoghi, M., Jacobsen, H.-A., et al.: Solving big data challenges for enterprise application performance management. Proc. VLDB Endow. 5, 1724–1735 (2012). https://doi.org/10.14778/2367502.2367512
Miller, H.G., Mork, P.: From data to decisions: a value chain for big data. IT Prof. 15, 57–59 (2013). https://doi.org/10.1109/MITP.2013.11
Demirkan, H., Delen, D.: Leveraging the capabilities of service-oriented decision support systems: putting analytics and big data in cloud. Decis. Support Syst. 55, 412–421 (2013). https://doi.org/10.1016/j.dss.2012.05.048
Kowalczyk, M., Buxmann, P.: An ambidextrous perspective on business intelligence and analytics support in decision processes: insights from a multiple case study. Decis. Support Syst. 80, 1–13 (2015). https://doi.org/10.1016/j.dss.2015.08.010
Hashem, I.A.T., Yaqoob, I., Anuar, N.B., et al.: The rise of “big data” on cloud computing: review and open research issues. Inf. Syst. 47, 98–115 (2015). https://doi.org/10.1016/j.is.2014.07.006
Haug, F.S.: Bad big data science. In: Joshi, J., Karypis, G., Liu, L., Hu, X., Ak, R., Xia, Y., Xu, W., Sato, A.H., Rachuri, S., Ungar, L., Yu, P.S., Govindaraju, R., Suzumura, T. (eds.) 2016 IEEE International Conference on Big Data (BIG DATA), pp. 2863–2871. IEEE, New York, USA (2016)
Hazen, B.T., Boone, C.A., Ezell, J.D., Jones-Farmer, L.A.: Data quality for data science, predictive analytics, and big data in supply chain management: an introduction to the problem and suggestions for research and applications. Int. J. Prod. Econ. 154, 72–80 (2014). https://doi.org/10.1016/j.ijpe.2014.04.018
Vedula, S.S., Hager, G.D.: Surgical data science: the new knowledge domain. Innov. Surg. Sci. 2, 109+ (2017). https://doi.org/10.1515/iss-2017-0004
Westra, B.L., Sylvia, M., Weinfurter, E.F., et al.: Big data science: a literature review of nursing research exemplars. Nurs. Outlook 65, 549–561 (2017). https://doi.org/10.1016/j.outlook.2016.11.021
Roy, S., Ray, R., Roy, A., et al.: IoT, big data science & analytics, cloud computing and mobile app based hybrid system for smart agriculture. In: Chakrabarti, S., Saha, H. (eds.) 8th Annual Industrial Automation and Electromechanical Engineering Conference (IEMECON), pp. 303–304. IEEE, New York, USA (2017)
Woodard, J.: Big data and Ag-analytics: an open source, open data platform for agricultural & environmental finance, insurance, and risk. Agric. Financ. Rev. 76, 15–26 (2016). https://doi.org/10.1108/AFR-03-2016-0018
Liu, M.-C., Huang, Y.-M.: The use of data science for education: the case of social-emotional learning. Smart Learn. Environ. 4, 1 (2017). https://doi.org/10.1186/s40561-016-0040-4
Conte, R., Giardini, F.: Towards computational and behavioral social science. Eur. Psychol. 21, 131–140 (2016). https://doi.org/10.1027/1016-9040/a000257
Chang, R.M., Kauffman, R.J., Kwon, Y.: Understanding the paradigm shift to computational social science in the presence of big data. Decis. Support Syst. 63, 67–80 (2014). https://doi.org/10.1016/j.dss.2013.08.008
Bibri, S.E., Krogstie, J.: Smart sustainable cities of the future: an extensive interdisciplinary literature review. Sustain. Cities Soc. 31, 183–212 (2017). https://doi.org/10.1016/j.scs.2017.02.016
Fischer, F., Fuchs, J., Mansmann, F., Keim, D.A.: BANKSAFE: visual analytics for big data in large-scale computer networks. Inf. Vis. 14, 51–61 (2015). https://doi.org/10.1177/1473871613488572
Talbot, J., Lee, B., Kapoor, A., Tan, D.S.: EnsembleMatrix: interactive visualization to support machine learning with multiple classifiers. In: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, pp. 1283–1292 (2009)
Bandi, A., Fellah, A.: Crafting a data visualization course for the tech industry. J. Comput. Sci. Coll. 33, 46–56 (2017)
Snyder, J.: Vernacular visualization practices in a citizen science project. In: Proceedings of the 2017 ACM Conference on Computer Supported Cooperative Work and Social Computing, pp. 2097–2111. ACM, New York, NY, USA (2017)
Bumblauskas, D., Nold, H., Bumblauskas, P., Igou, A.: Big data analytics: transforming data to action. Bus. Process. Manage. J. 23, 703–720 (2017). https://doi.org/10.1108/BPMJ-03-2016-0056
Horita, F.E.A.A., de Albuquerque, J.P., Marchezini, V., Mendiondo, E.M.: Bridging the gap between decision-making and emerging big data sources: an application of a model-based framework to disaster management in Brazil. Decis. Support Syst. 97, 12–22 (2017). https://doi.org/10.1016/j.dss.2017.03.001
Crotty, A., Galakatos, A., Zgraggen, E., et al.: The case for interactive data exploration accelerators (IDEAs). In: Proceedings of the Workshop on Human-in-the-Loop Data Analytics—HILDA’16, pp. 1–6. ACM Press, New York, New York, USA(2016)
Grainger, S., Mao, F., Buytaert, W.: Environmental data visualisation for non-scientific contexts: literature review and design framework. Environ. Model Softw. 85, 299–318 (2016)
Howe, B., Franklin, M., Haas, L., et al.: Data science education: we’re missing the boat, again. In: 2017 IEEE 33RD International Conference on Data Engineering, pp. 1473–1474. IEEE, New York, USA (2017)
Dichev, C., Dicheva, D., Salem, W., et al.: Towards data science literacy. Procedia Comput. Sci. 108, 2151–2160 (2017). https://doi.org/10.1016/j.procs.2017.05.240
Newman, R., Chang, V., Walters, R.J., Wills, G.B.: Model and experimental development for business data science. Int. J. Inf. Manage. 36, 607–617 (2016). https://doi.org/10.1016/j.ijinfomgt.2016.04.004
Das, M., Cui, R., Campbell, D. R., et al.: Towards methods for systematic research on big data. In 2015 IEEE International Conference on Big Data, pp. 2072–2081. IEEE, New York, USA (2015)
Hoffman, S., Podgurski, A.: Big bad data: law, public health, and biomedical databases. J. Law Med. Ethics 41, 56–60 (2013). https://doi.org/10.1111/jlme.12040
Brunswicker, S., Bertino, E., Matei, S.: Big data for open digital innovation—a research roadmap. Big Data Res. 2, 53–58 (2015). https://doi.org/10.1016/j.bdr.2015.01.008
Saltz, J., Shamshurin, I., Connors, C.: Predicting data science sociotechnical execution challenges by categorizing data science projects. J. Assoc. Inf. Sci. Technol. 68, 2720–2728 (2017). https://doi.org/10.1002/asi.23873
Carbone, A., Jensen, M., Sato, A.-H.: Challenges in data science: a complex systems perspective. Chaos Solitons Fractals 90, 1–7 (2016). https://doi.org/10.1016/j.chaos.2016.04.020
Kazakci, A.O.: Data science as a new frontier for design. In: Weber, C., Husung, S., Cantamessa, M., Cascini, G., Marjanovic, D., Venkataraman, S. (eds.) Design Information and Knowledge Management, ICED 15, vol. 10. DESIGN SOC, Glasgow, England (2015)
Larson, D., Chang, V.: A review and future direction of agile, business intelligence, analytics and data science. Int. J. Inf. Manage. 36, 700–710 (2016). https://doi.org/10.1016/j.ijinfomgt.2016.04.013
Demchenko, Y., Belloum, A., Los, W., et al.: EDISON data science framework: a foundation for building data science profession for research and industry. In: 2016 8th IEEE International Conference on Cloud Computing Technology and Science (CLOUDCOM 2016), pp. 620–626. IEEE, New York, USA (2016)
Anya, O., Moore, B., Kieliszewski, C., et al.: Understanding the practice of discovery in enterprise big data science: an agent-based approach. Procedia Manuf. 3, 882–889 (2015). https://doi.org/10.1016/j.promfg.2015.07.345
Cao, L.: Data science: a comprehensive overview. ACM Comput. Surv. 50, 1–42 (2017). https://doi.org/10.1145/3076253
Chuprina, S., Alexandrov, V., Alexandrov, N.: Using ontology engineering methods to improve computer science and data science skills. Procedia Comput. Sci. 80:1780–1790 (2016)
Acknowledgements
The research work was partially funded by the European Commission, under the PORTUGAL 2020 structural fund (CENTRO2020), for the period of 2014–2020 (Project Ref. 2016/017728).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Almeida, A.S., Roque, L., da Cunha, P. (2019). How to Design an Interactive System for Data Science: Learning from a Literature Review. In: Andersson, B., Johansson, B., Barry, C., Lang, M., Linger, H., Schneider, C. (eds) Advances in Information Systems Development. Lecture Notes in Information Systems and Organisation, vol 34. Springer, Cham. https://doi.org/10.1007/978-3-030-22993-1_8
Download citation
DOI: https://doi.org/10.1007/978-3-030-22993-1_8
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-22992-4
Online ISBN: 978-3-030-22993-1
eBook Packages: Computer ScienceComputer Science (R0)