Skip to main content

How to Design an Interactive System for Data Science: Learning from a Literature Review

  • Conference paper
  • First Online:
Advances in Information Systems Development

Abstract

As part of an ongoing design science research project, this paper presents a systematic literature review and the classification of 214 papers scoping the work on Data Science (DS) in the fields of Information Systems and Human-Computer Interaction. The overall search was conducted on Web of Science, Science Direct and ACM Digital Library, for papers about the design of IT artefacts for Data Science, over the period of 1997 until 2017. The work identifies promising research clusters in the crossroads of IS, HCI and Design, but few studies were found with concrete guidance on how to design a system for DS, when targeting for broader technical and business user profiles and multi-domain applications. In this paper, we propose a DS lifecycle process and a set of design principles to guide the design of such a system to support the whole creative DS lifecycle process.

A prior version of this paper has been published in the ISD2018 Proceedings (http://aisel.aisnet.org/isd2014/proceedings2018).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Chen, H., Chiang, R.H.L., Storey, V.C.: Business intelligence and analytics: from big data to big impact. MIS Q. 36, 1165–1188 (2012)

    Article  Google Scholar 

  2. Manyika, J., Chui, M., Bughin, J., et al.: Big Data: The Next Frontier for Innovation, Competition, and Productivity, pp. 1–22. McKinsey Company (2011)

    Google Scholar 

  3. McAfee, A., Brynjolfsson, E., Davenport, T.H., et al.: Big data: the management revolution. Harv. Bus. Rev. 90, 61–67 (2012). https://doi.org/00475394

    Google Scholar 

  4. Mockus, A.: Operational data are missing, incorrect, and decontextualized. In: Perspectives on Data Science for Software Engineering, pp. 317–322 (2016)

    Chapter  Google Scholar 

  5. Hevner, A.R., March, S.T., Park, J., Ram, S.: Design science in information. Syst. Res. 28, 75–105 (2004). https://doi.org/10.2307/25148869

    Article  Google Scholar 

  6. Peffers, K., Tuunanen, T., Rothenberger, M.A., Chatterjee, S.: A design science research methodology for information systems research. J. Manage. Inf. Syst. 24, 45–77 (2008). https://doi.org/10.2753/MIS0742-1222240302

    Article  Google Scholar 

  7. Nagle, T., Sammon, D.: The development of a design research canvas for data practitioners. J. Decis. Syst. 25, 369–380 (2016). https://doi.org/10.1080/12460125.2016.1187386

    Article  Google Scholar 

  8. Webster, J., Watson, R.T.: Analyzing the past to prepare for the future: writing a literature review. MIS Q. 26, xii–xxiii (2002). https://doi.org/1210112213

    Google Scholar 

  9. Levy, Y., Ellis, T.J.: A systems approach to conduct an effective literature review in support of information systems research. Inf. Sci. J. 9, 181–212 (2006). https://doi.org/10.1049/cp.2009.0961

    Article  Google Scholar 

  10. Tranfield, D., Denyer, D., Smart, P.: Towards a methodology for developing evidence-informed management knowledge by means of systematic review*. Br. J. Manage. 14, 207–222 (2003). https://doi.org/10.1111/1467-8551.00375

    Article  Google Scholar 

  11. Lee, I.: Big data: dimensions, evolution, impacts, and challenges. Bus. Horiz. 60, 293–303 (2017). https://doi.org/10.1016/j.bushor.2017.01.004

    Article  Google Scholar 

  12. Storey, V.C., Song, I.-Y.: Big data technologies and management: what conceptual modeling can do. Data Knowl. Eng. 108, 50–67 (2017). https://doi.org/10.1016/j.datak.2017.01.001

    Article  Google Scholar 

  13. Cheng, S., Liu, B., Shi, Y., et al.: Evolutionary computation and big data: key challenges and future directions. In: Tan, Y., Shi, Y. (eds.) Data Mining and Big Data 2016, pp. 3–14. Springer International Publishing, AG, GEWERBESTRASSE 11, CHAM, CH-6330, Switzerland (2016)

    Chapter  Google Scholar 

  14. Philip Chen, C.L., Zhang, C.-Y., Chen, C.L.P., Zhang, C.-Y.: Data-intensive applications, challenges, techniques and technologies: a survey on Big Data. Inf. Sci. (Ny) 275, 314–347 (2014). https://doi.org/10.1016/j.ins.2014.01.015

    Article  Google Scholar 

  15. Rabl, T., Sadoghi, M., Jacobsen, H.-A., et al.: Solving big data challenges for enterprise application performance management. Proc. VLDB Endow. 5, 1724–1735 (2012). https://doi.org/10.14778/2367502.2367512

    Article  Google Scholar 

  16. Miller, H.G., Mork, P.: From data to decisions: a value chain for big data. IT Prof. 15, 57–59 (2013). https://doi.org/10.1109/MITP.2013.11

    Article  Google Scholar 

  17. Demirkan, H., Delen, D.: Leveraging the capabilities of service-oriented decision support systems: putting analytics and big data in cloud. Decis. Support Syst. 55, 412–421 (2013). https://doi.org/10.1016/j.dss.2012.05.048

    Article  Google Scholar 

  18. Kowalczyk, M., Buxmann, P.: An ambidextrous perspective on business intelligence and analytics support in decision processes: insights from a multiple case study. Decis. Support Syst. 80, 1–13 (2015). https://doi.org/10.1016/j.dss.2015.08.010

    Article  Google Scholar 

  19. Hashem, I.A.T., Yaqoob, I., Anuar, N.B., et al.: The rise of “big data” on cloud computing: review and open research issues. Inf. Syst. 47, 98–115 (2015). https://doi.org/10.1016/j.is.2014.07.006

    Article  Google Scholar 

  20. Haug, F.S.: Bad big data science. In: Joshi, J., Karypis, G., Liu, L., Hu, X., Ak, R., Xia, Y., Xu, W., Sato, A.H., Rachuri, S., Ungar, L., Yu, P.S., Govindaraju, R., Suzumura, T. (eds.) 2016 IEEE International Conference on Big Data (BIG DATA), pp. 2863–2871. IEEE, New York, USA (2016)

    Google Scholar 

  21. Hazen, B.T., Boone, C.A., Ezell, J.D., Jones-Farmer, L.A.: Data quality for data science, predictive analytics, and big data in supply chain management: an introduction to the problem and suggestions for research and applications. Int. J. Prod. Econ. 154, 72–80 (2014). https://doi.org/10.1016/j.ijpe.2014.04.018

    Article  Google Scholar 

  22. Vedula, S.S., Hager, G.D.: Surgical data science: the new knowledge domain. Innov. Surg. Sci. 2, 109+ (2017). https://doi.org/10.1515/iss-2017-0004

    Article  Google Scholar 

  23. Westra, B.L., Sylvia, M., Weinfurter, E.F., et al.: Big data science: a literature review of nursing research exemplars. Nurs. Outlook 65, 549–561 (2017). https://doi.org/10.1016/j.outlook.2016.11.021

    Article  Google Scholar 

  24. Roy, S., Ray, R., Roy, A., et al.: IoT, big data science & analytics, cloud computing and mobile app based hybrid system for smart agriculture. In: Chakrabarti, S., Saha, H. (eds.) 8th Annual Industrial Automation and Electromechanical Engineering Conference (IEMECON), pp. 303–304. IEEE, New York, USA (2017)

    Google Scholar 

  25. Woodard, J.: Big data and Ag-analytics: an open source, open data platform for agricultural & environmental finance, insurance, and risk. Agric. Financ. Rev. 76, 15–26 (2016). https://doi.org/10.1108/AFR-03-2016-0018

    Article  Google Scholar 

  26. Liu, M.-C., Huang, Y.-M.: The use of data science for education: the case of social-emotional learning. Smart Learn. Environ. 4, 1 (2017). https://doi.org/10.1186/s40561-016-0040-4

    Article  Google Scholar 

  27. Conte, R., Giardini, F.: Towards computational and behavioral social science. Eur. Psychol. 21, 131–140 (2016). https://doi.org/10.1027/1016-9040/a000257

    Article  Google Scholar 

  28. Chang, R.M., Kauffman, R.J., Kwon, Y.: Understanding the paradigm shift to computational social science in the presence of big data. Decis. Support Syst. 63, 67–80 (2014). https://doi.org/10.1016/j.dss.2013.08.008

    Article  Google Scholar 

  29. Bibri, S.E., Krogstie, J.: Smart sustainable cities of the future: an extensive interdisciplinary literature review. Sustain. Cities Soc. 31, 183–212 (2017). https://doi.org/10.1016/j.scs.2017.02.016

    Article  Google Scholar 

  30. Fischer, F., Fuchs, J., Mansmann, F., Keim, D.A.: BANKSAFE: visual analytics for big data in large-scale computer networks. Inf. Vis. 14, 51–61 (2015). https://doi.org/10.1177/1473871613488572

    Article  Google Scholar 

  31. Talbot, J., Lee, B., Kapoor, A., Tan, D.S.: EnsembleMatrix: interactive visualization to support machine learning with multiple classifiers. In: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, pp. 1283–1292 (2009)

    Google Scholar 

  32. Bandi, A., Fellah, A.: Crafting a data visualization course for the tech industry. J. Comput. Sci. Coll. 33, 46–56 (2017)

    Google Scholar 

  33. Snyder, J.: Vernacular visualization practices in a citizen science project. In: Proceedings of the 2017 ACM Conference on Computer Supported Cooperative Work and Social Computing, pp. 2097–2111. ACM, New York, NY, USA (2017)

    Google Scholar 

  34. Bumblauskas, D., Nold, H., Bumblauskas, P., Igou, A.: Big data analytics: transforming data to action. Bus. Process. Manage. J. 23, 703–720 (2017). https://doi.org/10.1108/BPMJ-03-2016-0056

    Article  Google Scholar 

  35. Horita, F.E.A.A., de Albuquerque, J.P., Marchezini, V., Mendiondo, E.M.: Bridging the gap between decision-making and emerging big data sources: an application of a model-based framework to disaster management in Brazil. Decis. Support Syst. 97, 12–22 (2017). https://doi.org/10.1016/j.dss.2017.03.001

    Article  Google Scholar 

  36. Crotty, A., Galakatos, A., Zgraggen, E., et al.: The case for interactive data exploration accelerators (IDEAs). In: Proceedings of the Workshop on Human-in-the-Loop Data Analytics—HILDA’16, pp. 1–6. ACM Press, New York, New York, USA(2016)

    Google Scholar 

  37. Grainger, S., Mao, F., Buytaert, W.: Environmental data visualisation for non-scientific contexts: literature review and design framework. Environ. Model Softw. 85, 299–318 (2016)

    Article  Google Scholar 

  38. Howe, B., Franklin, M., Haas, L., et al.: Data science education: we’re missing the boat, again. In: 2017 IEEE 33RD International Conference on Data Engineering, pp. 1473–1474. IEEE, New York, USA (2017)

    Google Scholar 

  39. Dichev, C., Dicheva, D., Salem, W., et al.: Towards data science literacy. Procedia Comput. Sci. 108, 2151–2160 (2017). https://doi.org/10.1016/j.procs.2017.05.240

    Article  Google Scholar 

  40. Newman, R., Chang, V., Walters, R.J., Wills, G.B.: Model and experimental development for business data science. Int. J. Inf. Manage. 36, 607–617 (2016). https://doi.org/10.1016/j.ijinfomgt.2016.04.004

    Article  Google Scholar 

  41. Das, M., Cui, R., Campbell, D. R., et al.: Towards methods for systematic research on big data. In 2015 IEEE International Conference on Big Data, pp. 2072–2081. IEEE, New York, USA (2015)

    Google Scholar 

  42. Hoffman, S., Podgurski, A.: Big bad data: law, public health, and biomedical databases. J. Law Med. Ethics 41, 56–60 (2013). https://doi.org/10.1111/jlme.12040

    Article  Google Scholar 

  43. Brunswicker, S., Bertino, E., Matei, S.: Big data for open digital innovation—a research roadmap. Big Data Res. 2, 53–58 (2015). https://doi.org/10.1016/j.bdr.2015.01.008

    Article  Google Scholar 

  44. Saltz, J., Shamshurin, I., Connors, C.: Predicting data science sociotechnical execution challenges by categorizing data science projects. J. Assoc. Inf. Sci. Technol. 68, 2720–2728 (2017). https://doi.org/10.1002/asi.23873

    Article  Google Scholar 

  45. Carbone, A., Jensen, M., Sato, A.-H.: Challenges in data science: a complex systems perspective. Chaos Solitons Fractals 90, 1–7 (2016). https://doi.org/10.1016/j.chaos.2016.04.020

    Article  MathSciNet  MATH  Google Scholar 

  46. Kazakci, A.O.: Data science as a new frontier for design. In: Weber, C., Husung, S., Cantamessa, M., Cascini, G., Marjanovic, D., Venkataraman, S. (eds.) Design Information and Knowledge Management, ICED 15, vol. 10. DESIGN SOC, Glasgow, England (2015)

    Google Scholar 

  47. Larson, D., Chang, V.: A review and future direction of agile, business intelligence, analytics and data science. Int. J. Inf. Manage. 36, 700–710 (2016). https://doi.org/10.1016/j.ijinfomgt.2016.04.013

    Article  Google Scholar 

  48. Demchenko, Y., Belloum, A., Los, W., et al.: EDISON data science framework: a foundation for building data science profession for research and industry. In: 2016 8th IEEE International Conference on Cloud Computing Technology and Science (CLOUDCOM 2016), pp. 620–626. IEEE, New York, USA (2016)

    Google Scholar 

  49. Anya, O., Moore, B., Kieliszewski, C., et al.: Understanding the practice of discovery in enterprise big data science: an agent-based approach. Procedia Manuf. 3, 882–889 (2015). https://doi.org/10.1016/j.promfg.2015.07.345

    Article  Google Scholar 

  50. Cao, L.: Data science: a comprehensive overview. ACM Comput. Surv. 50, 1–42 (2017). https://doi.org/10.1145/3076253

    Article  Google Scholar 

  51. Chuprina, S., Alexandrov, V., Alexandrov, N.: Using ontology engineering methods to improve computer science and data science skills. Procedia Comput. Sci. 80:1780–1790 (2016)

    Article  Google Scholar 

Download references

Acknowledgements

The research work was partially funded by the European Commission, under the PORTUGAL 2020 structural fund (CENTRO2020), for the period of 2014–2020 (Project Ref. 2016/017728).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ana Sofia Almeida .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Almeida, A.S., Roque, L., da Cunha, P. (2019). How to Design an Interactive System for Data Science: Learning from a Literature Review. In: Andersson, B., Johansson, B., Barry, C., Lang, M., Linger, H., Schneider, C. (eds) Advances in Information Systems Development. Lecture Notes in Information Systems and Organisation, vol 34. Springer, Cham. https://doi.org/10.1007/978-3-030-22993-1_8

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-22993-1_8

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-22992-4

  • Online ISBN: 978-3-030-22993-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics