Abstract
The 17th Conference on Database Systems for Business, Technology, and Web (BTW 2017) of the German Informatics Society (GI) took place in March 2017 at the University of Stuttgart, Germany. For the first time at a BTW conference, a Data Science Challenge was held, organized by the University of Stuttgart and sponsor IBM. We challenged the participants to solve a data analysis task within one month and to present their results at the BTW. In this article, we give an overview of how the Challenge was organized and introduce the task that the participants had to solve. In the subsequent sections, the four finalist groups describe their approaches and results.
Notes
Especially because, in Manhattan alone, about 2.8 million taxi trips take place each week [8].
References
Australian Bureau of Statistics (2017) Time series analysis: the basics
Chaudhuri S, Dayal U (1997) An overview of data warehousing and OLAP technology. ACM SIGMOD Rec 26(1):65–74
Cortes C, Vapnik V (1995) Support-vector networks. Mach Learn 20(3):273–297
Ho T (1995) Random decision forests. In: Proceedings of the 3rd International Conference on Document Analysis and Recognition, pp 278–282
Keim D, Andrienko G, Fekete JD, Görg C, Kohlhammer J, Melançon G (2008) Visual analytics: definition, process, and challenges. In: Information visualization. Springer, Berlin Heidelberg, pp 154–175
Kisilevich S, Mansmann F, Nanni M, Rinzivillo S (2010) Spatio-temporal clustering. Springer, Boston, pp 855–874
Maciejewski R, Rudolph S, Hafen R, Abusalah A, Yakout M, Ouzzani M, Cleveland WS, Grannis S, Ebert DS (2010) A visual analytics approach to understanding spatiotemporal hotspots. IEEE Trans Vis Comput Graph 16(2):205–220
NYC Taxi & Limousine Commission (2017) TLC trip record data
Slocum TA, McMaster RB, Kessler FC, Howard HH (2005) Thematic cartography and geographic visualization. Geographic information science. Pearson, Prentice Hall
Thomas J, Kielman J (2009) Challenges for visual analytics. Inf Vis 8(4):309–314
Van Brummelen G (2013) Heavenly mathematics: the forgotten art of spherical trigonometry. Princeton University Press, Princeton
World Health Organization (2017) Top 10 causes of death
Acknowledgements
The organizers of the Data Science Challenge thank the participants and the jury for the time they invested. Furthermore, we would like to thank INFOS for sponsoring the prize money of the event.
This work was partially funded by the IST-Hochschule University of Applied Sciences and by the PhD program Online Participation, supported by the North Rhine-Westphalian funding scheme Fortschrittskollegs.
This work was partly funded by the German Federal Ministry of Education and Research within the project Competence Center for Scalable Data Services and Solutions (ScaDS) Dresden/Leipzig (BMBF 01IS14014B) and Explicit Privacy-Preserving Host Intrusion Detection System EXPLOIDS (BMBF 16KIS0522K).
About this article
Cite this article
Hirmer, P., Waizenegger, T., Falazi, G. et al. The First Data Science Challenge at BTW 2017. Datenbank Spektrum 17, 207–222 (2017). https://doi.org/10.1007/s13222-017-0263-8
DOI: https://doi.org/10.1007/s13222-017-0263-8