ABSTRACT
In this teaching tip and courseware note we describe a series of hands on activities and exercises that we've used to introduce the notion of big data analytics to a wide range of audience. These exercises range in complexity from a paper and pencil thought exercise, to using Google Trends for simple explorations, to using a spread sheet to simulate the iterative nature of Google's PageRank algorithm, to programming with a Python based map-reduce framework. These exercises have been used in courses to train high school teachers in data science, full semester university courses (undergraduate and graduate), and CS education outreach efforts. Feedback has been positive as to their efficacy.
- Thomas H. Davenport and Jeanne G. Harris. 2010. Analytics at Work: Smarter Decisions, Better Results. Harvard Business Review Press.Google Scholar
- Lawrence Page, Sergey Brin, Rajeev Motwani, and Terry Winograd. 1999. The PageRank Citation Ranking: Bringing Order to the Web. Technical Report 1999-66. Stanford InfoLab. http://ilpubs.stanford.edu:8090/422/Google Scholar
- Seth Stephens-Davidowitz. 2017. Everybody Lies: Big Data, New Data, and What the Internet can tell us about who we really are. William Morrow. Google ScholarDigital Library
- Ben Willington. 2015. How Software in Half of NYC Cabs Generates $5.2 Million a Year in Extra Tips. (January 2015). Retrieved Jan 21, 2018 from http://iquantny.tumblr.com/post/107245431809/ how-software-in-half-of-nyc-cabs-generates-52 1 https://pythonhosted.org/mrjob Abstract 1 Data Driven Decision Making 2 Data Detective: How much to tip? 3 How does ``Big'' make a difference? 4 Need for Massive Parallelism 5 MapReduce and PageRank 6 Reflection ReferencesGoogle Scholar
Index Terms
- Introducing big data analytics in high school and college
Recommendations
Introducing big data topics: a multicourse experience report from Norway
TEEM '15: Proceedings of the 3rd International Conference on Technological Ecosystems for Enhancing MulticulturalityIn the last few years we have witnessed an explosion of interest in Big Data in both academic and industry arenas. Big Data is about the capture, storage, analysis and visualization of huge volumes of data in both structured and unstructured forms ...
Developing a game-based learning curriculum for "Big Data" in middle school (abstract only)
SIGCSE '14: Proceedings of the 45th ACM technical symposium on Computer science educationExposing students early to computer science may influence their choice of career, and there is increasing recognition that even for students who do not pursue computer science careers, computational literacy is important. This poster reports on a ...
Computer programming in high school vs. college
SIGCSE '96: Proceedings of the twenty-seventh SIGCSE technical symposium on Computer science educationAlthough there exists a computer science certification exam for high school teachers in some states, there appear to be no general requirements for teaching computer science at the high school level in most states. This may imply that CS material taught ...
Comments