skip to main content
10.1145/2839509.2850516acmconferencesArticle/Chapter ViewAbstractPublication PagessigcseConference Proceedingsconference-collections
abstract

Enhancing Teaching of Big Data by Using Real World Datasets

Published:17 February 2016Publication History

ABSTRACT

This lightning talk will focus on our experience of teaching a graduate level Big Data course. Traditionally, such courses have relied on "WordCount" style problems, which involve computing the simple count of words in a corpus of documents using the distributed MapReduce framework. While this is certainly a good way of introducing the students to the BigData framework, more real world examples are needed to motivate students. Further, since a majority of courses require students to work on a large project as part of this course, it is essential that they have access to a diverse and interesting set of data. In our course, we experimented with various data sources, such as text from real-time, streaming news articles, twitter feeds, and property price data from various zip codes in a county. The students were involved in gathering the data, designing and implementing MapReduce style algorithms for distributed processing, and presenting their findings. The feedback was extremely positive and we would like to develop this approach further. In this talk, we will present some ideas on how to collect and analyze real world datasets that are suitable for Big Data analysis. We would also encourage further inputs from the audience about this topic.

Index Terms

  1. Enhancing Teaching of Big Data by Using Real World Datasets

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in
          • Published in

            cover image ACM Conferences
            SIGCSE '16: Proceedings of the 47th ACM Technical Symposium on Computing Science Education
            February 2016
            768 pages
            ISBN:9781450336857
            DOI:10.1145/2839509

            Copyright © 2016 Owner/Author

            Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            • Published: 17 February 2016

            Check for updates

            Qualifiers

            • abstract

            Acceptance Rates

            SIGCSE '16 Paper Acceptance Rate105of297submissions,35%Overall Acceptance Rate1,595of4,542submissions,35%

            Upcoming Conference

            SIGCSE Virtual 2024