skip to main content
10.1145/3159450.3159550acmconferencesArticle/Chapter ViewAbstractPublication PagessigcseConference Proceedingsconference-collections
research-article

A Functional Approach to Data Science in CS1

Published: 21 February 2018 Publication History

Abstract

As part of the development of a new interdisciplinary initiative in data science that draws from statistics, mathematics, computer science, and the social sciences, we have developed a new introductory CS course that emphasizes data science and that we refer to as DataCSCi. Unlike other introductory data science courses, such as Berkeley's Data 8, our course retains the broad array of concepts necessary not only to introduce programming principles related to data science, but also to prepare students for the second course in our standard introductory computer science sequence. In particular, the course includes coverage of recursion (numeric and structural), unit testing, linked data structures, and other concepts we rely upon in subsequent courses in computer science. At the same time, we introduce students to a wide variety of techniques and approaches that support them in their subsequent work in data science, including techniques for wrangling, cleaning, and visualizing data. We achieve this combination of breadth and depth through two core approaches: We focus on a spiral "use then implement" approach and we focus on a functional model of programming using Scheme/Racket. While Python and R are the most commonly used languages for data science, we find that Scheme works particularly well to introduce students to concepts both complex, like map-reduce, and simple, like list filtering. In this paper, we report on the design of the curriculum, particularly the capstone project and the ways in which we incorporate the burgeoning subfield of data science for social good.

References

[1]
John Backus. 1978. Can Programming Be Liberated from the Von Neumann Style?: A Functional Style and Its Algebra of Programs. Commun. ACM, Vol. 21, 8 (Aug. 1978), 613--641. 1145/2493394.2493397
[2]
Laurie Williams. 2007. Lessons Learned from Seven Years of Pair Programming at North Carolina State University. SIGCSE Bull., Vol. 39, 4 (Dec. 2007), 79--83. 0097--8418

Cited By

View all
  • (2024)Engaging CS1 Students with Audio Themed AssignmentsJournal of Computing Sciences in Colleges10.5555/3665609.366562039:8(158-172)Online publication date: 17-May-2024
  • (2024)Investigating Student Mistakes in Introductory Data Science ProgrammingProceedings of the 55th ACM Technical Symposium on Computer Science Education V. 110.1145/3626252.3630884(1258-1264)Online publication date: 7-Mar-2024

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
SIGCSE '18: Proceedings of the 49th ACM Technical Symposium on Computer Science Education
February 2018
1174 pages
ISBN:9781450351034
DOI:10.1145/3159450
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 21 February 2018

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. CS1
  2. cross-disciplinary courses
  3. data science
  4. functional programming
  5. map-reduce
  6. racket
  7. scheme

Qualifiers

  • Research-article

Funding Sources

  • Roy J. Carver Charitable Trust

Conference

SIGCSE '18
Sponsor:

Acceptance Rates

SIGCSE '18 Paper Acceptance Rate 161 of 459 submissions, 35%;
Overall Acceptance Rate 1,787 of 5,146 submissions, 35%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)22
  • Downloads (Last 6 weeks)3
Reflects downloads up to 25 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Engaging CS1 Students with Audio Themed AssignmentsJournal of Computing Sciences in Colleges10.5555/3665609.366562039:8(158-172)Online publication date: 17-May-2024
  • (2024)Investigating Student Mistakes in Introductory Data Science ProgrammingProceedings of the 55th ACM Technical Symposium on Computer Science Education V. 110.1145/3626252.3630884(1258-1264)Online publication date: 7-Mar-2024

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media