skip to main content
research-article
Public Access

Computing with CORGIS: Diverse, Real-world Datasets for Introductory Computing

Published: 08 March 2017 Publication History

Abstract

To successfully bring introductory computing to non-CS majors, one needs to create a curriculum that will appeal to students from diverse disciplines. Several educational theories emphasize the need for introductory contexts that align with students' long-term goals and are perceived as useful. Data Science, using algorithms to manipulate real-world data and interpreting the results, has emerged as a field with cross-disciplinary value, and has strong potential as an appealing context for introductory computing courses. However, it is not easy to find, clean, and integrate datasets that will satisfy a broad variety of learners. The CORGIS project (https://think.cs.vt.edu/corgis) enables instructors to easily incorporate data science into their classroom. Specifically, it provides over 40 datasets in areas including history, politics, medicine, and education. Additionally, the CORGIS infrastructure supports the integration of new datasets with simple libraries for Java, Python, and Racket, thus empowering introductory students to write programs that manipulate real data. Finally, the CORGIS web-based tools allow learners to visualize and explore datasets without programming, enabling data science lessons on day one. We have incorporated CORGIS assignments into an introductory course for non-majors to study their impact on learners' motivation, with positive initial results. These results indicate that external adopters are likely to find the CORGIS tools and materials useful in their own pedagogical pursuits.

References

[1]
R. E. Anderson, M. D. Ernst, R. Ordónez, P. Pham, and S. A. Wolfman. Introductory programming meets the real world: Using real problems and data in CS1. In Proceedings of the 45th ACM Technical Symposium on Computer Science Education, SIGCSE '14, pages 465--466, 2014.
[2]
A. C. Bart, E. Tilevich, S. Hall, T. Allevato, and C. A. Shaffer. Transforming introductory computer science projects via real-time web data. In Proceedings of the 45th ACM Technical Symposium on Computer Science Education, SIGCSE '14, pages 289--294, 2014.
[3]
A. C. Bart, E. Tilevich, C. A. Shaffer, and D. Kafura. Position paper: From interest to usefulness with BlockPy, a block-based, educational environment. In Blocks and Beyond Workshop (Blocks and Beyond), 2015 IEEE, pages 87--89, 2015.
[4]
D. Burlinson, M. Mehedint, C. Grafer, K. Subramanian, J. Payton, P. Goolkasian, M. Youngblood, and R. Kosara. BRIDGES: A system to enable creation of engaging data structures assignments with real-world data and visualizations. In Proceedings of the 47th ACM Technical Symposium on Computing Science Education, pages 18--23. ACM, 2016.
[5]
J.-I. Choi and M. Hannafin. Situated cognition and learning environments: Roles, structures, and implications for design. Educational Technology Research and Development, 43(2):53--69, 1995.
[6]
P. DePasquale. Exploiting on-line data sources as the basis of programming projects. In Proceedings of the 37th SIGCSE Technical Symposium on Computer Science Education, SIGCSE '06, pages 283--287, 2006.
[7]
A. E. Egger. Engaging students in earthquakes via real-time data and decisions. Science, 336(6089):1654--1655, 2012.
[8]
M. Guzdial. Exploring hypotheses about media computation. In Proceedings of the ninth annual international ACM conference on International computing education research, pages 19--26. ACM, 2013.
[9]
M. Guzdial and A. E. Tew. Imagineering inauthentic legitimate peripheral participation: an instructional design approach for motivating computing education. In Proceedings of the second international workshop on Computing education research, pages 51--58. ACM, 2006.
[10]
O. A. Hall-Holt and K. R. Sanft. Statistics-infused introduction to computer science. In Proceedings of the 46th ACM Technical Symposium on Computer Science Education, SIGCSE '15, pages 138--143, 2015.
[11]
N. A. Hamid. A generic framework for engaging online data sources in introductory programming courses. In Proceedings of the 2016 ACM Conference on Innovation and Technology in Computer Science Education, 2016.
[12]
B. D. Jones. Motivating students to engage in learning: The MUSIC model of academic motivation. International Journal of Teaching and Learning in Higher Education, 21(2):272--285, 2009.
[13]
J. Lave and E. Wenger. Situated learning: Legitimate peripheral participation. Cambridge university press, 1991.
[14]
J. Manyika, M. Chui, B. Brown, J. Bughin, R. Dobbs, C. Roxburgh, and A. H. Byers. Big data: The next frontier for innovation, competition, and productivity. 2011.
[15]
N. Parlante. Codingbat. Com (Retrieved 1/08/2011 from http://codingbat. com, 2011), 2015.
[16]
D. G. Sullivan. A data-centric introduction to computer science for non-majors. In Proceeding of the 44th ACM Technical Symposium on Computer Science Education, SIGCSE '13, pages 71--76, 2013.
[17]
L. Torgo. Data Mining with R, learning with case studies. Chapman and Hall/CRC, 2010.

Cited By

View all
  • (2023)AI and parallelism in CS1Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence and Thirty-Fifth Conference on Innovative Applications of Artificial Intelligence and Thirteenth Symposium on Educational Advances in Artificial Intelligence10.1609/aaai.v37i13.26876(15798-15806)Online publication date: 7-Feb-2023
  • (2023)Constructionist approaches to critical data literacy: A reviewProceedings of the 22nd Annual ACM Interaction Design and Children Conference10.1145/3585088.3589367(112-123)Online publication date: 19-Jun-2023
  • (2023)Assessing the Impact of Specifications Grading on a Data Visualization Course2023 IEEE Frontiers in Education Conference (FIE)10.1109/FIE58773.2023.10342899(1-6)Online publication date: 18-Oct-2023
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Inroads
ACM Inroads  Volume 8, Issue 2
June 2017
73 pages
ISSN:2153-2184
EISSN:2153-2192
DOI:10.1145/3095781
Issue’s Table of Contents
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 08 March 2017
Published in INROADS Volume 8, Issue 2

Permissions

Request permissions for this article.

Check for updates

Badges

  • Best Paper

Author Tags

  1. CORGIS
  2. authenticity
  3. big data
  4. computational thinking
  5. data science
  6. motivation
  7. pedagogy
  8. real-world data

Qualifiers

  • Research-article

Funding Sources

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)187
  • Downloads (Last 6 weeks)32
Reflects downloads up to 02 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2023)AI and parallelism in CS1Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence and Thirty-Fifth Conference on Innovative Applications of Artificial Intelligence and Thirteenth Symposium on Educational Advances in Artificial Intelligence10.1609/aaai.v37i13.26876(15798-15806)Online publication date: 7-Feb-2023
  • (2023)Constructionist approaches to critical data literacy: A reviewProceedings of the 22nd Annual ACM Interaction Design and Children Conference10.1145/3585088.3589367(112-123)Online publication date: 19-Jun-2023
  • (2023)Assessing the Impact of Specifications Grading on a Data Visualization Course2023 IEEE Frontiers in Education Conference (FIE)10.1109/FIE58773.2023.10342899(1-6)Online publication date: 18-Oct-2023
  • (2021)Visual recipes for slicing and dicing data: teaching data wrangling using subgoal graphicsProceedings of the 21st Koli Calling International Conference on Computing Education Research10.1145/3488042.3488063(1-10)Online publication date: 17-Nov-2021
  • (2021)One Size Doesn't Fit All: Diversifying Data Science Course Projects by Student Background and InterestsProceedings of the 26th ACM Conference on Innovation and Technology in Computer Science Education V. 110.1145/3430665.3456375(67-73)Online publication date: 26-Jun-2021
  • (2021)Mapping Materials to Curriculum Standards for Design, Alignment, Audit, and SearchProceedings of the 52nd ACM Technical Symposium on Computer Science Education10.1145/3408877.3432388(295-301)Online publication date: 3-Mar-2021
  • (2021)CS-MaterialsJournal of Parallel and Distributed Computing10.1016/j.jpdc.2021.05.014157:C(316-330)Online publication date: 1-Nov-2021
  • (2020)Learning Machine Learning with Personal Data Helps Stakeholders Ground Advocacy Arguments in Model MechanicsProceedings of the 2020 ACM Conference on International Computing Education Research10.1145/3372782.3406252(67-78)Online publication date: 10-Aug-2020
  • (2019)Classifying Pedagogical Material to Improve Adoption of Parallel and Distributed Computing Topics2019 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)10.1109/IPDPSW.2019.00060(312-319)Online publication date: May-2019
  • (2018)Visualization, Assessment and Analytics in Data Structures Learning ModulesProceedings of the 49th ACM Technical Symposium on Computer Science Education10.1145/3159450.3159460(864-869)Online publication date: 21-Feb-2018

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Digital Edition

View this article in digital edition.

Digital Edition

Magazine Site

View this article on the magazine site (external)

Magazine Site

Login options

Full Access

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media