skip to main content
10.1145/3545945.3569764acmconferencesArticle/Chapter ViewAbstractPublication PagessigcseConference Proceedingsconference-collections
research-article

Ultra-Lightweight Early Prediction of At-Risk Students in CS1

Published:03 March 2023Publication History

ABSTRACT

Early prediction of students at risk of doing poorly in CS1 can enable early interventions or class adjustments. Preferably, prediction methods would be lightweight, not requiring much extra activity or data-collection work from instructors beyond what they already do. Previous methods included giving surveys, collecting (potentially sensitive) demographic data, introducing clicker questions into lectures, or using locally-developed systems that analyze programming behavior, each requiring some effort by instructors. Today, a widely used textbook / learning system in CS1 classes is zyBooks, used by several hundred thousand students annually. The system automatically collects data related to reading, homework, and programming assignments. For a 300+ student CS1 class, we found that three data metrics, auto-collected by that system in early weeks (1-4), were good at predicting performance on the week-6 midterm exam: non-earnest completion of the assigned readings, struggle on the coding homework, and low scores on the programming assignments, with correlation magnitudes of 0.44, 0.58, and 0.72, respectively. We combined those metrics in a decision tree model to predict students at-risk of failing the midterm exam (<70%, meaning D or F), and achieved 85% prediction accuracy with 82% sensitivity and 89% specificity, which is higher than previously-published early-prediction approaches. The approach may mean that thousands of instructors already using zyBooks (or a similar system) can get a more accurate early prediction of at-risk students, without requiring extra effort or activities, and avoiding collection of sensitive demographic data.

References

  1. Ahadi, A., Lister, R., Haapala, H., & Vihavainen, A. (2015, August). Exploring machine learning methods to automatically identify students in need of assistance. In Proceedings of the eleventh annual international conference on international computing education research (pp. 121--130).Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Alspaugh, C. A. (1972). Identification of some components of computer programming aptitude. Journal for Research in Mathematics Education, 89--98.Google ScholarGoogle ScholarCross RefCross Ref
  3. CodeHS, https://codehs.com/, accessed 2022.Google ScholarGoogle Scholar
  4. Campbell, J., Horton, D., & Craig, M. (2016, July). Factors for success in online CS1. In Proceedings of the 2016 acm conference on innovation and technology in computer science education (pp. 320--325).Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Codio, https://www.codio.com/, accessed 2022.Google ScholarGoogle Scholar
  6. Edgcomb, A. and Vahid, F., 2015, October. How many points should be awarded for interactive textbook reading assignments?. In 2015 IEEE Frontiers in Education Conference (FIE) (pp. 1--4). IEEE.Google ScholarGoogle Scholar
  7. Fire, M., Katz, G., Elovici, Y., Shapira, B., & Rokach, L. (2012). Predicting student exam's scores by analyzing social network data. In Active Media Technology (pp. 584--595). Springer Berlin Heidelberg. [7] Fowler, G. C., & Glorfeld, L. W. (1981). Predicting aptitude in introductoryGoogle ScholarGoogle ScholarDigital LibraryDigital Library
  8. Fowler, G. C., & Glorfeld, L. W. (1981). Predicting aptitude in introductory computing: A classification model. AEDS Journal, 14(2), 96--109.Google ScholarGoogle ScholarCross RefCross Ref
  9. Gordanier, J., Hauk, W., & Sankaran, C. (2019). Early intervention in college classes and improved student outcomes. Economics of Education Review, 72, 23--29.Google ScholarGoogle ScholarCross RefCross Ref
  10. Gordon, C., Lysecky, R. and Vahid, F., 2021. The rise of the zyLab program auto-grader in introductory CS courses. zyBooks.com, White Paper.Google ScholarGoogle Scholar
  11. Harrington, B. (2017, May). Get On Track, Stay On Track: Combining Early Intervention and Study Skills in a CS1 Seminar. In Proceedings of the 22nd Western Canadian Conference on Computing Education (pp. 1--1).Google ScholarGoogle Scholar
  12. Jiang, S., Warschauer, M., Williams, A. E., O?Dowd, D., & Schenke, K. (2014). Predicting MOOC Performance with Week 1 Behavior. In Proceedings of the 7th International Conference on Educational Data Mining.Google ScholarGoogle Scholar
  13. Kloft, M., Stiehler, F., Zheng, Z., & Pinkwart, N. (2014). Predicting MOOC Dropout over Weeks Using Machine Learning Methods. EMNLP 2014, 60Google ScholarGoogle ScholarCross RefCross Ref
  14. Krause-Levy, S., Porter, L., Simon, B., & Alvarado, C. (2020, February). Investigating the impact of employing multiple interventions in a cs1 course. In Proceedings of the 51st ACM Technical Symposium on Computer Science Education (pp. 1082--1088).Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Liao, S. N., Zingaro, D., Laurenzano, M. A., Griswold, W. G., & Porter, L. (2016, August). Lightweight, early identification of at-risk CS1 students. In Proceedings of the 2016 acm conference on international computing education research (pp. 123--131).Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Liao, S. N., Valstar, S., Thai, K., Alvarado, C., Zingaro, D., Griswold, W. G., & Porter, L. (2019, July). Behaviors of higher and lower performing students in CS1. In Proceedings of the 2019 ACM Conference on Innovation and Technology in Computer Science Education (pp. 196--202).Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Luxton-Reilly, A., Ajanovski, V. V., Fouh, E., Gonsalvez, C., Leinonen, J., Parkinson, J., Poole, M., and Thota, N. (2019). Pass rates in introductory programming and in other stem disciplines. In Proceedings of the Working Group Reports on Innovation and Technology in Computer Science Education (pp. 53--71).Google ScholarGoogle Scholar
  18. Navstem CS1 report, https://navstem.com/, 2021.Google ScholarGoogle Scholar
  19. Quille, K., & Bergin, S. (2019). CS1: how will they do? How can we help? A decade of research and practice. Computer Science Education, 29(2--3), 254--282.Google ScholarGoogle Scholar
  20. Quille, K., & Bergin, S. (2018, July). Programming: predicting student success early in CS1. a re-validation and replication study. In Proceedings of the 23rd annual ACM conference on innovation and technology in computer science education (pp. 15--20).Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Rountree, N., Rountree, J., & Robins, A. (2002). Predictors of success and failure in a CS1 course. ACM SIGCSE Bulletin, 34(4), 121--124.Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Rountree, N., Rountree, J., Robins, A., & Hannah, R. (2004). Interacting factors that predict success and failure in a CS1 course. ACM SIGCSE Bulletin, 36(4), 101--104.Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Rosé, C. P., & Siemens, G. (2014). Shared task on prediction of dropout over time in massively open online courses. EMNLP 2014, 39.Google ScholarGoogle ScholarCross RefCross Ref
  24. Runestone Academy, https://runestone.academy, accessed 2022.Google ScholarGoogle Scholar
  25. Sharkey, M., & Sanders, R. (2014). A Process for Predicting MOOC Attrition.EMNLP 2014, 50.Google ScholarGoogle Scholar
  26. Ventura Jr, P. R. (2005). Identifying predictors of success for an objects-first CS1.Google ScholarGoogle ScholarCross RefCross Ref
  27. Ventura, P., & Ramamurthy, B. (2004). Wanted: CS1 students. No experience required. ACM SIGCSE Bulletin, 36(1), 240--244.Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. A. Vihavainen. Predicting students' performance in an introductory programming course using data from students' own programming process. In IEEE 13th International Conference on Advanced Learning Technologies, pages 498--499, 2013.Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Watson, C., Li, F. W., & Godwin, J. L. (2014, March). No tests required: comparing traditional and dynamic predictors of programming success. In Proceedings of the 45th ACM technical symposium on Computer science education (pp. 469--474).Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. West, M., Herman, G.L. and Zilles, C., 2015, June. Prairielearn: Mastery-based online problem solving with adaptive scoring and recommendations driven by machine learning. In 2015 ASEE Annual Conference & Exposition (pp. 26--1238). https://www.prairielearn.org/, accessed 2022.Google ScholarGoogle Scholar
  31. Wilson, B. C., & Shrock, S. (2001, February). Contributing to success in an introductory computer science course: a study of twelve factors. In ACM SIGCSE Bulletin (Vol. 33, No. 1, pp. 184--188). ACM.Google ScholarGoogle Scholar
  32. Zhang, Y., Fei, Q., Quddus, M., & Davis, C. (2014). An examination of the impact of early intervention on learning outcomes of at-risk students. Research in Higher Education Journal, 26.Google ScholarGoogle Scholar
  33. zyBooks.com, accessed 2022.Google ScholarGoogle Scholar

Index Terms

  1. Ultra-Lightweight Early Prediction of At-Risk Students in CS1

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      SIGCSE 2023: Proceedings of the 54th ACM Technical Symposium on Computer Science Education V. 1
      March 2023
      1481 pages
      ISBN:9781450394314
      DOI:10.1145/3545945

      Copyright © 2023 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 3 March 2023

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      Overall Acceptance Rate1,595of4,542submissions,35%

      Upcoming Conference

      SIGCSE Virtual 2024

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader