Skip to main content

Abstract

Early prediction of student grades is important for identifying students at-risk of failing or students not achieving their goals, and providing timely interventions. There is a need to develop methods that are both accurate and interpretable, to help teachers understand student behaviour and take appropriate actions. In this paper, we present a decision tree based approach for predicting students’ final performance in online computer programming courses for high school students, based on log data from the first part of the course. We define and extract suitable features characterising student behaviour and show that it is possible to build compact and accurate decision trees, achieving an overall accuracy of 76% and 82–83% at one-third and mid-course respectively. We provide insights about the important factors affecting student performance and how the rules can be used to improve the student learning outcomes.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

Notes

  1. 1.

    https://grokacademy.org/challenge.

References

  1. Asif, R., Merceron, A., Ali, S.A., Haider, N.G.: Analyzing undergraduate students’ performance using educational data mining. Comput. Educ. 113, 177–194 (2017)

    Article  Google Scholar 

  2. Dsilva, V., Schleiss, J., Stober, S.: Trustworthy academic risk prediction with explainable boosting machines. In: Wang, N., Rebolledo-Mendez, G., Matsuda, N., Santos, O.C., Dimitrova, V. (eds.) AIED 2023. LNCS, vol. 13916, pp. 463–475. Springer, Cham (2023). https://doi.org/10.1007/978-3-031-36272-9_38

  3. Koprinska, I., Stretton, J., Yacef, K.: Predicting student performance from multiple data sources. In: Conati, C., Heffernan, N., Mitrovic, A., Verdejo, M. (eds.) AIED 2015. LNCS, vol. 9112, pp. 678–681. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-19773-9_90

  4. Koprinska, I., Stretton, J., Yacef, K.: Students at risk: detection and remediation. In: Educational Data Mining, pp. 512–515 (2015)

    Google Scholar 

  5. McBroom, J., Paassen, B., Jeffries, B., Koprinska, I., Yacef, K.: Progress networks as a tool for analysing student programming difficulties. In: Australasian Computing Education Conference, pp. 158–167 (2021)

    Google Scholar 

  6. Pereira, F.D., et al.: Early dropout prediction for programming courses supported by online judges. In: Isotani, S., Millán, E., Ogan, A., Hastings, P., McLaren, B., Luckin, R. (eds.) AIED 2019. LNCS (LNAI), vol. 11626, pp. 67–72. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-23207-8_13

    Chapter  Google Scholar 

  7. Romero, C., Espejo, P.G., Zafra, A., Romero, J.R., Ventura, S.: Web usage mining for predicting final marks of students that use Moodle courses. Comput. Appl. Eng. Educ. 21(1), 135–146 (2013)

    Article  Google Scholar 

  8. Zhang, V., Jefffries, B., Koprinska, I.: Predicting progress in a large-scale online programming course. In: Wang, N., Rebolledo-Mendez, G., Matsuda, N., Santos, O.C., Dimitrova, V. (eds.) AIED 2023. LNCS, vol. 13916, pp. 810–816. Springer, Cham (2023). https://doi.org/10.1007/978-3-031-36272-9_76

Download references

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Ziwei Wang , Irena Koprinska or Bryn Jeffries .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Wang, Z., Koprinska, I., Jeffries, B. (2024). Interpretable Methods for Early Prediction of Student Performance in Programming Courses. In: Olney, A.M., Chounta, IA., Liu, Z., Santos, O.C., Bittencourt, I.I. (eds) Artificial Intelligence in Education. Posters and Late Breaking Results, Workshops and Tutorials, Industry and Innovation Tracks, Practitioners, Doctoral Consortium and Blue Sky. AIED 2024. Communications in Computer and Information Science, vol 2151. Springer, Cham. https://doi.org/10.1007/978-3-031-64312-5_14

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-64312-5_14

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-64311-8

  • Online ISBN: 978-3-031-64312-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics