Unleashing the Power of Predictive Analytics to Identify At-Risk Students in Computer Science

  • Original research
Technology, Knowledge and Learning

Abstract

Predicting the academic performance of students majoring in computer science has long been a significant research area in computing education. Previous studies have shown that accurately predicting students’ early-stage performance makes it possible to identify low-performing students and take corrective action to improve their performance. Moreover, combining machine learning algorithms with predictive analytics has proven both feasible and meaningful. The traditional approach of supporting students without uncovering the root causes of poor performance has given way to efforts to improve the quality of educational processes for students, teachers, and stakeholders. This study therefore employed predictive analytics to develop an early-warning prediction model using performance data from a computer science degree program at a public institution. Predictive models built from our data analysis showed that low-, medium-, and high-performing students could be predicted with an accuracy of 88% using only the grades of the courses they took in the second year, and with an accuracy of 96% when all course grades were used. The courses that matter most in determining students’ overall performance were also analyzed. By employing a multi-method approach, using a large dataset spanning four academic years, and including a diverse sample of 430 students, our study offers researchers, designers, and computer science educators a robust foundation for understanding and predicting student performance. Its enhanced generalizability and its implications for educational practice make the study a valuable contribution to the field and pave the way for further advances in predictive analytics.
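The abstract does not specify the algorithms, features, or class boundaries behind the reported accuracies, so the following is only a minimal sketch of the general idea: classify students into low, medium, and high performers from course grades, then compare accuracy when only a subset of courses is used against accuracy with all courses. Everything in it is an assumption made for illustration: the data are synthetic, the nearest-centroid classifier is a stand-in rather than the study's method, and the course counts, cut-offs, and names such as `SECOND_YEAR` are hypothetical.

```python
import random
from statistics import mean

random.seed(0)  # deterministic synthetic data

N_STUDENTS = 300           # hypothetical; not the study's 430 students
N_COURSES = 12             # hypothetical number of courses
SECOND_YEAR = range(3, 6)  # hypothetical indices of "second-year" courses


def make_student():
    """Return (grades, label) for one synthetic student."""
    ability = random.gauss(60, 12)  # latent ability drives every grade
    grades = [min(100.0, max(0.0, random.gauss(ability, 8)))
              for _ in range(N_COURSES)]
    overall = mean(grades)
    # Hypothetical cut-offs for the three performance bands.
    label = "low" if overall < 55 else "medium" if overall < 68 else "high"
    return grades, label


def fit_centroids(rows, cols):
    """Nearest-centroid 'training': mean grade vector per class."""
    by_label = {}
    for grades, label in rows:
        by_label.setdefault(label, []).append([grades[c] for c in cols])
    return {lab: [mean(col) for col in zip(*vecs)]
            for lab, vecs in by_label.items()}


def predict(centroids, grades, cols):
    """Assign the class whose centroid is closest in squared distance."""
    x = [grades[c] for c in cols]
    return min(centroids,
               key=lambda lab: sum((a - b) ** 2
                                   for a, b in zip(x, centroids[lab])))


def accuracy(train, test, cols):
    """Fraction of held-out students whose band is predicted correctly."""
    centroids = fit_centroids(train, cols)
    hits = sum(predict(centroids, g, cols) == lab for g, lab in test)
    return hits / len(test)


students = [make_student() for _ in range(N_STUDENTS)]
train, test = students[:200], students[200:]  # simple hold-out split

acc_second_year = accuracy(train, test, list(SECOND_YEAR))
acc_all_courses = accuracy(train, test, list(range(N_COURSES)))
print(f"second-year grades only: {acc_second_year:.2f}")
print(f"all course grades:       {acc_all_courses:.2f}")
```

The numbers this prints describe the synthetic data only and say nothing about the 88% and 96% figures reported above. A real replication would need the study's dataset, its actual class definitions, and a resampling scheme such as cross-validation instead of a single hold-out split.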

Author information

Corresponding author

Correspondence to Gökhan Akçapınar.

Appendix I

Table 3 Courses taken by students and the courses attributed to Dataset 1, Dataset 2, and Dataset 3

About this article

Cite this article

Qushem, U.B., Oyelere, S.S., Akçapınar, G. et al. Unleashing the Power of Predictive Analytics to Identify At-Risk Students in Computer Science. Tech Know Learn 29, 1385–1400 (2024). https://doi.org/10.1007/s10758-023-09674-6

  • DOI: https://doi.org/10.1007/s10758-023-09674-6
