Skip to main content

Advertisement

Log in

Predictive modelling and analytics of students’ grades using machine learning algorithms

  • Published:
Education and Information Technologies Aims and scope Submit manuscript

Abstract

The outbreak of COVID-19 has caused significant disruption in all sectors and industries around the world. To tackle the spread of the novel coronavirus, the learning process and the modes of delivery had to be altered. Most courses are delivered traditionally with face-to-face or a blended approach through online learning platforms. In addition, researchers and educational specialists around the globe always had a keen interest in predicting a student’s performance based on the student’s information such as previous exam results obtained and experiences. With the upsurge in using online learning platforms, predicting the student’s performance by including their interactions such as discussion forums could be integrated to create a predictive model. The aims of the research are to provide a predictive model to forecast students’ performance (grade/engagement) and to analyse the effect of online learning platform’s features. The model created in this study made use of machine learning techniques to predict the final grade and engagement level of a learner. The quantitative approach for student’s data analysis and processing proved that the Random Forest classifier outperformed the others. An accuracy of 85% and 83% were recorded for grade and engagement prediction respectively with attributes related to student profile and interaction on a learning platform.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16
Fig. 17
Fig. 18
Fig. 19
Fig. 20

Similar content being viewed by others

Explore related subjects

Discover the latest articles, news and stories from top researchers in related subjects.

Data availability statement

The data that support the findings of this study are available from the corresponding author upon reasonable request.

References

  • Adem, A., Çakıt, E., & Dağdeviren, M. (2022). Selection of suitable distance education platforms based on human-computer interaction criteria under fuzzy environment. Neural Computing and Applications, 1–13.

  • Adnan, M., Habib, A., Ashraf, J., Mussadiq, S., Raza, A. A., Abid, M., Bashir, M., & Khan, S. U. (2021). Predicting at-Risk Students at Different Percentages of Course Length for Early Intervention Using Machine Learning Models. IEEE Access, [online] 9, pp.7519–7539. Available at: https://ieeexplore.ieee.org/document/9314000 Accessed 10 December 2021.

  • Al-Shabandar, R., Hussain, A., Laws, A., Keight, R., Lunn, J., & Radi, N. (2017). ‘Machine learning approaches to predict learning outcomes in Massive open online courses’, Proceedings of the International Joint Conference on Neural Networks.

  • Alyahyan, E., & Düştegör, D. (2020). Predicting academic success in higher education: literature review and best practices. International Journal of Educational Technology in Higher Education, 17(1).

  • Ashmore, R., Calinescu, R., & Paterson, C. (2019). Assuring the machine learning lifecycle: Desiderata, methods, and challenges.

  • Bakki, A., Oubahssi, L., Cherkaoui, C., & George, S. (2015). Motivation and engagement in MOOCs: How to increase learning motivation by adapting pedagogical scenarios? Design for Teaching and Learning in a Networked World, pp. 556–559.

  • Beysolow, Y. (2017). Introduction to deep learning using R: A step-by-step guide to learning and implementing deep learning models using R. Ca Apress.

    Book  Google Scholar 

  • Bisong, E. (2019). Building machine learning and deep learning models on google cloud platform: A comprehensive guide for beginners (1st. ed.). Apress.

  • Bujang, S. D. A., Selamat, A., Ibrahim, R., Krejcar, O., Herrera-Viedma, E., Fujita, H., & Ghani, N. AMd. (2021). Multiclass prediction model for student grade prediction using machine learning. IEEE Access, 9, 95608–95621.

    Article  Google Scholar 

  • Chilukuri, K. C. (2020). A novel framework for active learning in engineering education mapped to course outcomes. Procedia Computer Science, 172, 28–33.

    Article  Google Scholar 

  • Cocea, M., & Weibelzahl, S. (2011). Disengagement detection in online learning: Validation studies and perspectives. IEEE Transactions on Learning Technologies, [online] 4(2), pp.114–124. Available at: https://ieeexplore.ieee.org/abstract/document/5518758 [Accessed 26 Dec. 2021].

  • Cohen, E., & Nycz, M. (2006). Learning objects and e-learning: An informing science perspective. Interdisciplinary Journal of e-Skills and Lifelong Learning, 2, 023–034.

    Article  Google Scholar 

  • Coman, C., Țîru, L. G., Meseșan-Schmitz, L., Stanciu, C., & Bularca, M. C. (2020). Online teaching and learning in higher education during the coronavirus pandemic: Students’ perspective. Sustainability, 12(24), 10367.

    Article  Google Scholar 

  • Costa, E. B., Fonseca, B., Santana, M. A., de Araújo, F. F., & Rego, J. (2017). Evaluating the effectiveness of educational data mining techniques for early prediction of students’ academic failure in introductory programming courses. Computers in Human Behavior, 73, 247–256.

    Article  Google Scholar 

  • Daniel, B. (2014). Big Data and analytics in higher education: Opportunities and challenges. British Journal of Educational Technology, 46(5), 904–920.

    Article  Google Scholar 

  • Deepa, B. G., & Senthil, S. (2020). “Constructive effect of ranking optimal features using random forest, support vector machine and naïve bayes forbreast cancer diagnosis.” Big Data Analytics and Intelligence: A Perspective for Health Care, First Edition, Emerald Insight.

  • Dewan, M. A. A., Murshed, M., & Lin, F. (2019). Engagement detection in online learning: a review. Smart Learning Environments, 6(1). https://doi.org/10.1186/s40561-018-0080-z

  • Dhawan, S. (2020). Online learning: A Panacea in the time of COVID-19 crisis. Journal of Educational Technology Systems, 49(1), 5–22.

    Article  Google Scholar 

  • Di Franco, G., & Santurro, M. (2020). Machine learning, artificial neural networks and social research. Quality & Quantity, 55(3), 1007–1025.

    Article  Google Scholar 

  • Gil-García, R., & Pons-Porrata, A. (2006). A new nearest neighbor rule for text categorization.

  • Hafeez, M. A., Rashid, M., Tariq, H., Abideen, Z. U., Alotaibi, S. S., & Sinky, M. H. (2021). Performance improvement of decision tree: A robust classifier using Tabu search algorithm. Applied Sciences, 11(15), 6728.

    Article  Google Scholar 

  • Hall, P., Park, B., & Samworth, R. (2008). ‘Choice of neighbor order in nearest-neighbor classification’, The Annals of Statistics, 36.

  • Jayashree, G., & Priya, C. (2019). Design of visibility for order lifecycle using datawarehouse. International Journal of Engineering and Advanced Technology, 8(6), 4700–4707.

    Article  Google Scholar 

  • Jongbo, O. C. (2014). The role of research design in a purpose driven enquiry. Review of Public Administration and Management, 3(6), 87–94.

    Google Scholar 

  • Kamiri, J. & Mariga, G. (2021). Research methods in machine learning: A content analysis. international journal of computer and information technology (pp. 2279-0764)

  • Kimball, R., & Ross, M. (2013). The data warehouse toolkit (3rd ed.). Wiley, Cop.

    Google Scholar 

  • Ko, C. Y., & Leu, F.-Y. (2021). Examining successful attributes for undergraduate students by applying machine learning techniques. IEEE Transactions on Education, 64(1), 50–57.

    Article  Google Scholar 

  • Krawczyk, B., Minku, L. L., Gama, J., Stefanowski, J., & Woźniak, M. (2017). Ensemble learning for data stream analysis: A survey. Information Fusion, 37, 132–156.

    Article  Google Scholar 

  • Landset, S., Khoshgoftaar, T. M., Richter, A. N., & Hasanin, T. (2015). A survey of open source tools for machine learning with big data in the Hadoop ecosystem. Journal of Big Data, 2(1).

  • Liu, Z., Yang, C., Rüdian, S., Liu, S., Zhao, L., & Wang, T. (2019). Temporal emotion-aspect modeling for discovering what students are concerned about in online course forums. Interactive Learning Environments, 27(5–6), 598–627. https://doi.org/10.1080/10494820.2019.1610449

    Article  Google Scholar 

  • Liu, S., Liu, S., Liu, Z., Peng, X., & Yang, Z. (2022a). Automated detection of emotional and cognitive engagement in MOOC discussions to predict learning achievement, Computers & Education, Volume 181. ISSN, 104461, 0360–1315. https://doi.org/10.1016/j.compedu.2022.104461

    Article  Google Scholar 

  • Liu, Z., Zhang, N., Peng, X., Liu, S., Yang, Z., Peng, J., Su, Z., & Chen, J. (2022b). Exploring the relationship between social interaction, cognitive processing and learning achievements in a MOOC discussion forum. Journal of Educational Computing Research., 60(1), 132–169. https://doi.org/10.1177/07356331211027300

    Article  Google Scholar 

  • Liu, Z., Kong, X., Liu, S., et al. (2022c). Looking at MOOC discussion data to uncover the relationship between discussion pacings, learners’ cognitive presence and learning achievements. Education and Information Technologies. https://doi.org/10.1007/s10639-022-10943-7

    Article  Google Scholar 

  • Marczyk, G. R., Dematteo, D., & Festinger, D. (2005). Essentials of research design and methodology. John Wiley & Sons.

    MATH  Google Scholar 

  • Michelucci, U. (2019). Advanced applied deep learning. Apress.

    Book  Google Scholar 

  • Moscoso-Zea, O., Paredes-Gualtor, J., & Lujan-Mora, S. (2018). A holistic view of data warehousing in education. IEEE Access, 6, 64659–64673.

    Article  Google Scholar 

  • Moscoso-Zea, O., & Lujan-Mora, S. (2017). Knowledge management in higher education institutions for the generation of organizational knowledge. In 2017 12th Iberian Conference on Information Systems and Technologies (CISTI).

  • Moubayed, A., Injadat, M., Nassif, A., Lutfiyya, H., & Shami, A. (2018). ‘E-Learning: Challenges and research opportunities using machine learning data analytics’, IEEE Access.

  • Mourdi, Y., Sadgal, M., El Kabtane, H., & Berrada Fathi, W. (2019). A machine learning-based methodology to predict learners’ dropout, success or failure in MOOCs. International Journal of Web Information Systems, 15(5), 489–509.

    Article  Google Scholar 

  • Müller, A. C., & Guido, S. (2017). Introduction to machine learning with Python : a guide for data scientists. O’reilly.

    Google Scholar 

  • Nayak, J., Naik, B., & Behera, H. S. (2015). A comprehensive survey on support vector machine in data mining tasks: Applications & challenges. International Journal of Database Theory and Application, 8(1), 169–186.

    Article  Google Scholar 

  • Patel, H. H., & Prajapati, P. (2018). Study and analysis of decision tree based classification algorithms. International Journal of Computer Sciences and Engineering, 6(10).

  • Patil, A. P., Ganesan, K., & Kanavalli, A. (2018). ‘Effective deep learning model to predict student grade point averages’, 2017 IEEE International Conference on Computational Intelligence and Computing Research, ICCIC 2017.

  • Perveen, A. (2016). Synchronous and asynchronous e-language learning: A case study of virtual university of Pakistan. Open Praxis, [online] 8(1). Available at: https://files.eric.ed.gov/fulltext/EJ1093436.pdf [Accessed 20 November 2021].

  • Petrovski, A., Petruseva, S., & Zileska, P. .V. (2015). Multiple Linear regression model for predicting bidding price. Technics Technologies Education Management, 10(1), 386–393.

    Google Scholar 

  • Raschka, S., & Mirjalili, V. (2017). Python machine learning: Machine learning and deep learning with Python, scikit-learn, and TensorFlow (2nd ed.). Packt Publishing.

    Google Scholar 

  • Russell, R. (2018). Machine learning step-by-step guide to implement machine learning algorithms with Python. Editorial: Columbia, Sc.

  • Salmela-Aro, K., & Read, S. (2017). Study engagement and burnout profiles among Finnish higher education students. Burnout Research, 7, 21–28.

    Article  Google Scholar 

  • Sarker, I. H., Kayes, A. S. M., & Watters, P. (2019). Effectiveness analysis of machine learning classification models for predicting personalized context-aware smartphone usage. Journal of Big Data, 6(1).

  • Shahiri, A. M., Husain, W., & Rashid, N. A. (2015). A review on predicting student’s performance using data mining techniques. Procedia Computer Science, 72, 414–422.

    Article  Google Scholar 

  • Shida, N., Osman, S., & Abdullah, A. H. (2019). Students’ perceptions of the use of asynchronous discussion forums, quizzes, and uploaded resources. International Journal of Recent Technology and Engineering, 8(2S9), 704–708.

    Google Scholar 

  • Silvola, A., Näykki, P., Kaveri, A., & Muukkonen, H. (2021). Expectations for supporting student engagement with learning analytics: An academic path perspective. Computers & Education, 168, 104192.

    Article  Google Scholar 

  • Singh, S. K. (2011). Database systems: concepts, design and applications. Dorling Kindersley, India.

    Google Scholar 

  • Sorour, S., Mine, T., Goda, K., & Hirokawa, S. (2015). A predictive model to evaluate student performance. Journal of Information Processing, 23(2).

  • Sungkur, R. K., & Maharaj, M. (2022). A review of intelligent techniques for implementing SMART learning environments. In: Sikdar, B., Prasad Maity, S., Samanta, J., Roy, A. (Eds.), Proceedings of the 3rd International Conference on Communication, Devices and Computing. Lecture Notes in Electrical Engineering, vol 851. Springer, Singapore. https://doi.org/10.1007/978-981-16-9154-6_69

  • Sungkur, R. K., & Maharaj, M. S. (2021). Design and implementation of a SMART Learning environment for the Upskilling of Cybersecurity professionals in Mauritius. Education and Information Technologies, 26, 3175–3201. https://doi.org/10.1007/s10639-020-10408-9

    Article  Google Scholar 

  • Tarik, A., Aissa, H., & Yousef, F. (2021). Artificial intelligence and machine learning to predict student performance during the COVID-19. Procedia Computer Science, 184, 835–840.

    Article  Google Scholar 

  • Theobald, O. (2017). Machine learning for absolute beginners: A plain english introduction (2nd ed.). Scatterplot Press.

    Google Scholar 

  • Uyanık, G. K., & Güler, N. (2013). A Study on Multiple Linear Regression Analysis. Procedia - Social and Behavioral Sciences, [online] 106, Available at: https://www.sciencedirect.com/science/article/pii/S1877042813046429 [Accessed: 20 November 2021].

  • Wabwoba, F., & Ikoha, A. (2011). Information Technology research in developing nations: Major research methods and publication outlets. International Journal of Information and Communication Technology Research., 1(6), 253–257.

    Google Scholar 

  • Wibawa, A. P., Kurniawan, A. C., Murti, D. M. P., Adiperkasa, R. P., Putra, S. M., Kurniawan, S. A., & Nugraha, Y. R. (2019). Naïve Bayes classifier for journal quartile classification. International Journal of Recent Contributions from Engineering, Science & IT (iJES), 7(2), 91.

    Article  Google Scholar 

  • Williamson, B. (2018). The hidden architecture of higher education: building a big data infrastructure for the “smarter university.” International Journal of Educational Technology in Higher Education, 15(1).

  • Yadav, S. K., & Pal, S. (2012). ‘Data Mining: A Prediction for Performance Improvement of Engineering Students using Classification’, 2(2), Available at: http://arxiv.org/abs/1203.3832 [Accessed 6 Nov. 2021].

  • Yin, X. (2021). Construction of student information management system based on data mining and clustering algorithm. Complexity, 2021, 1–11.

    Google Scholar 

  • Zhang, Z. (2016). ‘Introduction to machine learning: K-nearest neighbors’, Annals of Translational Medicine, 4.

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Roopesh Kevin Sungkur.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Badal, Y.T., Sungkur, R.K. Predictive modelling and analytics of students’ grades using machine learning algorithms. Educ Inf Technol 28, 3027–3057 (2023). https://doi.org/10.1007/s10639-022-11299-8

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10639-022-11299-8

Keywords