Skip to main content

Advertisement

Log in

Educational Data Mining: Dropout Prediction in XuetangX MOOCs

  • Published:
Neural Processing Letters Aims and scope Submit manuscript

Abstract

With the rapid development of educational data mining and learning analytics, this study tries to make sense of education data and improve teachers’ competence and teaching experience. In recent years, massive open online courses (MOOCs) have become the first choice of online learning for tens of millions of people around the world. However, the dropout rates for MOOCs are high. The goal of dropout prediction is to predict whether learners will exhibit learning behavior in several consecutive days in the future. Therefore, in this study, we consider the correlation information of learners’ learning behaviors for several consecutive days. Through the in-depth statistical analysis of learners’ learning behavior, it is found that learners’ learning behavior on the next day is similar to that of the previous day. Based on this characteristic, we propose a Lie group region covariance matrix to represent the local correlation information of learning behavior and construct a convolutional neural network model with a multidilation pooling module to extract the local correlation high-level features of learning behavior for dropout prediction. In addition, extensive experiments show that the local correlation of learners’ learning behavior cannot be ignored, which is fully considered in our model. Compared with the existing methods, our method achieves the best experimental results in accuracy, F-measure, precision, and recall, which is better than the current methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8

Similar content being viewed by others

References

  1. Crues RW (2018) How do gender, learning goals, and forum participation predict persistence in a computer science MOOC? ACM Trans Comput Educ (TOCE) 18(4):1–14

    Article  Google Scholar 

  2. Ramesh A et al (2014) Learning latent engagement patterns of students in online courses. In: Twenty-eighth AAAI conference on artificial intelligence, pp 1–7

  3. He J et al (2015) Identifying at-risk students in massive open online courses. In: Twenty-ninth AAAI conference on artificial intelligence, pp 1749–1755

  4. Peña-Ayala A (2014) Educational data mining: a survey and a data mining-based analysis of recent works. Expert Syst Appl 41(4):1432–1462

    Article  Google Scholar 

  5. Vahdat M et al (2015) Advances in learning analytics and educational data mining. In: Proceedings of of ESANN2015, pp 297–306

  6. Dalipi F, Imran AS, Kastrati Z (2018) MOOC dropout prediction using machine learning techniques: review and research challenges. In: 2018 IEEE global engineering education conference (EDUCON). IEEE, pp 1007–1014

  7. Kloft M et al (2014) Predicting MOOC dropout over weeks using machine learning methods. In: Proceedings of the EMNLP 2014 workshop on analysis of large scale social interaction in MOOCs, pp 60–65

  8. Taylor C, Veeramachaneni K, O’Reilly U-M (2014) Likely to stop? predicting stopout in massive open online courses. arXiv preprint arXiv:1408.3382

  9. Liang J et al (2016) Big data application in education: dropout prediction in edx MOOCs. In: 2016 IEEE second international conference on multimedia big data (BigMM). IEEE, pp 440–443

  10. Ramesh A et al (2013) Modeling learner engagement in MOOCs using probabilistic soft logic. In: NIPS workshop on data driven education, pp 21–62

  11. Li W et al (2016) Dropout prediction in MOOCs using behavior features and multi-view semi-supervised learning. In: 2016 international joint conference on neural networks (IJCNN). IEEE, pp 3130–3137

  12. Balakrishnan G, Coetzee D (2013) Predicting student retention in massive open online courses using hidden markov models. Electrical Engineering and Computer Sciences University of California at Berkeley, vol 53, pp 57–58

  13. Fei M, Yeung D-Y (2015) Temporal models for predicting student dropout in massive open online courses. In: 2015 IEEE international conference on data mining workshop (ICDMW). IEEE, pp 256–263

  14. Whitehill J et al (2017) Delving deeper into MOOC student dropout prediction. arXiv preprint arXiv:1702.06404

  15. Wang W, Yu H, Miao C (2017) Deep model for dropout prediction in MOOCs. In: Proceedings of the 2nd international conference on crowd science and engineering, pp 26–32

  16. Márquez-Vera C et al (2013) Predicting student failure at school using genetic programming and different data mining approaches with high dimensional and imbalanced data. Appl Intell 38(3):315–330

    Article  Google Scholar 

  17. Moon S, Potdar S, Martin L (2014) Identifying student leaders from MOOC discussion forums through language influence. In: Proceedings of the EMNLP 2014 workshop on analysis of large scale social interaction in MOOCs, p 15

  18. Kizilcec RF, Piech C, Schneider E (2013) Deconstructing disengagement: analyzing learner subpopulations in massive open online courses. In: Proceedings of the third international conference on learning analytics and knowledge, pp 170–179

  19. Dekker GW, Pechenizkiy M, Vleeshouwers JM (2009) Predicting students drop out: a case study. International Working Group on educational data mining

  20. Xu Y, Bo D, Zhang L (2020) Assessing the threat of adversarial examples on deep neural networks for remote sensing scene classification: attacks and defenses. IEEE Trans Geosci Remote Sens 59(2):1604–1617

    Article  Google Scholar 

  21. Xu Y et al (2018) Hyperspectral image classification via a random patches network. ISPRS J Photogramm Remote Sens 142:344–357

    Article  Google Scholar 

  22. Xu Y, Bo D, Zhang L (2019) Beyond the patchwise classification: spectral-spatial fully convolutional networks for hyperspectral image classification. IEEE Trans Big Data 6(3):492–506

    Article  MathSciNet  Google Scholar 

  23. Tuzel O, Porikli F, Meer P (2006) Region covariance: a fast descriptor for detection and classification. In: European conference on computer vision. Springer, Berlin, pp 589–600

  24. Xu C, Zhu G (2021) Intelligent manufacturing Lie group machine learning: real-time and efficient inspection system based on fog computing. J Intell Manuf 32(1):237–249

    Article  Google Scholar 

  25. Xu C, Zhu G, Shu J (2020) Robust joint representation of intrinsic mean and kernel function of Lie group for remote sensing scene classification. IEEE Geosci Remote Sens Lett 18(5):796–800

    Article  Google Scholar 

  26. Xu C, Zhu G, Shu J (2021) A lightweight intrinsic mean for remote sensing classification with Lie group kernel function. IEEE Geosci Remote Sens Lett 18(10):1741–1745

  27. Xu C, Zhu G, Shu J (2022) A lightweight and robust Lie group-convolutional neural networks joint representation for remote sensing scene classification. IEEE Trans Geosci Remote Sens 60:1–15

  28. Krizhevsky A, Sutskever I, Hinton GE (2012) Imagenet classification with deep convolutional neural networks. In: Advances in neural information processing systems, pp 1097–1105

  29. He K et al (2015) Spatial pyramid pooling in deep convolutional networks for visual recognition. IEEE Trans Pattern Anal Mach Intell 37(9):1904–1916

    Article  Google Scholar 

  30. XuetangX http://www.xuetangx.com/ (2019)

  31. Kddcup2015 https://biendata.com/competition/kddcup 2015/ (2015)

  32. LeCun Y, Bengio Y, Hinton G (2015) Deep learning. Nature 521(7553):436–444

    Article  Google Scholar 

  33. Sivic J, Zisserman A (2003) Video Google:a text retrieval approach to object matching in videos. In: IEEE international conference on computer vision, vol 3. IEEE Computer Society, pp 1470–1470

  34. Chang C-C, Lin C-J (2011) LIBSVM: a library for support vector machines. ACM Trans Intell Syst Technol (TIST) 2(3):1–27

    Article  Google Scholar 

  35. Wen Y et al (2019) Consideration of the local correlation of learning behaviors to predict dropouts from MOOCs. Tsinghua Sci Technol 25(3):336–347

    Article  Google Scholar 

Download references

Acknowledgements

The authors would like to thank two anonymous reviewers for carefully reviewing this paper and giving valuable comments to improve this paper.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Chengjun Xu.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This work is supported by the 13th Five-Year Plan for Education science in China. The Project Number is BIA180187.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Xu, C., Zhu, G., Ye, J. et al. Educational Data Mining: Dropout Prediction in XuetangX MOOCs. Neural Process Lett 54, 2885–2900 (2022). https://doi.org/10.1007/s11063-022-10745-5

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11063-022-10745-5

Keywords

Navigation