Abstract
With the rapid development of educational data mining and learning analytics, this study tries to make sense of education data and improve teachers’ competence and teaching experience. In recent years, massive open online courses (MOOCs) have become the first choice of online learning for tens of millions of people around the world. However, the dropout rates for MOOCs are high. The goal of dropout prediction is to predict whether learners will exhibit learning behavior in several consecutive days in the future. Therefore, in this study, we consider the correlation information of learners’ learning behaviors for several consecutive days. Through the in-depth statistical analysis of learners’ learning behavior, it is found that learners’ learning behavior on the next day is similar to that of the previous day. Based on this characteristic, we propose a Lie group region covariance matrix to represent the local correlation information of learning behavior and construct a convolutional neural network model with a multidilation pooling module to extract the local correlation high-level features of learning behavior for dropout prediction. In addition, extensive experiments show that the local correlation of learners’ learning behavior cannot be ignored, which is fully considered in our model. Compared with the existing methods, our method achieves the best experimental results in accuracy, F-measure, precision, and recall, which is better than the current methods.
Similar content being viewed by others
References
Crues RW (2018) How do gender, learning goals, and forum participation predict persistence in a computer science MOOC? ACM Trans Comput Educ (TOCE) 18(4):1–14
Ramesh A et al (2014) Learning latent engagement patterns of students in online courses. In: Twenty-eighth AAAI conference on artificial intelligence, pp 1–7
He J et al (2015) Identifying at-risk students in massive open online courses. In: Twenty-ninth AAAI conference on artificial intelligence, pp 1749–1755
Peña-Ayala A (2014) Educational data mining: a survey and a data mining-based analysis of recent works. Expert Syst Appl 41(4):1432–1462
Vahdat M et al (2015) Advances in learning analytics and educational data mining. In: Proceedings of of ESANN2015, pp 297–306
Dalipi F, Imran AS, Kastrati Z (2018) MOOC dropout prediction using machine learning techniques: review and research challenges. In: 2018 IEEE global engineering education conference (EDUCON). IEEE, pp 1007–1014
Kloft M et al (2014) Predicting MOOC dropout over weeks using machine learning methods. In: Proceedings of the EMNLP 2014 workshop on analysis of large scale social interaction in MOOCs, pp 60–65
Taylor C, Veeramachaneni K, O’Reilly U-M (2014) Likely to stop? predicting stopout in massive open online courses. arXiv preprint arXiv:1408.3382
Liang J et al (2016) Big data application in education: dropout prediction in edx MOOCs. In: 2016 IEEE second international conference on multimedia big data (BigMM). IEEE, pp 440–443
Ramesh A et al (2013) Modeling learner engagement in MOOCs using probabilistic soft logic. In: NIPS workshop on data driven education, pp 21–62
Li W et al (2016) Dropout prediction in MOOCs using behavior features and multi-view semi-supervised learning. In: 2016 international joint conference on neural networks (IJCNN). IEEE, pp 3130–3137
Balakrishnan G, Coetzee D (2013) Predicting student retention in massive open online courses using hidden markov models. Electrical Engineering and Computer Sciences University of California at Berkeley, vol 53, pp 57–58
Fei M, Yeung D-Y (2015) Temporal models for predicting student dropout in massive open online courses. In: 2015 IEEE international conference on data mining workshop (ICDMW). IEEE, pp 256–263
Whitehill J et al (2017) Delving deeper into MOOC student dropout prediction. arXiv preprint arXiv:1702.06404
Wang W, Yu H, Miao C (2017) Deep model for dropout prediction in MOOCs. In: Proceedings of the 2nd international conference on crowd science and engineering, pp 26–32
Márquez-Vera C et al (2013) Predicting student failure at school using genetic programming and different data mining approaches with high dimensional and imbalanced data. Appl Intell 38(3):315–330
Moon S, Potdar S, Martin L (2014) Identifying student leaders from MOOC discussion forums through language influence. In: Proceedings of the EMNLP 2014 workshop on analysis of large scale social interaction in MOOCs, p 15
Kizilcec RF, Piech C, Schneider E (2013) Deconstructing disengagement: analyzing learner subpopulations in massive open online courses. In: Proceedings of the third international conference on learning analytics and knowledge, pp 170–179
Dekker GW, Pechenizkiy M, Vleeshouwers JM (2009) Predicting students drop out: a case study. International Working Group on educational data mining
Xu Y, Bo D, Zhang L (2020) Assessing the threat of adversarial examples on deep neural networks for remote sensing scene classification: attacks and defenses. IEEE Trans Geosci Remote Sens 59(2):1604–1617
Xu Y et al (2018) Hyperspectral image classification via a random patches network. ISPRS J Photogramm Remote Sens 142:344–357
Xu Y, Bo D, Zhang L (2019) Beyond the patchwise classification: spectral-spatial fully convolutional networks for hyperspectral image classification. IEEE Trans Big Data 6(3):492–506
Tuzel O, Porikli F, Meer P (2006) Region covariance: a fast descriptor for detection and classification. In: European conference on computer vision. Springer, Berlin, pp 589–600
Xu C, Zhu G (2021) Intelligent manufacturing Lie group machine learning: real-time and efficient inspection system based on fog computing. J Intell Manuf 32(1):237–249
Xu C, Zhu G, Shu J (2020) Robust joint representation of intrinsic mean and kernel function of Lie group for remote sensing scene classification. IEEE Geosci Remote Sens Lett 18(5):796–800
Xu C, Zhu G, Shu J (2021) A lightweight intrinsic mean for remote sensing classification with Lie group kernel function. IEEE Geosci Remote Sens Lett 18(10):1741–1745
Xu C, Zhu G, Shu J (2022) A lightweight and robust Lie group-convolutional neural networks joint representation for remote sensing scene classification. IEEE Trans Geosci Remote Sens 60:1–15
Krizhevsky A, Sutskever I, Hinton GE (2012) Imagenet classification with deep convolutional neural networks. In: Advances in neural information processing systems, pp 1097–1105
He K et al (2015) Spatial pyramid pooling in deep convolutional networks for visual recognition. IEEE Trans Pattern Anal Mach Intell 37(9):1904–1916
XuetangX http://www.xuetangx.com/ (2019)
Kddcup2015 https://biendata.com/competition/kddcup 2015/ (2015)
LeCun Y, Bengio Y, Hinton G (2015) Deep learning. Nature 521(7553):436–444
Sivic J, Zisserman A (2003) Video Google:a text retrieval approach to object matching in videos. In: IEEE international conference on computer vision, vol 3. IEEE Computer Society, pp 1470–1470
Chang C-C, Lin C-J (2011) LIBSVM: a library for support vector machines. ACM Trans Intell Syst Technol (TIST) 2(3):1–27
Wen Y et al (2019) Consideration of the local correlation of learning behaviors to predict dropouts from MOOCs. Tsinghua Sci Technol 25(3):336–347
Acknowledgements
The authors would like to thank two anonymous reviewers for carefully reviewing this paper and giving valuable comments to improve this paper.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
This work is supported by the 13th Five-Year Plan for Education science in China. The Project Number is BIA180187.
Rights and permissions
About this article
Cite this article
Xu, C., Zhu, G., Ye, J. et al. Educational Data Mining: Dropout Prediction in XuetangX MOOCs. Neural Process Lett 54, 2885–2900 (2022). https://doi.org/10.1007/s11063-022-10745-5
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11063-022-10745-5