Abstract
Dropping out of school is a problem in most countries worldwide and leads to many adverse effects on the family and society. Therefore, early predicting the risk of dropping out of high school can help educators have interventions and efficient solutions in reducing the high school dropout rate. In this work, we present a machine learning model to predict the risk of school dropout. This model was built from the dataset, including 10,219 student records with 807 dropouts (7.89%) in high schools in Ca Mau province. The results show that the models Naïve Bayes, Decision Tree with Bagging, Random Forest with Bagging give the best results with Area Under the Curve at 83.01%, 80.95%, 83.16%, and accuracy, precision, recall, f1-score are all over 80%. In addition, we also extracted important features playing a decisive contribution in predicting school dropout, including Grade Point Average, school code, Conduct, Age, and Class.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Adelman, M., Haimovich, F., Ham, A., Vazquez, E.: Predicting school dropout with administrative data: new evidence from Guatemala and Honduras. Educ. Econ. 26(4), 356–372 (2018). https://doi.org/10.1080/09645292.2018.1433127
Aulck, L., Velagapudi, N., Blumenstock, J., West, J.: Predicting student dropout in higher education, pp. 16–20. University of Washington (2016)
Baker, R.S., Berning, A.W., Gowda, S.M., Zhang, S., Hawn, A.: Predicting k-12 dropout. J. Educ. Students Placed Risk (JESPAR) 25(1), 28–54 (2020). https://doi.org/10.1080/10824669.2019.1670065
Le, H.D.: Report No. 1495/BC-SGDDT on July 28, 2020, on assessing the performance of tasks of Ca Mau Department of Education and Training (in Vietnam) in the school year of 2019–2020 (2020)
Del Bonifro, F., Gabbrielli, M., Lisanti, G., Zingaro, S.P.: Student dropout prediction. In: Bittencourt, I.I., Cukurova, M., Muldner, K., Luckin, R., Millán, E. (eds.) AIED 2020. LNCS (LNAI), vol. 12163, pp. 129–140. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-52237-7_11
Goulet, M., Clément, M.-E., Helie, S., Villatte, A.: Longitudinal association between risk profiles, school dropout risk, and substance abuse in adolescence. Child Youth Care Forum 49(5), 687–706 (2020). https://doi.org/10.1007/s10566-020-09550-9
Hassan, M., Mirza, T.: Prediction of school drop outs with the help of machine learning algorithms. GIS Sci. J. 7, 253–263 (2020)
Li, H., Lynch, C., Barnes, T.: Early prediction of course grades: models and feature selection. In: The Proceedings of the 11th International Conference on Educational Data Mining, EDM 2018, pp. 492–495 (2018)
Huynh-Ly, T.N., Thai-Nghe, N.: MyMediaLite: a system for predicting students’s course result using a free recommender system library. In: Information Technology Conference 2013. Can Tho University (2013)
Kabathova, J., Drlik, M.: Towards predicting student’s dropout in university courses using different machine learning techniques. Appl. Sci. 11(7) (2021). https://doi.org/10.3390/app11073130. https://www.mdpi.com/2076-3417/11/7/3130
Kemper, L., Vorhoff, G., Wigger, B.U.: Predicting student dropout: a machine learning approach. Eur. J. High. Educ. 10(1), 28–47 (2020). https://doi.org/10.1080/21568235.2020.1718520
Kiss, B., Nagy, M., Molontay, R., Csabay, B.: Predicting dropout using high school and first-semester academic achievement measures. In: 2019 17th International Conference on Emerging eLearning Technologies and Applications (ICETA), pp. 383–389 (2019). https://doi.org/10.1109/ICETA48886.2019.9040158
Le, T.D., Tran, N.M.T.: Why children in Vietnam drop out of school and what they do after that. Young Lives, Oxford, UK (2013)
Luong, H.H., Thi, T.H.N., Le, A.D., Nguyen, H.T.: Feature selection with random forests predicting metagenome-based disease. In: Solanki, A., Sharma, S.K., Tarar, S., Tomar, P., Sharma, S., Nayyar, A. (eds.) AIS2C2 2021. CCIS, vol. 1434, pp. 254–266. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-82322-1_19
Marbouti, F., Diefes-Dux, H.A., Madhavan, K.: Models for early prediction of at-risk students in a course using standards-based grading. Comput. Educ. 103, 1–15 (2016). https://doi.org/10.1016/j.compedu.2016.09.005. https://www.sciencedirect.com/science/article/pii/S0360131516301634
McNeal, R.B.: High school dropouts: a closer examination of school effects. Soc. Sci. Q. 78(1), 209–222 (1997). http://www.jstor.org/stable/42863687
Mduma, N., Kalegele, K., Machuve, D.: An ensemble predictive model based prototype for student drop-out in secondary schools. J. Inf. Syst. Eng. Manage. 4 (2019). https://doi.org/10.29333/jisem/5893
Márquez, C., Cano, A., Romero, C., Mohammad, A., Fardoun, H., Ventura, S.: Early dropout prediction using data mining: a case study with high school students. Exp. Syst. 33, 107–124 (2016). https://doi.org/10.1111/exsy.12135
Nguyen, P.H., Tian-Wei, S., Masatake, N.: Predicting the student learning outcomes based on the combination of Taylor approximation method and grey models. J. Sci. VNU J. Sci. Educ. Res. 31, 70–83 (2015)
Ogresta, J., Rezo, I., Kožljan, P., Paré, M.H., Ajduković, M.: Why do we drop out? Typology of dropping out of high school. Youth Soc. 53(6), 934–954 (2021). https://doi.org/10.1177/0044118X20918435
Pedregosa, F., et al.: Scikit-learn: machine learning in Python. J. Mach. Learn. Res. 12(85), 2825–2830 (2011). http://jmlr.org/papers/v12/pedregosa11a.html
Rumberger, R.W., Lim, S.A.: Why Students Drop Out of School: A Review of 25 Years of Research. University of California, Santa Barbara (2008)
Stevenson, N., Swain-Bradway, J., LeBeau, B.: Examining high school student engagement and critical factors in dropout prevention. Assess. Effective Interv. 46, 153450841985965 (2019). https://doi.org/10.1177/1534508419859655
Syvertsen, M., Vasantharajan, S., Moth, T., Enger, U., Koht, J.: Predictors of high school dropout, anxiety, and depression in genetic generalized epilepsy. Epilepsia Open 5(4), 611–615 (2020). https://doi.org/10.1002/epi4.12434. https://onlinelibrary.wiley.com/doi/abs/10.1002/epi4.12434
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Dinh-Thanh, N., Thanh-Hai, N., Thi-Ngoc-Diem, P. (2021). Forecasting and Analyzing the Risk of Dropping Out of High School Students in Ca Mau Province. In: Dang, T.K., Küng, J., Chung, T.M., Takizawa, M. (eds) Future Data and Security Engineering. Big Data, Security and Privacy, Smart City and Industry 4.0 Applications. FDSE 2021. Communications in Computer and Information Science, vol 1500. Springer, Singapore. https://doi.org/10.1007/978-981-16-8062-5_15
Download citation
DOI: https://doi.org/10.1007/978-981-16-8062-5_15
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-8061-8
Online ISBN: 978-981-16-8062-5
eBook Packages: Computer ScienceComputer Science (R0)