skip to main content
10.1145/3210713.3210727acmotherconferencesArticle/Chapter ViewAbstractPublication Pagesacm-turcConference Proceedingsconference-collections
research-article

Exploring causes for the dropout on massive open online courses

Published:18 May 2018Publication History

ABSTRACT

With the booming of massive open online courses (MOOCs) in recent years, high drop-out rate and low completion rate remain major impediments. Analyzing the behavior of learners and exploring the reasons of dropout of course are of great significance for evaluating the teaching quality and improving the teaching effect. As educational data mining and machine learning rise, it is possible to explore the causes of high dropout rate and low completion rate on MOOCs. Here, we conducted analysis on data from five multi-disciplinary courses on icourse 163 platform and extracted more than 50 features for evaluating causes of withdrawing on MOOCs. A cause exploration method based on gradient boosting decision tree was implemented to extract the essential features that influence the learner's performance with the importance of each feature calculated. To get the best performed classification model, Cross-Validation was used in modeling and Grid Search was employed in parameter optimization. According to the importance of each feature, different subsets of features were selected and tested in turn. The results indicated nine features being the main factors affecting dropout. It could be expected to provide a guideline for a basis for evidence-based improvement of online education.

References

  1. A. Ramesh, D. Goldwasser, B. Huang, H. Daume III, and L. Getoor, "Learning latent engagement patterns of students in online courses," in Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. J. He, J. Bailey, B. I. Rubinstein, and R. Zhang, "Identifying at-risk students in massive open online courses," in Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Girish K. Balakrishnan, Derrick Coetzee. Predicting student retention in massive open online courses using Hidden Markov Model. Technical Report Identifier: EECS-2013-109Google ScholarGoogle Scholar
  4. Jiajun Liang*, Chao LI, Li Zheng*. Machine Learning Application in MOOCs:Dropout Prediction{C}. The 11th International Conference on Computer Science & Education (ICCSE 2016) August 23--25, 2016.Google ScholarGoogle ScholarCross RefCross Ref
  5. Muñoz-Merino PJ, Ruipérez-Valiente JA, Alario-Hoyos C, et al. Precise Effectiveness Strategy for analyzing the effectiveness of students with educational resources and activities in MOOCs{J}. Computers in Human Behavior, 2015, 47(C):108--118. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Jiang Z, Zhang Y, Li X. Learning behavior analysis and prediction based on MOOC data{J}. Journal of Computer Research & Development, 2015(3):614--628.Google ScholarGoogle Scholar
  7. Hira S, Deshpande P S. Mining precise cause and effect rules in large time series data of socio-economic indicators{J}. Springerplus, 2016, 5(1):1625.Google ScholarGoogle ScholarCross RefCross Ref
  8. Ben Hoyle, Markus Michael Rau, Feature importance for machine learning redshifts applied to SDSS galaxies.Google ScholarGoogle Scholar
  9. Ma S, Li J, Liu L, et al. Mining combined causes in large data sets{J}. Knowledge-Based Systems, 2016, 92:104--111. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Muhammad R. Khan, Johua Manoj, Anikate Singh, et al. Behavioral Modeling for Churn Prediction: Early Indicators and Accurate Predictors of Custom Defection and Loyalty{J}. 2015, 92(3):677--680. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Li, W., Gao, M., et al. Dropout prediction in MOOCs using behavior features and multi-view semi-supervised learning. In 2016 International Joint Conference on Neural Networks (IJCNN), (pp. 3130--3137). IEEE.Google ScholarGoogle Scholar
  12. Chen T, Guestrin C. XGBoost: A Scalable Tree Boosting System{J}. 2016:785--794. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Rokach L. Ensemble-based classifiers{J}. Artificial Intelligence Review, 2010, 33(1--2):1--39. Google ScholarGoogle ScholarDigital LibraryDigital Library

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in
  • Published in

    cover image ACM Other conferences
    ACM TURC '18: Proceedings of ACM Turing Celebration Conference - China
    May 2018
    139 pages
    ISBN:9781450364157
    DOI:10.1145/3210713

    Copyright © 2018 ACM

    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    • Published: 18 May 2018

    Permissions

    Request permissions about this article.

    Request Permissions

    Check for updates

    Qualifiers

    • research-article

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader