Skip to main content

The Variability of the Reasons for Student Dropout in Distance Learning and the Prediction of Dropout-Prone Students

  • Chapter
  • First Online:
Machine Learning Paradigms

Abstract

The adult education that is provided by Universities that use distance learning methods is without doubt inseparable from high dropout rates, frequently higher than those in conventional Universities. Dropping out in a University that provides distance education is caused by professional, academic, health and family and personal reasons. Limiting dropout is crucial and therefore, the aptitude to predict students’ dropping out could be very useful. We try to identify the most appropriate comprehensive learning algorithm using the most informative attributes for the prediction of students’ dropout. Additionally, we have explored the reasons of dropping out in order to examine on a large scale whether they are affected over time and study these changes. The data used was provided by the Student Registry of the Hellenic Open University and additional data was collected by an interview-based survey. It was found that the most informative attributes are the student gender, the participation at the first face to face meeting and the marks on the first two written assignments. A web-based application, which is based on these attributes and can automatically recognize students with high probability of dropping out, was constructed in order to help tutors detect students at risk even at the beginning of the academic year.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Shin, N., Kim, J.: An exploration of learner progress and drop-out in Korea National Open University. Distance Educ. Int. J. 20(1), 81–95 (1999)

    Article  Google Scholar 

  2. Lau, L.K.: Institutional factors affecting student retention. Education 124(1), 126–137 (2003)

    Google Scholar 

  3. Mannan, M.A.: Student attrition and academic and social integration: application of Tinto’s model at the University of Papua New Guinea. High. Educ. 53(2), 147–165 (2007)

    Article  Google Scholar 

  4. Araque, F., Roldán, C., Salguero, A.: Factors influencing university dropout rates. Comput. Educ. 53, 563–574 (2009)

    Article  Google Scholar 

  5. Doherty, W.: An analysis of multiple factors affecting retention in web-based community college courses. Internet High. Educ. 9(4), 245–255 (2006)

    Article  Google Scholar 

  6. Pierrakeas, C., Xenos, M., Panagiotakopoulos, C., Vergidis, D.: A comparative study of dropout rates and causes for two different distance education courses. Int. Rev. Res. Open Distance Learn. 5(2), 1–15 (2004)

    Article  Google Scholar 

  7. Romero, C., Ventura, S.: Data mining in education. Wiley Interdiscip. Rev. Data Min. Knowl. Discovery 3(1), 12–27 (2013)

    Article  Google Scholar 

  8. Dupin-Bryant, P.A.: Pre-entry variables related to retention in online distance education. Am. J. Distance Educ. 18(4), 199–206 (2004)

    Article  Google Scholar 

  9. Xenos, M., Pierrakeas, C., Pintelas, P.: A survey on student dropout rates and dropout causes concerning the students in the course of informatics of the Hellenic Open University. Comput. Educ. 39, 361–377 (2002)

    Article  Google Scholar 

  10. Morris, L.V., Wu, S.S., Finnegan, C.L.: Predicting retention in online general education courses. Am. J. Distance Educ. 19(1), 23–36 (2005)

    Article  Google Scholar 

  11. Parker, A.: Identifying predictors of academic persistence in distance education. J. U. S. Distance Learn. Assoc. 17(1), 55–61 (2003)

    Google Scholar 

  12. Levy, Y.: Comparing dropouts and persistence in e-learning courses. Comput. Educ. 48(2), 185–204 (2007)

    Article  Google Scholar 

  13. Herzog, S.: Estimating student retention and degree-completion time: decision trees and neural networks vs regression. New Dir. Inst. Res. 131, 17–33 (2006)

    Google Scholar 

  14. Atwell, R.H., Ding, W., Ehasz, M., Johnson, S., Wang, M.: Using data mining techniques to predict student development and retention. In: Proceedings of the National Symposium on Student Retention, 9–11 Oct 2006, Albuquerque, New Mexico

    Google Scholar 

  15. Superby, J.F., Vandamme, J.P., Meskens, N.: Determination of factors influencing the achievement of the first-year university students using data mining methods. In: 8th International Conference on Intelligent Tutoring Systems (ITS 2006), Jhongli, Taiwan, pp. 37–44 (2006)

    Google Scholar 

  16. Wegner, L., Flisher, A.J., Chikobvu, P., Lombard, C., King, G.: Leisure boredom and high school dropout in Cape Town, South Africa. J. Adolesc. 31(3), 421–431 (2008)

    Article  Google Scholar 

  17. Moseley, L.G., Mead, D.M.: Predicting who will drop out of nursing courses: a machine learning exercise. Nurse Educ. Today 28, 469–475 (2008)

    Article  Google Scholar 

  18. Lin, S.H.: Data mining for student retention management. J. Comput. Sci. Colleges 27(4), 92–99 (2012)

    Google Scholar 

  19. Lykourentzou, I., Giannoukos, I., Nikopoulos, V., Mpardis, G., Loumos, V.: Dropout prediction in e-learning courses through the combination of machine learning techniques. Comput. Educ. 53, 950–965 (2009)

    Article  Google Scholar 

  20. Delen, D.: A comparative analysis of machine learning techniques for student retention management. Decis. Support Syst. 49, 498–506 (2010)

    Article  Google Scholar 

  21. Lee, Y., Choi, J.: A review of online course dropout research: Implications for practice and future research. Educ. Technol. Res. Dev. 59(5), 593–618 (2011)

    Article  Google Scholar 

  22. Nandeshwar, A., Menzies, T., Nelson, A.: Learning patterns of university student retention. Expert Syst. Appl. 38, 14984–14996 (2011)

    Article  Google Scholar 

  23. Sittichai, R.: Why are there dropouts among university students? Experiences in a Thai University. Int. J. Educ. Dev. 32, 283–289 (2012)

    Article  Google Scholar 

  24. Hu, Ya-Han, Lo, Chia-Lun, Shih, Sheng-Pao: Developing early warning systems to predict students’ online learning performance. Comput. Hum. Behav. 36, 469–478 (2014)

    Article  Google Scholar 

  25. Kassak, O., Kompan, M., Bielikova, M.: Student behavior in a web-based educational system: Exit intent prediction. Eng. Appl. Artif. Intell. 51, 136–149 (2016)

    Article  Google Scholar 

  26. Márquez-Vera, C., Cano, A., Romero, C., Noaman, A.Y.M., Mousa Fardoun, H., Ventura, S.: Early dropout prediction using data mining: a case study with high school students. Expert Syst. 33(1), 107–124 (2016)

    Article  Google Scholar 

  27. Peña-Ayala, A.: Educational data mining: a survey and a data mining-based analysis of recent works. Expert Syst. Appl. 41(4) (Part 1), 1432–1462 (2014)

    Google Scholar 

  28. Williams, G.: Data Mining with Rattle and R: The Art of Excavating Data for Knowledge Discovery. Springer, New York (Use R!) (2011)

    Google Scholar 

  29. Quinlan, J.R.: C4.5: Programs for Machine Learning. Morgan Kaufmann, San Francisco (1993)

    Google Scholar 

  30. Cohen, W.W.: Fast effective rule induction. In: Twelfth International Conference on Machine Learning (ICML-95), Lake Tahoe, California, pp. 115–123

    Google Scholar 

  31. Domingos, P., Pazzani, M.: On the optimality of the simple Bayesian classifier under zero-one loss. Mach. Learn. 29, 103–130 (1997)

    Article  Google Scholar 

  32. Aha, D.: Lazy Learning. Kluwer Academic Publishers, Dordrecht (1997)

    Book  Google Scholar 

  33. Witten, I.H., Frank, E., Hall, M.A.: Data Mining: Practical Machine Learning Tools and Techniques, 3rd edn. Morgan Kaufmann, San Francisco (2011)

    Google Scholar 

  34. Hosmer, D., Lemeshow, S.: Applied Logistic Regression, 2nd edn. Wiley, New York (2005). ISBN: 9780471356325

    Google Scholar 

  35. Burges, C.: A tutorial on support vector machines for pattern recognition. Data Min. Knowl. Discov. 2, 1–47 (1998)

    Article  Google Scholar 

  36. Platt, J.: Using sparseness and analytic QP to speed training of support vector machines. In: Kearns, M.S., Solla, S.A., Cohn, D.A. (eds.) Advances in Neural Information Processing Systems (NIPS). MIT Press, MA (1999)

    Google Scholar 

  37. Anitha, D., Deisy, C.: Proposing a novel approach for classification and sequencing of learning objects in E-learning systems based on learning style. J. Intell. Fuzzy Syst. 29(2), 539–552 (2015)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sotiris Kotsiantis .

Editor information

Editors and Affiliations

Appendix

Appendix

The tool is available in the web page: http://www.math.upatras.gr/~sotos/tool1/.

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Pierrakeas, C., Koutsonikos, G., Lipitakis, AD., Kotsiantis, S., Xenos, M., Gravvanis, G.A. (2020). The Variability of the Reasons for Student Dropout in Distance Learning and the Prediction of Dropout-Prone Students. In: Virvou, M., Alepis, E., Tsihrintzis, G., Jain, L. (eds) Machine Learning Paradigms. Intelligent Systems Reference Library, vol 158. Springer, Cham. https://doi.org/10.1007/978-3-030-13743-4_6

Download citation

Publish with us

Policies and ethics