skip to main content
10.1145/3576050.3576146acmotherconferencesArticle/Chapter ViewAbstractPublication PageslakConference Proceedingsconference-collections
short-paper

Cluster-Based Performance of Student Dropout Prediction as a Solution for Large Scale Models in a Moodle LMS

Published:13 March 2023Publication History

ABSTRACT

Learning management systems provide a wide breadth of data waiting to be analyzed and utilized to enhance student and faculty experience in higher education. As universities struggle to support students’ engagement, success and retention, learning analytics is being used to build predictive models and develop dashboards to support learners and help them stay engaged, to help teachers identify students needing support, and to predict and prevent dropout. Learning with Big Data has its challenges, however: managing great quantities of data requires time and expertise. To predict students at risk, many institutions use machine learning algorithms with LMS data for a given course or type of course, but only a few are trying to make predictions for a large subset of courses. This begs the question: “How can student dropout be predicted on a very large set of courses in an institution Moodle LMS?” In this paper, we use automation to improve student dropout prediction for a very large subset of courses, by clustering them based on course design and similarity, then by automatically training, testing, and selecting machine learning algorithms for each cluster. We developed a promising methodology that outlines a basic framework that can be adjusted and optimized in many ways and that further studies can easily build on and improve.

References

  1. Kimberly E. Arnold and Matthew D. Pistilli. 2012. Course signals at Purdue: Using learning analytics to increase student success. In Proceedings of the 2nd international conference on learning analytics and knowledge, 267–270.Google ScholarGoogle Scholar
  2. Matthew Berland, Ryan S. Baker, and Paulo Blikstein. 2014. Educational data mining and learning analytics: Applications to constructionist research. Technology, Knowledge and Learning 19, 1 (2014), 205–220.Google ScholarGoogle ScholarCross RefCross Ref
  3. Tadeusz Caliński and Jerzy Harabasz. 1974. A dendrite method for cluster analysis. Communications in Statistics-theory and Methods 3, 1 (1974), 1–27.Google ScholarGoogle ScholarCross RefCross Ref
  4. Michel Desmarais and François Lemieux. 2013. Clustering and visualizing study state sequences. In Educational Data Mining 2013.Google ScholarGoogle Scholar
  5. Xu Du, Juan Yang, Brett E. Shelton, Jui-Long Hung, and Mingyan Zhang. 2021. A systematic meta-review and analysis of learning analytics research. Behaviour & information technology 40, 1 (2021), 49–62.Google ScholarGoogle Scholar
  6. Jennifer A Fredricks, Phyllis C Blumenfeld, and Alison H Paris. 2004. School Engagement: Potential of the Concept, State of the Evidence. Review of Educational Research 74, 1 (March 2004), 59–109. DOI:https://doi.org/10.3102/00346543074001059Google ScholarGoogle ScholarCross RefCross Ref
  7. Yamini Goel and Rinkaj Goyal. 2020. On the effectiveness of self-training in mooc dropout prediction. Open Computer Science 10, 1 (2020), 246–258.Google ScholarGoogle ScholarCross RefCross Ref
  8. Dirk Ifenthaler and Jane Yin-Kim Yau. 2020. Utilising learning analytics to support study success in higher education: a systematic review. Educational Technology Research and Development 68, 4 (2020), 1961–1990.Google ScholarGoogle ScholarCross RefCross Ref
  9. Sandeep M. Jayaprakash, Erik W. Moody, Eitel JM Lauría, James R. Regan, and Joshua D. Baron. 2014. Early alert of academically at-risk students: An open source analytics initiative. Journal of Learning Analytics 1, 1 (2014), 6–47.Google ScholarGoogle ScholarCross RefCross Ref
  10. Dr Nicole Johnson. 2021. 2021 National Report: Lessons from the COVID-19 pandemic. (2021), 21.Google ScholarGoogle Scholar
  11. Gaëlle Molinari, Bruno Poellhuber, Jean Heutte, Elise Lavoué, Denise Sutter Widmer, and Pierre-André Caron. 2016. L'engagement et la persistance dans les dispositifs de formation en ligne: regards croisés. Distances et médiations des savoirs. Distance and Mediation of Knowledge 13 (2016).Google ScholarGoogle Scholar
  12. Rudolph S. Parrish and Horace J. Spencer III. 2004. Effect of normalization on significance testing for oligonucleotide microarrays. Journal of biopharmaceutical statistics 14, 3 (2004), 575–589.Google ScholarGoogle ScholarCross RefCross Ref
  13. Bruno Poellhuber, Normand Roy, and Ibtihel Bouchoucha. 2019. Understanding participant's behaviour in massively open online courses. International Review of Research in Open and Distributed Learning 20, 1 (2019).Google ScholarGoogle ScholarCross RefCross Ref
  14. Anita Rácz, Dávid Bajusz, and Károly Héberger. 2021. Effect of dataset size and train/test split ratios in QSAR/QSPR multiclass classification. Molecules 26, 4 (2021), 1111.Google ScholarGoogle ScholarCross RefCross Ref
  15. Silhouettes Rousseeuw. A graphical aid to the interpretation and validation of cluster analysis. Journal of Computational and Applied Mathematics 20 , 53.Google ScholarGoogle Scholar
  16. Usha Ruby and Vamsidhar Yendapalli. 2020. Binary cross entropy with deep learning technique for image classification. Int. J. Adv. Trends Comput. Sci. Eng 9, 10 (2020).Google ScholarGoogle Scholar
  17. Niall Sclater, Alice Peasgood, and Joel Mullan. 2016. Learning analytics in higher education. London: Jisc. Accessed February 8, 2017 (2016), 176.Google ScholarGoogle Scholar
  18. Ketan Rajshekhar Shahapure and Charles Nicholas. 2020. Cluster quality analysis using silhouette score. In 2020 IEEE 7th International Conference on Data Science and Advanced Analytics (DSAA), IEEE, 747–748.Google ScholarGoogle ScholarCross RefCross Ref
  19. George Siemens, Dragan Gasevic, Caroline Haythornthwaite, Shane Dawson, S. Buckingham Shum, Rebecca Ferguson, Erik Duval, Katrien Verbert, and RSJD Baker. 2011. Open Learning Analytics: an integrated & modularized platform. In Proceedings of the 2Nd International Conference on Learning Analytics and Knowledge, Open University Press Maidenhead, Vancouver, 252–254.Google ScholarGoogle Scholar
  20. Jimin Tan, Jianan Yang, Sai Wu, Gang Chen, and Jake Zhao. 2021. A critical look at the current train/test split in machine learning. arXiv preprint arXiv:2106.04525 (2021).Google ScholarGoogle Scholar
  21. Wei Wang, Han Yu, and Chunyan Miao. 2017. Deep model for dropout prediction in MOOCs. In Proceedings of the 2nd international conference on crowd science and engineering, 26–32.Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Cluster-Based Performance of Student Dropout Prediction as a Solution for Large Scale Models in a Moodle LMS

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Other conferences
          LAK2023: LAK23: 13th International Learning Analytics and Knowledge Conference
          March 2023
          692 pages
          ISBN:9781450398657
          DOI:10.1145/3576050

          Copyright © 2023 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 13 March 2023

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • short-paper
          • Research
          • Refereed limited

          Acceptance Rates

          Overall Acceptance Rate236of782submissions,30%

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        HTML Format

        View this article in HTML Format .

        View HTML Format