Abstract
Cloud computing-based online education has played a vital role in enabling uninterrupted learning during crises such as the COVID-19 pandemic. This study explored the key variables associated with cloud computing that can effectively support the operation of online education platforms. By analyzing real data from 63 online learning platforms, the study utilizes the unsupervised random forest (RF) method, ranking the importance of cloud computing-related variables using the Gini index and permutation importance and Boruta. The study's contributions include the application of innovative techniques such as Addcl1 and K-means clustering for data pre-processing, unsupervised RF dissimilarity calculated by partitioning around medoids (PAM) clustering and multidimensional scaling (MDS) plots, and the comparison of RF accuracy (98.4%) with other machine learning techniques using WEKA software. The results highlight variables such as public cloud, commercial sourcing, course number limitations, and synchronized tools as important factors in the successful implementation of cloud computing-based online learning platforms. The theoretical and practical implications of the study results can be used by researchers and practitioners to structure cloud computing-based online learning platforms that could successfully function for distance learning during crises like the COVID-19 pandemic.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Afanador, N. L., Smolinska, A., Tran, T. N., & Blanchet, L. (2016). Unsupervised random forest: A tutorial with case studies. Journal of Chemometrics, 30(5), 232–241. https://doi.org/10.1002/cem.2790
Aissaoui, K., Amane, M., Berrada, M., & Madani, M. A. (2022). A new framework to secure cloud based e-learning systems. In S. Bennani, Y. Lakhrissi, G. Khaissidi, A. Mansouri, & Y. Khamlichi (Eds.), Lecture notes in electrical engineering (pp. 65–75). Springer Singapore.
Al-Samarraie, H., & Saeed, N. (2018). A systematic review of cloud computing tools for collaborative learning: Opportunities and challenges to the blended-learning environment. Computers and Education, 124, 77–91. https://doi.org/10.1016/j.compedu.2018.05.016
Anshari, M., Alas, Y., & Guan, L. S. (2016). Developing online learning resources: Big data, social networks, and cloud computing to support pervasive knowledge. Education and Information Technologies, 21(6), 1663–1677. https://doi.org/10.1007/s10639-015-9407-3
Archer, E. (2020). rfPermute: Estimate permutation p-values for random forest importance metrics. rfPermute. https://github.com/EricArcher/rfPermute. Accessed 15 Apr 2023.
Arpaci, I. (2017). Antecedents and consequences of cloud computing adoption in education to achieve knowledge management. Computers in Human Behavior, 70, 382–390. https://doi.org/10.1016/j.chb.2017.01.024
Arpaci, I. (2019). A hybrid modeling approach for predicting the educational use of mobile cloud computing services in higher education. Computers in Human Behavior, 90, 181–187. https://doi.org/10.1016/j.chb.2018.09.005
Belgiu, M., & Drăgu, L. (2016). Random forest in remote sensing: A review of applications and future directions. ISPRS Journal of Photogrammetry and Remote Sensing, 114, 24–31. https://doi.org/10.1016/j.isprsjprs.2016.01.011
Biau, G., & Scornet, E. (2016). A random forest guided tour. TEST, 25(2), 197–227. https://doi.org/10.1007/s11749-016-0481-7
Breiman, L. (2001). Random forests. Machine Learning, 45, 5–32. https://doi.org/10.1201/9780367816377-11
Breiman, L., & Cutler, A. (2004). Random forests. https://www.stat.berkeley.edu/~breiman/RandomForests/cc_home.htm. Accessed 27 Mar 2023.
Chen, X., & Ishwaran, H. (2012). Random forests for genomic data analysis. Genomics, 99(6), 323–329. https://doi.org/10.1016/j.ygeno.2012.04.003
Chung, S., Jagatha, V., & Johnson, D. (2023). Cloud-based development environment: A case study of teaching a cross-platform mobile app course. In 24th Annual Conference on Information Technology Education (SIGITE ’23) (pp. 148–149). ACM. https://doi.org/10.1145/3585059.3611429
Cox, T. F., & Cox, M. A. A. (2000). Multidimensional scaling. Chapman and Hall/CRC.
Dai, Z., Zhang, Q., Zhao, L., Zhu, X., & Zhou, D. (2023). Cloud-edge computing technology-based internet of things system for smart classroom environment. International Journal of Emerging Technologies in Learning, 18(8), 79–96. https://doi.org/10.3991/ijet.v18i08.28299
Denton, D. W. (2012). Enhancing instruction through constructivism, cooperative learning, and cloud computing. TechTrends, 56(4), 34–41. https://doi.org/10.1007/s11528-012-0585-1
Despotović-Zrakić, M., Simić, K., Labus, A., Milić, A., & Jovanić, B. (2013). Scaffolding environment for e-learning through cloud computing. Educational Technology and Society, 16(3), 301–314.
Doelitzscher, F., Sulistio, A., Reich, C., Kuijs, H., & Wolf, D. (2011). Private cloud for collaboration and e-Learning services: From IaaS to SaaS. Computing, 91(1), 23–42. https://doi.org/10.1007/s00607-010-0106-z
Dong, B., Zheng, Q., Qiao, M., Shu, J., & Yang, J. (2009). BlueSky cloud framework: An e-learning framework. In International Conference on Cloud Computing (pp. 577–582). Springer Berlin Heidelberg.
Dong, B., Zheng, Q., Yang, J., Li, H., & Qiao, M. (2009). An e-learning ecosystem based on cloud computing infrastructure. In IEEE International Conference on Advanced Learning Technologies (pp. 125–127). IEEE.
Donthu, N., & Gustafsson, A. (2020). Effect of COVID-10 on business and research. Journal of Business Research, 117, 284–289. https://doi.org/10.1016/j.jbusres.2020.06.008
Elghazel, H., & Aussem, A. (2010). Feature selection for unsupervised learning using random cluster ensembles. In Proceedings - IEEE International Conference on Data Mining, ICDM (pp. 168–175). IEEE. https://doi.org/10.1109/ICDM.2010.137
Englund, C., & Verikas, A. (2012). A novel approach to estimate proximity in a random forest: An exploratory study. Expert Systems with Applications, 39(17), 13046–13050. https://doi.org/10.1016/j.eswa.2012.05.094
Ercan, T. (2010). Effective use of cloud computing in educational institutions. Procedia - Social and Behavioral Sciences, 2(2), 938–942. https://doi.org/10.1016/j.sbspro.2010.03.130
Garg, R. (2020). MCDM-based parametric selection of cloud deployment models for an academic organization. IEEE Transactions on Cloud Computing, 1–1. IEEE. https://doi.org/10.1109/TCC.2020.2980534
González-Martínez, J. A., Bote-Lorenzo, M. L., Gómez-Sánchez, E., & Cano-Parra, R. (2015). Cloud computing and education: A state-of-the-art survey. Computers and Education, 80, 132–151. https://doi.org/10.1016/j.compedu.2014.08.017
Hiran, K. K., & Dadhich, M. (2024). Predicting the core determinants of cloud-edge computing adoption (CECA) for sustainable development in the higher education institutions of Africa: A high order SEM-ANN analytical approach. Technological Forecasting and Social Change, 199, 1–17. https://doi.org/10.1016/j.techfore.2023.122979
Horvath, S., & Shi, T. (2006). R software tutorial: Random forest clustering applied to renal cell carcinoma. Learning, 2, 18–22.
Kursa, M. B., & Rudnicki, W. R. (2010). Feature selection with the boruta package. Journal of Statistical Software, 36(11), 1–13. https://doi.org/10.18637/jss.v036.i11
Lee, T. H., Ullah, A., & Wang, R. (2020). Bootstrap aggregating and random forest. In Macroeconomic Forecasting in the Era of Big Data (pp. 389–429). Springer Cham. https://doi.org/10.1007/978-3-030-31150-6_13
Liaw, A., & Wiener, M. (2003). Classification and regression by randomForest. R News, 2(3), 18–22.
Llano, A. S. J., Nguyen, T.-M.-T., Rodriguez, D., & Chardy, M. (2023). POEMA: A personal cloud for inclusive education. In Central European Conference on Information and Intelligent Systems (pp. 509–515). HAL.
Marian, M., Borcosi, I., Borcosi, C. A., Cusman, A., Toma, A., & Ionica, D. (2023). A case study on extending the use of cloud-based services within two romanian higher-education institutions. In 2023 22nd RoEduNet Conference: Networking in Education and Research (RoEduNet) (pp. 1–6). IEEE. https://doi.org/10.1109/RoEduNet60162.2023.10274937
Masud, A. H., & Huang, X. (2012). An e-learning system architecture based on cloud computing. World Academy of Science, Engineering and Technology, 6, 736–740.
Mell, P., & Grance, T. (2011). The NIST definition of cloud computing. National Institute of Standards and Technology, Special Publication, 800 (2011), 145
Nowling Lab. (2015). Categorical variable encoding and feature importance bias with random forests. Nowling Lab. http://rnowling.github.io/machine/learning/2015/08/10/random-forest-bias.html. Accessed 23 Mar 2023
Oshiro, T.M., Perez, P.S., Baranauskas, J.A. (2012). How many trees in a random forest? In: Perner, P. (eds) Machine learning and data mining in pattern recognition. MLDM 2012. Lecture notes in computer science (pp. 154–168). Springer. https://doi.org/10.1007/978-3-642-31537-4_13
Pardeshi, V. H. (2014). Cloud computing for higher education institutes: architecture, strategy and recommendations for effective adaptation. In Procedia Economics and Finance (pp. 589–599). Elsevier. https://doi.org/10.1016/s2212-5671(14)00224-x
Peerbhay, K. Y., Mutanga, O., & Ismail, R. (2015). Random forests unsupervised classification: The detection and mapping of solanum mauritianum infestations in plantation forestry using hyperspectral data. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 8(6), 3107–3122. https://doi.org/10.1109/JSTARS.2015.2396577
Pocatilu, P., Alecu, F., & Vetrici, M. (2010). Measuring the efficiency of cloud computing for e-learning systems. WSEAS Transactions on Computers, 9(1), 42–51.
Qasem, Y. A. M., Abdullah, R., Jusoh, Y. Y., Atan, R., & Asadi, S. (2021). Analyzing continuance of cloud computing in higher education institutions: Should we stay, or should we go? Sustainability, 13(9), 1–37. https://doi.org/10.3390/su13094664
Rodriguez-Galiano, V. F., Ghimire, B., Rogan, J., Chica-Olmo, M., & Rigol-Sanchez, J. P. (2012). An assessment of the effectiveness of a random forest classifier for land-cover classification. ISPRS Journal of Photogrammetry and Remote Sensing, 67(1), 93–104. https://doi.org/10.1016/j.isprsjprs.2011.11.002
Schweinberger, M. (2020). Tree-based models in R. Brisbane: The University of Queensland. https://slcladal.github.io/advancedstatztrees.html. Accessed 27 Mar 2023.
Shaikhina, T., Lowe, D., Daga, S., Briggs, D., Higgins, R., & Khovanova, N. (2019). Decision tree and random forest models for outcome prediction in antibody incompatible kidney transplantation. Biomedical Signal Processing and Control, 52, 456–462. https://doi.org/10.1016/j.bspc.2017.01.012
Shi, T., & Horvath, S. (2006). Unsupervised learning with random forest predictors. Journal of Computational and Graphical Statistics, 15(1), 118–138. https://doi.org/10.1198/106186006X94072
Strobl, C., Boulesteix, A. L., Zeileis, A., & Hothorn, T. (2007). Bias in random forest variable importance measures: Illustrations, sources and a solution. BMC Bioinformatics, 8(25), 1–21. https://doi.org/10.1186/1471-2105-8-25
Sultan, N. (2010). Cloud computing for education: A new dawn? International Journal of Information Management, 30(2), 109–116. https://doi.org/10.1016/j.ijinfomgt.2009.09.004
Thavi, R. R., Narwane, V. S., Jhaveri, R. H., & Raut, R. D. (2021). To determine the critical factors for the adoption of cloud computing in the educational sector in developing countries – a fuzzy DEMATEL approach. Kybernetes, 51(11), 3340–3365. https://doi.org/10.1108/K-12-2020-0864
Vouk, M., Averitt, S., Bugaev, M., Kurth, A., Peeler, A., Shaffer, H., et al. (2008). “Powered by VCL” - using virtual computing laboratory (VCL) technology to power cloud computing. In International Conference on Virtual Computing Iniviative (pp. 1–10). IEEE.
Wang, H., Wang, C., Lv, B., & Pan, X. (2015). Improved variable importance measure of random forest via combining of proximity measure and support vector machine for stable feature selection. Journal of Information and Computational Science, 12(8), 3241–3252. https://doi.org/10.12733/jics20105854
Zheng, X., Zhu, S., & Lin, Z. (2013). Capturing the essence of word-of-mouth for social commerce: Assessing the quality of online e-commerce reviews by a semi-supervised approach. Decision Support Systems, 56(1), 211–222.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
No potential conflict of interest was reported by the authors.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendix
Appendix
Supplementary material
Supplementary data associated with this article can be found online at https://github.com/HuiHAN0601/RF_Data
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Han, H., Trimi, S. Analysis of cloud computing-based education platforms using unsupervised random forest. Educ Inf Technol 29, 15905–15932 (2024). https://doi.org/10.1007/s10639-024-12457-w
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10639-024-12457-w