Analysis of cloud computing-based education platforms using unsupervised random forest

Han, Hui; Trimi, Silvana

doi:10.1007/s10639-024-12457-w

Analysis of cloud computing-based education platforms using unsupervised random forest

Published: 05 February 2024

Volume 29, pages 15905–15932, (2024)
Cite this article

Education and Information Technologies Aims and scope Submit manuscript

386 Accesses
1 Altmetric
Explore all metrics

Abstract

Cloud computing-based online education has played a vital role in enabling uninterrupted learning during crises such as the COVID-19 pandemic. This study explored the key variables associated with cloud computing that can effectively support the operation of online education platforms. By analyzing real data from 63 online learning platforms, the study utilizes the unsupervised random forest (RF) method, ranking the importance of cloud computing-related variables using the Gini index and permutation importance and Boruta. The study's contributions include the application of innovative techniques such as Addcl1 and K-means clustering for data pre-processing, unsupervised RF dissimilarity calculated by partitioning around medoids (PAM) clustering and multidimensional scaling (MDS) plots, and the comparison of RF accuracy (98.4%) with other machine learning techniques using WEKA software. The results highlight variables such as public cloud, commercial sourcing, course number limitations, and synchronized tools as important factors in the successful implementation of cloud computing-based online learning platforms. The theoretical and practical implications of the study results can be used by researchers and practitioners to structure cloud computing-based online learning platforms that could successfully function for distance learning during crises like the COVID-19 pandemic.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Impact Prediction of Online Education During COVID-19 Using Machine Learning: A Case Study

Predicting Student Retention in Smart Learning Environments Using Machine Learning

Use of Data Mining to Identify the Technological Resources that Contribute to School Performance in Large-Scale Evaluations of Brazilian High School

Discover the latest articles, news and stories from top researchers in related subjects.

Digital Education and Educational Technology

References

Afanador, N. L., Smolinska, A., Tran, T. N., & Blanchet, L. (2016). Unsupervised random forest: A tutorial with case studies. Journal of Chemometrics, 30(5), 232–241. https://doi.org/10.1002/cem.2790
Article Google Scholar
Aissaoui, K., Amane, M., Berrada, M., & Madani, M. A. (2022). A new framework to secure cloud based e-learning systems. In S. Bennani, Y. Lakhrissi, G. Khaissidi, A. Mansouri, & Y. Khamlichi (Eds.), Lecture notes in electrical engineering (pp. 65–75). Springer Singapore.
Google Scholar
Al-Samarraie, H., & Saeed, N. (2018). A systematic review of cloud computing tools for collaborative learning: Opportunities and challenges to the blended-learning environment. Computers and Education, 124, 77–91. https://doi.org/10.1016/j.compedu.2018.05.016
Article Google Scholar
Anshari, M., Alas, Y., & Guan, L. S. (2016). Developing online learning resources: Big data, social networks, and cloud computing to support pervasive knowledge. Education and Information Technologies, 21(6), 1663–1677. https://doi.org/10.1007/s10639-015-9407-3
Article Google Scholar
Archer, E. (2020). rfPermute: Estimate permutation p-values for random forest importance metrics. rfPermute. https://github.com/EricArcher/rfPermute. Accessed 15 Apr 2023.
Arpaci, I. (2017). Antecedents and consequences of cloud computing adoption in education to achieve knowledge management. Computers in Human Behavior, 70, 382–390. https://doi.org/10.1016/j.chb.2017.01.024
Article Google Scholar
Arpaci, I. (2019). A hybrid modeling approach for predicting the educational use of mobile cloud computing services in higher education. Computers in Human Behavior, 90, 181–187. https://doi.org/10.1016/j.chb.2018.09.005
Article Google Scholar
Belgiu, M., & Drăgu, L. (2016). Random forest in remote sensing: A review of applications and future directions. ISPRS Journal of Photogrammetry and Remote Sensing, 114, 24–31. https://doi.org/10.1016/j.isprsjprs.2016.01.011
Article Google Scholar
Biau, G., & Scornet, E. (2016). A random forest guided tour. TEST, 25(2), 197–227. https://doi.org/10.1007/s11749-016-0481-7
Article MathSciNet Google Scholar
Breiman, L. (2001). Random forests. Machine Learning, 45, 5–32. https://doi.org/10.1201/9780367816377-11
Article Google Scholar
Breiman, L., & Cutler, A. (2004). Random forests. https://www.stat.berkeley.edu/~breiman/RandomForests/cc_home.htm. Accessed 27 Mar 2023.
Chen, X., & Ishwaran, H. (2012). Random forests for genomic data analysis. Genomics, 99(6), 323–329. https://doi.org/10.1016/j.ygeno.2012.04.003
Article Google Scholar
Chung, S., Jagatha, V., & Johnson, D. (2023). Cloud-based development environment: A case study of teaching a cross-platform mobile app course. In 24th Annual Conference on Information Technology Education (SIGITE ’23) (pp. 148–149). ACM. https://doi.org/10.1145/3585059.3611429
Cox, T. F., & Cox, M. A. A. (2000). Multidimensional scaling. Chapman and Hall/CRC.
Book Google Scholar
Dai, Z., Zhang, Q., Zhao, L., Zhu, X., & Zhou, D. (2023). Cloud-edge computing technology-based internet of things system for smart classroom environment. International Journal of Emerging Technologies in Learning, 18(8), 79–96. https://doi.org/10.3991/ijet.v18i08.28299
Article Google Scholar
Denton, D. W. (2012). Enhancing instruction through constructivism, cooperative learning, and cloud computing. TechTrends, 56(4), 34–41. https://doi.org/10.1007/s11528-012-0585-1
Article MathSciNet Google Scholar
Despotović-Zrakić, M., Simić, K., Labus, A., Milić, A., & Jovanić, B. (2013). Scaffolding environment for e-learning through cloud computing. Educational Technology and Society, 16(3), 301–314.
Google Scholar
Doelitzscher, F., Sulistio, A., Reich, C., Kuijs, H., & Wolf, D. (2011). Private cloud for collaboration and e-Learning services: From IaaS to SaaS. Computing, 91(1), 23–42. https://doi.org/10.1007/s00607-010-0106-z
Article Google Scholar
Dong, B., Zheng, Q., Qiao, M., Shu, J., & Yang, J. (2009). BlueSky cloud framework: An e-learning framework. In International Conference on Cloud Computing (pp. 577–582). Springer Berlin Heidelberg.
Dong, B., Zheng, Q., Yang, J., Li, H., & Qiao, M. (2009). An e-learning ecosystem based on cloud computing infrastructure. In IEEE International Conference on Advanced Learning Technologies (pp. 125–127). IEEE.
Donthu, N., & Gustafsson, A. (2020). Effect of COVID-10 on business and research. Journal of Business Research, 117, 284–289. https://doi.org/10.1016/j.jbusres.2020.06.008
Article Google Scholar
Elghazel, H., & Aussem, A. (2010). Feature selection for unsupervised learning using random cluster ensembles. In Proceedings - IEEE International Conference on Data Mining, ICDM (pp. 168–175). IEEE. https://doi.org/10.1109/ICDM.2010.137
Englund, C., & Verikas, A. (2012). A novel approach to estimate proximity in a random forest: An exploratory study. Expert Systems with Applications, 39(17), 13046–13050. https://doi.org/10.1016/j.eswa.2012.05.094
Article Google Scholar
Ercan, T. (2010). Effective use of cloud computing in educational institutions. Procedia - Social and Behavioral Sciences, 2(2), 938–942. https://doi.org/10.1016/j.sbspro.2010.03.130
Article Google Scholar
Garg, R. (2020). MCDM-based parametric selection of cloud deployment models for an academic organization. IEEE Transactions on Cloud Computing, 1–1. IEEE. https://doi.org/10.1109/TCC.2020.2980534
González-Martínez, J. A., Bote-Lorenzo, M. L., Gómez-Sánchez, E., & Cano-Parra, R. (2015). Cloud computing and education: A state-of-the-art survey. Computers and Education, 80, 132–151. https://doi.org/10.1016/j.compedu.2014.08.017
Article Google Scholar
Hiran, K. K., & Dadhich, M. (2024). Predicting the core determinants of cloud-edge computing adoption (CECA) for sustainable development in the higher education institutions of Africa: A high order SEM-ANN analytical approach. Technological Forecasting and Social Change, 199, 1–17. https://doi.org/10.1016/j.techfore.2023.122979
Article Google Scholar
Horvath, S., & Shi, T. (2006). R software tutorial: Random forest clustering applied to renal cell carcinoma. Learning, 2, 18–22.
Google Scholar
Kursa, M. B., & Rudnicki, W. R. (2010). Feature selection with the boruta package. Journal of Statistical Software, 36(11), 1–13. https://doi.org/10.18637/jss.v036.i11
Article Google Scholar
Lee, T. H., Ullah, A., & Wang, R. (2020). Bootstrap aggregating and random forest. In Macroeconomic Forecasting in the Era of Big Data (pp. 389–429). Springer Cham. https://doi.org/10.1007/978-3-030-31150-6_13
Liaw, A., & Wiener, M. (2003). Classification and regression by randomForest. R News, 2(3), 18–22.
Google Scholar
Llano, A. S. J., Nguyen, T.-M.-T., Rodriguez, D., & Chardy, M. (2023). POEMA: A personal cloud for inclusive education. In Central European Conference on Information and Intelligent Systems (pp. 509–515). HAL.
Marian, M., Borcosi, I., Borcosi, C. A., Cusman, A., Toma, A., & Ionica, D. (2023). A case study on extending the use of cloud-based services within two romanian higher-education institutions. In 2023 22nd RoEduNet Conference: Networking in Education and Research (RoEduNet) (pp. 1–6). IEEE. https://doi.org/10.1109/RoEduNet60162.2023.10274937
Masud, A. H., & Huang, X. (2012). An e-learning system architecture based on cloud computing. World Academy of Science, Engineering and Technology, 6, 736–740.
Google Scholar
Mell, P., & Grance, T. (2011). The NIST definition of cloud computing. National Institute of Standards and Technology, Special Publication, 800 (2011), 145
Nowling Lab. (2015). Categorical variable encoding and feature importance bias with random forests. Nowling Lab. http://rnowling.github.io/machine/learning/2015/08/10/random-forest-bias.html. Accessed 23 Mar 2023
Oshiro, T.M., Perez, P.S., Baranauskas, J.A. (2012). How many trees in a random forest? In: Perner, P. (eds) Machine learning and data mining in pattern recognition. MLDM 2012. Lecture notes in computer science (pp. 154–168). Springer. https://doi.org/10.1007/978-3-642-31537-4_13
Pardeshi, V. H. (2014). Cloud computing for higher education institutes: architecture, strategy and recommendations for effective adaptation. In Procedia Economics and Finance (pp. 589–599). Elsevier. https://doi.org/10.1016/s2212-5671(14)00224-x
Peerbhay, K. Y., Mutanga, O., & Ismail, R. (2015). Random forests unsupervised classification: The detection and mapping of solanum mauritianum infestations in plantation forestry using hyperspectral data. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 8(6), 3107–3122. https://doi.org/10.1109/JSTARS.2015.2396577
Article Google Scholar
Pocatilu, P., Alecu, F., & Vetrici, M. (2010). Measuring the efficiency of cloud computing for e-learning systems. WSEAS Transactions on Computers, 9(1), 42–51.
Google Scholar
Qasem, Y. A. M., Abdullah, R., Jusoh, Y. Y., Atan, R., & Asadi, S. (2021). Analyzing continuance of cloud computing in higher education institutions: Should we stay, or should we go? Sustainability, 13(9), 1–37. https://doi.org/10.3390/su13094664
Article Google Scholar
Rodriguez-Galiano, V. F., Ghimire, B., Rogan, J., Chica-Olmo, M., & Rigol-Sanchez, J. P. (2012). An assessment of the effectiveness of a random forest classifier for land-cover classification. ISPRS Journal of Photogrammetry and Remote Sensing, 67(1), 93–104. https://doi.org/10.1016/j.isprsjprs.2011.11.002
Article Google Scholar
Schweinberger, M. (2020). Tree-based models in R. Brisbane: The University of Queensland. https://slcladal.github.io/advancedstatztrees.html. Accessed 27 Mar 2023.
Shaikhina, T., Lowe, D., Daga, S., Briggs, D., Higgins, R., & Khovanova, N. (2019). Decision tree and random forest models for outcome prediction in antibody incompatible kidney transplantation. Biomedical Signal Processing and Control, 52, 456–462. https://doi.org/10.1016/j.bspc.2017.01.012
Article Google Scholar
Shi, T., & Horvath, S. (2006). Unsupervised learning with random forest predictors. Journal of Computational and Graphical Statistics, 15(1), 118–138. https://doi.org/10.1198/106186006X94072
Article MathSciNet Google Scholar
Strobl, C., Boulesteix, A. L., Zeileis, A., & Hothorn, T. (2007). Bias in random forest variable importance measures: Illustrations, sources and a solution. BMC Bioinformatics, 8(25), 1–21. https://doi.org/10.1186/1471-2105-8-25
Article Google Scholar
Sultan, N. (2010). Cloud computing for education: A new dawn? International Journal of Information Management, 30(2), 109–116. https://doi.org/10.1016/j.ijinfomgt.2009.09.004
Article Google Scholar
Thavi, R. R., Narwane, V. S., Jhaveri, R. H., & Raut, R. D. (2021). To determine the critical factors for the adoption of cloud computing in the educational sector in developing countries – a fuzzy DEMATEL approach. Kybernetes, 51(11), 3340–3365. https://doi.org/10.1108/K-12-2020-0864
Vouk, M., Averitt, S., Bugaev, M., Kurth, A., Peeler, A., Shaffer, H., et al. (2008). “Powered by VCL” - using virtual computing laboratory (VCL) technology to power cloud computing. In International Conference on Virtual Computing Iniviative (pp. 1–10). IEEE.
Wang, H., Wang, C., Lv, B., & Pan, X. (2015). Improved variable importance measure of random forest via combining of proximity measure and support vector machine for stable feature selection. Journal of Information and Computational Science, 12(8), 3241–3252. https://doi.org/10.12733/jics20105854
Article Google Scholar
Zheng, X., Zhu, S., & Lin, Z. (2013). Capturing the essence of word-of-mouth for social commerce: Assessing the quality of online e-commerce reviews by a semi-supervised approach. Decision Support Systems, 56(1), 211–222.
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, Norwegian University of Science and Technology (NTNU), Trondheim, Norway
Hui Han
Department of Supply Chain Management and Analytics, College of Business, University of Nebraska, Lincoln, NE, USA
Silvana Trimi

Authors

Hui Han
View author publications
You can also search for this author in PubMed Google Scholar
Silvana Trimi
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Hui Han.

Ethics declarations

Conflict of interest

No potential conflict of interest was reported by the authors.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendix

Supplementary material

Supplementary data associated with this article can be found online at https://github.com/HuiHAN0601/RF_Data

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Han, H., Trimi, S. Analysis of cloud computing-based education platforms using unsupervised random forest. Educ Inf Technol 29, 15905–15932 (2024). https://doi.org/10.1007/s10639-024-12457-w

Download citation

Received: 10 August 2023
Accepted: 09 January 2024
Published: 05 February 2024
Issue Date: August 2024
DOI: https://doi.org/10.1007/s10639-024-12457-w

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Analysis of cloud computing-based education platforms using unsupervised random forest

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Impact Prediction of Online Education During COVID-19 Using Machine Learning: A Case Study

Predicting Student Retention in Smart Learning Environments Using Machine Learning

Use of Data Mining to Identify the Technological Resources that Contribute to School Performance in Large-Scale Evaluations of Brazilian High School

References

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Appendix

Rights and permissions

About this article

Cite this article

Keywords

Subscribe and save

Buy Now

Navigation

Analysis of cloud computing-based education platforms using unsupervised random forest

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Impact Prediction of Online Education During COVID-19 Using Machine Learning: A Case Study

Predicting Student Retention in Smart Learning Environments Using Machine Learning

Use of Data Mining to Identify the Technological Resources that Contribute to School Performance in Large-Scale Evaluations of Brazilian High School

Explore related subjects

References

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Appendix

Appendix

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now

Search

Navigation