Abstract
The enrollment rate of freshmen has always been a headache for colleges and universities. It is also very difficult to accurately predict the number of freshmen before they register. In recent years, deep learning and machine learning technology have made a breakthrough and are widely used in data processing, edge computing, and situation awareness. So far, no researcher has used machine learning methods to forecast the enrollment of new students, because the intuitive feeling is that the registration of a freshman is very subjective, dependent on several factors. To date, the number of freshmen in universities has always been forecasted using traditional methods, that is, phone calls and fee payment status inquiries. On the basis of the historical admission enrollment data of a university, this study used a variety of machine learning methods, including decision tree, random forest, and back propagation neural network, for forecasting. Based on the results of our research, it is possible to predict whether a freshman will register in a university, which makes this work very worthy of further study.
Similar content being viewed by others
References
Saettler A, Laber E, Pereira FDM (2017) Decision tree classification with bounded number of errors. Inf Process Lett 127(1):27–31. https://doi.org/10.1016/j.ipl.2017.06.011
Biau G, Scornet E (2016) A random forest guided tour. Test 25(2):197–227. https://doi.org/10.1007/s11749-016-0481-7
Zhang L, Luo JH, Yang SY (2009) Forecasting box office revenue of movies with BP neural network. Expert Syst Appl 36(3):6580–6587
Sejung P, Han Woo P (2020) A webometric network analysis of electronic word of mouth (eWOM) characteristics and machine learning approach to consumer comments during a crisis. El Profesional de la Informacion 29(5):1–14. https://doi.org/10.3145/epi.2020.sep.16
Pande M, Mulay P (2020) Bibliometric survey of quantum machine learning. Sci Technol Libr 39(4):369–382. https://doi.org/10.1080/0194262x.2020.1776193
Luo F, Guo W, Yu Y, Chen G (2017) A multi-label classification algorithm based on kernel extreme learning machine. Neurocomputing 260:313–320
Huang X, Guo W, Liu G, Chen G (2017) MLXR: multi-layer obstacle-avoiding X-architecture Steiner tree construction for VLSI routing. Sci China Inf Sci 60(1):19102. https://doi.org/10.1007/s11432-015-0850-4
Guo W, Liu G, Chen G, Peng S (2014) A hybrid multi-objective PSO algorithm with local search strategy for VLSI partitioning. Front Comput Sci 8(2):203–216
Xiong L, Zhang D, Li K, Zhang L (2020) The extraction algorithm of color disease spot image based on Otsu and watershed. Soft Comput 24(10):7253–7263. https://doi.org/10.1007/s00500-019-04339-y
Chen CM, Wang KH, Yeh KH, Xiang B, Wu TY (2019) Attacks and solutions on a three-party password-based authenticated key exchange protocol for wireless communications. J Ambient Intell Humaniz Comput 10(8):3133–3142. https://doi.org/10.1007/s12652-018-1029-3
Tian SK, Dai N, Li LL, Li WW, Sun YC, Cheng XS (2020) Three-dimensional mandibular motion trajectory-tracking system based on BP neural network. Math Biosci Eng 17(5):5709–5726
Bonissone P, Cadenas JM, Garrido MC, Díaz-Valladares RA (2010) A fuzzy random forest. Int J Approx Reason 51(7):729–747
Alabdulkarim A, Al-Rodhaan M, Tian YA, I-Dhelaan A (2019) A privacy-preserving algorithm for clinical decision-support systems using random forest. CMC Comput Mater Con 58(1):585–601. https://doi.org/10.32604/cmc.2019.05637
Sylvester EVA, Bentzen P, Bradbury IR, Clement M, Pearce J, Horne J, Beiko RG (2018) Applications of random forest feature selection for fine-scale genetic population assignment. Evol Appl 11(2):153–165
Resende PA, Drummond AC (2018) A Survey of Random Forest Based Methods for Intrusion Detection Systems. ACM Comput Surv 51(3):36
Yu B, Wang HZ, Shan WX, Yao BZ (2018) Prediction of bus travel time using random forests based on near neighbors. Comput-Aided Civ Infrastruct Eng 33(4):333–350
Goh YC, Cai XQ, Theseira W, Ko G, Khor KA (2020) Evaluating human versus machine learning performance in classifying research abstracts. Scientometrics 125(2):1197–1212. https://doi.org/10.1007/s11192-020-03614-2
Niu Y, Chen J, Guo W (2018) Meta-metric for saliency detection evaluation metrics based on application preference. Multimed Tools Appl 77(20):26351–26369
Xiong L, Tang G, Chen Y-C, Hu YX, Chen R-S (2020) Color disease spot image segmentation algorithm based on chaotic particle swarm optimization and FCM. J Supercomput 76(3):8756–8770. https://doi.org/10.1007/s11227-020-03171-8
Zhang L, Wang F, Sun T, Xu B (2018) A constrained optimization method based on BP neural network. Neural Comput Appl 29(2):413–421
Wang J, Qin JH, Xiang XY, Tan Y, Pan N (2019) CAPTCHA recognition based on deep convolutional neural network. Math Biosci Eng 16(5):5851–5861
Luo M, Zhong S, Yao L, Tu W, Nsengiyumva W, Chen W (2020) Thin thermally grown oxide thickness detection in thermal barrier coatings based on SWT-BP neural network algorithm and terahertz technology. Appl Opt 59(13):4097–4104. https://doi.org/10.1364/AO.392748
Han T, Jiang DX, Zhao Q, Wang L, Yin K (2018) Comparison of random forest, artificial neural networks and support vector machine for intelligent diagnosis of rotating machinery. Trans Inst Measurement Control 40(8):2681–2693
Mu YH, Qiu B, Wei SY, Song T, Zheng ZP, Guo P (2019) Regression prediction of photometric redshift based on particle warm optimization neural network algorithm. Spectrosc Spectr Anal 39(9):2693–2697
Zhao G, Zhang Y, Shi Y, Lan H, Yang Q (2019) The application of BP neural networks to analysis the national vulnerability. Comput Mater Continua 58(2):421–436
Pan JS, Hu P, Chu SC (2019) Novel parallel heterogeneous meta-heuristic and its communication strategies for the prediction of wind power. Process 7(11):845. https://doi.org/10.3390/pr7110845
Zhang D, Dongru H, Kang L, Zhang W (2019) The generative adversarial networks and its application in machine vision. Enterp Inf Syst. https://doi.org/10.1080/17517575.2019.1701714
Guo W, Lin B, Chen G, Chen Y, Liang F (2018) Cost-driven scheduling for deadline-based workflow across multiple clouds. IEEE Trans Netw Serv Manag 15(4):1571–1585
Wang S, Guo W (2017) Robust co-clustering via dual local learning and high-order matrix factorization. Knowl-Based Syst 138:176–187
Chen CM, Xiang B, Liu Y, Wang KH (2019) A secure authentication protocol for internet of vehicles. IEEE Access 7:12047–12057
Liu G, Chen Z, Zhuang Z, Guo W, Chen G (2020) A unified algorithm based on HTS and self-adapting PSO for the construction of octagonal and rectilinear SMT. Soft Comput 24(6):3943–3961. https://doi.org/10.1007/s00500-019-04165-2
Pan JS, Lee CY, Sghaier A, Zeghid M, Xie J (2019) Novel systolization of subquadratic space complexity multipliers based on toeplitz matrix-vector product approach. IEEE Trans Very Large Scale Integr (VLSI) Syst 27(7):1614–1622
Guo Y, Du L, Chen J (2019) Max-margin multi-scale convolutional factor analysis model with application to image classification. Expert Syst Appl 133:21–33. https://doi.org/10.1016/j.eswa.2019.04.012
Acknowledgements
This work was supported in part by the National Nature Science Foundation of China under Grants 61872452, 61872451, and 61702365, in part by the Macao FDCT under Grants 0098/2018/A3, 0076/2019/A2, and 0037/2020/A1. Li Feng is the corresponding author.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Yang, L., Feng, L., Zhang, L. et al. Predicting freshmen enrollment based on machine learning. J Supercomput 77, 11853–11865 (2021). https://doi.org/10.1007/s11227-021-03763-y
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11227-021-03763-y