Abstract
Server load prediction can be utilized for load-balancing and load-sharing in distributed systems. The use of machine learning (ML) algorithms for load estimation in distributed system applications can increase the availability and performance of servers. Hence, a number of machine learning algorithms have been applied thus far for server load estimation. This study focuses on increasing the performance of game servers by accurately predicting the workload of game servers in short, medium and long term prediction situations. While doing this, various machine learning techniques have been applied and the algorithms that give the best results are presented. In terms of implementation, companies using their servers and data centers can try to increase their level of satisfaction by using these algorithms. A prediction model is developed and the estimation performances of a number of fundamental ML methods i.e., Naïve Bayes (NB), Generalized Linear Model (GLM), Logistic Regression (LR), Decision Tree (DT), Random Forest (RF), Gradient Boosted Trees (GBT), Support Vector Machine (SVM), Fast Large Margin (FLM), Convolutional Neural Network CNN are analyzed. The data used during the training stage is obtained by listening to the TCP/IP packet traffic and the real-data is extracted by performing an extensive analysis of the total transferred-data that includes also the payload. In the analysis phase, the goodput is considered in order to reveal exact resource requirements. Comprehensive simulations are performed under various conditions for high accuracy performance analysis. Experimental results indicate that the proposed ML-based prediction shows promising performance in terms of load prediction when compared to the common approaches present in the literature.
Similar content being viewed by others
References
Bolboacă SD, Jäntschi L, Sestraş AF, Sestraş RE, Pamfil DC (2011) Pearson-fisher chi-square statistic revisited 2:528-545. ISSN 2078-2489
Cao R, Yu Z, Marbach T, Li J, Wang G, Liu X (2018) Load prediction for data centers based on database service. In: 42nd IEEE International Conference on Computer Software & Applications
Chee JD (2015) Pearson's product-moment correlation:sample analysis. https://www.researchgate.net/publication/277324930_Pearson's_Product-Moment_Correlation_Sample_Analysis
Chen T, Guestrin C (2016) Xgboost: A scalable tree boosting system. KDD:785–794
Davis I, Hemmati H, Holt RC, Godfrey MW, Neuse D, Mankovskii S (May 2013) Storm prediction in a cloud. In: Proceedings of the 2013 5th International Workshop on Principles of Engineering Service-Oriented Systems, PESOS 2013, pp. 37–40
Engineering and Operations in the Bell System (1983) AT&T Bell; 2nd edition. ISBN: 978-0932764041
Fan RE, Chang KW, Hsieh CJ, Wang XR, Lin CJ (2008) Liblinear: A library for large linear classification. J Mach Learn Res 9:1871–1874
He K, Zhang X, Ren S, Sun J (2015) Deep residual learning for image recognition. https://arxiv.org/abs/1512.03385
Hopfield JJ (April 1982) Neural networks and physical systems with emergent collective computational abilities. Proc Natl Acad Sci USA 79:2554–2558
Hotelling H (1933) Analysis of a complex of statistical variables into principal components. J Educ Psychol 24:417–441
https://datatracker.ietf.org/meeting/87/materials/slides-87-tsvarea-1, 2013 https://doi.org/10.1109/ICCDA.2010.5541172, 2010
https://openai.com/, 2015
https://www.computerscience.gcse.guru/theory/von-neumann-architecture, 2015
https://www.ibm.com/ibm/history/ibm100/us/en/icons/deepblue/, 1997
Islam S, Keung J, Lee K, Liu A (2012) Empirical prediction models for adaptive resource provisioning in the cloud. Futur Gener Comput Syst 28(1):155–162
Jamel M, Selim SZ (1994) "The construction of decision tree", 61:177–188
Jensen RR, Hardin PJ, Yu G (2009) Artificial Neural Networks and Remote Sensing. https://onlinelibrary.wiley.com/doi/full/10.1111/j.1749-8198.2008.00215.x
Kassambara A (2017) http://www.sthda.com/english/articles/31-principal-component-methods-in-r-practical-guide/112-pca-principal-component-analysis-essentials/
Krizhevsky A, Sutskever I, Hinton GE (2012) ImageNet Classification with Deep Convolutional. Neural Networks
Krizhevsky A, Sutskever I, Hinton GE (2012) Imagenet classification with deep convolutional neural networks. NIPS
Le Cun BB, Denker JS, Henderson D, Howard RE, Hub-bard W, Jackel LD (1990) Handwritten digit recognition with a back-propagation network. In: NIPS
Li L, Zhang X (2010) Study of data mining algorithm based on decision tree. International Conference On Computer Design and Applications. https://doi.org/10.1109/ICCDA.2010.5541172
Liaw A, Wiener M (2002) Classification and regression by random Forest. R News 2(3):1820
Luo B, Ye S-w (2005) Server performance prediction using recurrent neural network. Comput Eng Des 8:57
McCulloch WS, Pitts WH (1943) A logical calculus of the ideas immanent in nervous activity
Pollack A (1992) https://www.nytimes.com/1992/06/05/business/fifth-generation-became-japan-s-lost-generation.html
Rodriguez J (2016) https://towardsdatascience.com/deepmind-quietly-open-sourced-three-new-impressive-reinforcement-learning-frameworks-f99443910b16
Rosenblatt F (1958) The Perceptron: A probabilistic model for information storage and organization in the brain. Cornell Aeronautical Laboratory, Vol. 6:66
Rumelhart DE, Hinton GE, Williams RJ (October 1986) Learning representations by back-propagating errors. Institute for Cognitive Science, C-015, University of California, San Diego, La Jolla, California 92093, USA. Nature Vol 323–329
Samuel AL (1959) Some studies in machine learning using the game of checkers
Sankesara H (2019) Introducing Symmetry in Segmentation. https://towardsdatascience.com/u-net-b229b32b4a71
Song B, Yu Y, Zhou Y, Wang Z, Du S (2017) Host load prediction with long short-term memory in cloud computing. J Supercomput:1–15
Sverdlik Y (2016) Delta: Data center outage cost us $150m. http://www.datacenterknowledge.com/archives/2016/09/08/delta-data-center-outage-cost-us-150m
Taigman Y, Yang M, Ranzato M, Wolf L (2014) DeepFace: Closing the gap to human-level performance in face verification. CVPR
Turing AM (1950) Computing Machinery and Intelligence. Mind 49:433–460
Weckworth J (2013) The lack of transparency can be seen as a rootcause of outages and incidents. https://journal.uptimeinstitute.com/data-center-outages-incidents-industry-transparency/
Yu Y, Jindal V, Yen I, Bastani F (June 2016) Integrating clustering and learning for improved workload prediction in the cloud. In: Proceedings of the 2016 IEEE 9th International Conference on Cloud Computing (CLOUD). San Francisco, Californa, USA, pp 876–879
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Özer, Ç., Çevik, T. & Gürhanlı, A. A machine learning-based framework for predicting game server load. Multimed Tools Appl 80, 9527–9546 (2021). https://doi.org/10.1007/s11042-020-10067-5
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-020-10067-5