Abstract
Calibrations and validations of Computational Fluid Dynamics (CFD) applications are significantly time-consuming. To reduce the execution time of the CFD applications, parallel-computing approach is often employed. In addition, high performance computing systems and cloud computing solutions are also appropriate tools to the CFD applications. One of the challenging problems is to schedule tasks on virtualized machines of the cloud-based high performance systems. Instead of employing an adaptive algorithm to cope with the uncertainty of the virtualized resources, in this study, we propose an idea to predict the execution time of Telemac-2D, which is a CFD application. The predicted execution time is very essential in all scheduling algorithms. The application is executed several times with different settings of model’s parameters and allocated resources to produce an experimental dataset. The dataset is then used to predict the execution time of the application by utilizing a machine learning-based approach. The predictive model consists of two steps that classify and predict the execution. The C4.5 algorithm is used to classify the execution ending status whereas Multi-layer Perceptron (MLP) and a mixture of MLPs (MiMLP) are used to predict the execution time. The experiments indicate that the predictive model is appropriate to predict the execution of the Telemac-2D application since the accuracy of the C4.5 algorithm is 100 % and R and MARE of MiMLP are 0.957 and 17.090, respectively.
Keywords
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsNotes
References
Anto, S., et al.: Stochastic based optimal resource provisioning in cloud computing. Int. J. Adv. Res. Comput. Eng. Technol. (IJARCET) 2(2), 670 (2013)
Chaisiri, S., Lee, B.S., Niyato, D.: Optimization of resource provisioning cost in cloud computing. IEEE Trans. Serv. Comput. 5(2), 164–177 (2012)
Doyle, J., Giotsas, V., Anam, M.A., Andreopoulos, Y.: Cloud instance management and resource prediction for computation-as-a-service platforms
Frank, R.: The perceptron: a probabilistic model for information storage and organization in the brain, cornell aeronautical laboratory. Psychol. Rev. 65, 386–408 (1958)
Haykin, S.: Neural Networks and Learning Machines. Pearson International Edition, Upper Saddle River (2009)
Huang, L., Jia, J., Yu, B., Chun, B.G., Maniatis, P., Naik, M.: Predicting execution time of computer programs using sparse polynomial regression. In: Advances in Neural Information Processing Systems, pp. 883–891 (2010)
Ipek, E., de Supinski, B.R., Schulz, M., McKee, S.A.: An approach to performance prediction for parallel applications. In: Cunha, J.C., Medeiros, P.D. (eds.) Euro-Par 2005. LNCS, vol. 3648, pp. 196–205. Springer, Heidelberg (2005)
Jamshed, S.: Using HPC for Computational Fluid Dynamics: A Guide to High Performance Computing for CFD Engineers. Academic Press, Cambridge (2015)
Kasperkiewicz, J., Racz, J., Dubrawski, A.: HPC strength prediction using artificial neural network. J. Comput. Civ. Eng. 9(4), 279–284 (1995)
Matsunaga, A., Fortes, J.A.: On the use of machine learning to predict the time and resources consumed by applications. In: Proceedings of the 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing, pp. 495–504. IEEE Computer Society (2010)
Oberkampf, W.L., Trucano, T.G.: Validation methodology in computational fluid dynamics. AIAA Pap. 2549, 19–22 (2000)
Priya, R., de Souza, B.F., Rossi, A.L., de Carvalho, A.C.: Predicting execution time of machine learning tasks for scheduling. Int. J. Hybrid Intell. Syst. 10(1), 23–32 (2013)
Quinlan, J.R.: Programs for Machine Learning. Morgan Kaufmann Publishers, Burlington (1993)
Quinlan, J.R.: Improved use of continuous attributes in c4.5. J. Artif. Intell. Res. 4, 77–90 (1996)
Trucanoa, T.G., Swiler, L.P., Igusa, T., Oberkampf, W.L., Pilch, M.: Calibration, validation, and sensitivity analysis: whats what. Reliab. Eng. Syst. Saf. 91, 1331–1357 (2006)
Thong, N., Loc, L.X., Tuan, H.D.: Impact assessment of flood diversion of Dong Thap Muoi for the Mekong Delta. Vietnam Water Resour. 9, 3–12 (2015)
Werbos, P.: Beyond regression: new tools for prediction and analysis in the behavioral sciences, PhD. thesis. Harvard University, Cambridge (1974)
Wu, X., Kumar, V., Quinlan, J.R., Ghosh, J., Yang, Q., Motoda, H., McLachlan, G.J., Ng, A., Liu, B., Philip, S.Y., et al.: Top 10 algorithms in data mining. Knowl. Inf. Syst. 14(1), 1–37 (2008)
Acknowledgments
The authors would like to thank Faculty of Computer Science and Engineering, HCMC University of Technology for providing facilities for this study. The applications presented in this paper were tested on the High Performance Computing Center (HPCC) of the faculty. This research was funded by HCMC Department of Science and Technology, under contract number 39/2015/HD-SKHCN.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing AG
About this paper
Cite this paper
Hieu, D.N., Tieu Minh, T., Van Quang, T., Giang, B.X., Van Hoai, T. (2016). A Machine Learning-Based Approach for Predicting the Execution Time of CFD Applications on Cloud Computing Environment. In: Dang, T., Wagner, R., Küng, J., Thoai, N., Takizawa, M., Neuhold, E. (eds) Future Data and Security Engineering. FDSE 2016. Lecture Notes in Computer Science(), vol 10018. Springer, Cham. https://doi.org/10.1007/978-3-319-48057-2_3
Download citation
DOI: https://doi.org/10.1007/978-3-319-48057-2_3
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-48056-5
Online ISBN: 978-3-319-48057-2
eBook Packages: Computer ScienceComputer Science (R0)