Abstract
One of the most crucial task during the design of a computing infrastructure is the decision about the proper amount of equipments required to handle a specific workload while satisfying a set of performance objectives. This problem is emphasized even more in actual computer infrastructure such as clouds, where the user can provision the resources very easily thanks to the use of virtual machines. If the system has to handle a low workload, resources can be consolidated together to reduce the costs. If however the workload is very high, resources must be replicated to gain an acceptable service level. In this paper we derive the impact on several performance indexes for both consolidation and replication when considering both open and closed workloads. In particular, we present an analytical model to determine the best consolidation or replication options that match given performance objectives specified through a set of constraints. Depending on the particular type of workload and constraints, we present either closed form expressions, heuristics or an iterative algorithm to compute the minimum number of resources required.
Similar content being viewed by others
Notes
The minimum number of jobs is equal to the number of workload classes, because there must be at least one job for each class in order to define a proper model.
In some cases, when the PCs are particularly tight, the actual number of iterations can be larger than \(S\) to account for the random routing considered by the technique, but its complexity is still \(O(S)\).
References
Balbo, G., & Serazzi, G. (1996). Asymptotic analysis of multiclass closed queueing networks: Common bottlenecks. Performance Evaluation, 26(1), 51–72.
Benevenuto, F., Fernandes, C., Santos, M., Almeida, V. A. F., Almeida, J. M., Janakiraman, G. J., et al. (2006). Performance models for virtualized applications. In G. Min, B. D. Martino, L. T. Yang, M. Guo, & G. Rnger (Eds.), ISPA workshops, lecture notes in computer science (Vol. 4331, pp. 427–439). Heidelberg: Springer.
Bennani, M., & Menascé, D. (2005). Resource allocation for autonomic data centers using analytic performance models. Autonomic computing (pp. 229–240). Seattle: ICAC. doi:10.1109/ICAC.2005.50.
Bertoli, M., Casale, G., & Serazzi, G. (2006). Java modelling tools: an open source suite for queueing network modelling and workload analysis. In Proceedings of the 3rd Conference on Quantitative Evaluation of Systems (QEST), IEEE, pp. 119–120.
Bobroff, N., Kochut, A., & Beaty, K. (2007). Dynamic placement of virtual machines for managing sla violations. In 10th IFIP/IEEE International Symposium on Integrated Network Management, IM ’07 (pp. 119–128). doi:10.1109/INM.2007.374776.
Bushehrian, O. (2011). The application of fsp models in automatic optimization of software deployment. ASMTA, pp. 43–54.
Curino, C., Jones, E. P., Madden, S., & Balakrishnan, H. (2011). Workload-aware database monitoring and consolidation. In Proceedings of the 2011 international conference on Management of data, SIGMOD ’11 (pp. 313–324) ACM New York, NY, USA.
Ganapathi, A., Chen, Y., Fox, A., Katz, R., & Patterson, D. (2010). Statistics-driven workload modeling for the cloud. In IEEE 26th International Conference on Data Engineering Workshops (ICDEW) (pp. 87–92) doi:10.1109/ICDEW.2010.5452742.
Gribaudo, M., Piazzolla, P., & Serazzi, G. (2012). Consolidation and replication of vms matching performance objectives. ASMTA, pp. 106–120. doi:10.1007/978-3-642-30782-9_8.
Jackson, J. R. (1963). Jobshop-like queueing systems. Management Science, 10(1), 131–142.
Khanna, G., Beaty, K.A., Kar, G., & Kochut, A. (2006). Application performance management in virtualized server environments. NOMS, pp. 373–381.
Kokkinos, P., Christodoulopoulos, K., Kretsis, A., & Varvarigos, E. (2008). Data consolidation: A task scheduling and data migration technique for grid networks. In Proceedings of the 8th IEEE International Symposium on Cluster Computing and the Grid (pp. 722–727)., Washington, DC, USA: IEEE Computer Society.
Lazowska, E. D., Zahorjan, J., Graham, G. S., & Sevcik, K. C. (1984). Quantitative system performance. Englewood Cliffs: Prentice-Hall.
Menascé, D. A. (2005). Virtualization: Concepts, applications, and performance modeling. International CMG Conference, pp. 407–414.
Menascé, D.A., Bennani, M.N. (2006). Autonomic virtualized environments. In Proceedings of the International Conference on Autonomic and Autonomous Systems, ICAS ’06 (pp. 28). Washington, DC: IEEE Computer Society.
Ongaro, D., Cox, A. L., & Rixner, S. (2008). Scheduling i/o in virtual machine monitors. In Proceedings of the fourth ACM SIGPLAN/SIGOPS international conference on Virtual execution environments, VEE ’08 (pp. 1–10) New York: ACM.
Padala, P., Shin, K. G., Zhu, X., Uysal, M., Wang, Z., Singhal, S., et al. (2007). Adaptive control of virtualized resources in utility computing environments. In Proceedings of the 2nd ACM SIGOPS/EuroSys European Conference on Computer Systems 2007, EuroSys ’07 (pp. 289–302) New York: ACM.
VirtualBox. (2013). http://www.virtualbox.org.
VMware. (2013). http://www.vmware.com.
Watson, B. J., Marwah, M., Gmach, D., Chen, Y., Arlitt, M., & Wang, Z. (2010). Probabilistic performance modeling of virtualized resource allocation. In Proceedings of the 7th international conference on Autonomic computing, ICAC ’10 (pp. 99–108) New York: ACM.
Acknowledgments
This work has been partially supported by the “AWS in Education research grant” from Amazon, and by the “ForgeSDK” project sponsored by Reply S.R.L.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Cerotti, D., Gribaudo, M., Piazzolla, P. et al. Matching performance objectives for open and closed workloads by consolidation and replication. Ann Oper Res 239, 589–612 (2016). https://doi.org/10.1007/s10479-014-1591-9
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10479-014-1591-9