Abstract
Server consolidation is very attractive for cloud computing platforms to improve energy efficiency and resource utilization. Advances in multi-core processors and virtualization technologies have enabled many workloads to be consolidated in a physical server. However, current virtualization technologies do not ensure performance isolation among guest virtual machines, which results in degraded performance due to contention in shared resources along with violation of service level agreement (SLA) of the cloud service. In that sense, minimizing performance interference among co-located virtual machines is the key factor of successful server consolidation policy in the cloud computing platforms. In this work, we propose a performance model that considers interferences in the shared last-level cache and memory bus. Our performance interference model can estimate how much an application will hurt others and how much an application will suffer from others. We also present a virtual machine consolidation method called swim which is based on our interference model. Experimental results show that the average performance degradation ratio by swim is comparable to the optimal allocation.
Similar content being viewed by others
Notes
The number of LLC references is the same as the number of L2 misses.
References
Intel Core i7-950 Nehalem architecture processor. http://goo.gl/ZYvhO
KVM (Kernel-based Virtual Machine). http://www.linux-kvm.org/
VMware. http://www.vmware.com/
Barham P, Dragovic B, Fraser K, Hand S, Harris T, Ho A, Neugebauer R, Pratt I, Warfield A (2003) Xen and the art of virtualization. In: Proceedings of SOSP ’03, New York, NY, USA
Cho S, Jin L (2006) Managing distributed, shared l2 caches through os-level page allocation. In: Proceedings of the 39th annual IEEE/ACM international symposium on microarchitecture, MICRO ’39. IEEE Comput Soc, Washington, pp 455–468
Citrix: XenServer. http://www.citrix.com/
David H, Fallin C, Gorbatov E, Hanebutte UR, Mutlu O (2011) Memory power management via dynamic voltage/frequency scaling. In: Proceedings of the 8th ACM international conference on autonomic computing, ICAC ’11. ACM, New York, pp 31–40
Govindan S, Liu J, Kansal A, Sivasubramaniam A (2011) Cuanta: quantifying effects of shared on-chip resource interference for consolidated virtual machines. In: Proceedings of the 2nd ACM Symposium on Cloud Computing (SOCC)
Gupta D, Cherkasova L, Gardner R, Vahdat A (2006) Enforcing performance isolation across virtual machines in xen. In: Proceedings of the ACM/IFIP/USENIX 2006 international conference on middleware, Melbourne, Australia
Intel and VMware: Intelligent queueing technologies for virtualization. http://www.vmware.com/files/pdf/partners/intel/vmdq-white-paper-wp.pdf
Intel Corporation: Single-chip Cloud Computer. http://techresearch.intel.com/ProjectDetails.aspx?Id=1
Jaleel A, Borch E, Bhandaru M, Steely SC Jr., Emer J (2010) Achieving non-inclusive cache performance with inclusive caches: temporal locality aware (tla) cache management policies. In: Proceedings of the 2010 43rd annual IEEE/ACM international symposium on microarchitecture, MICRO ’43, pp 151–162
Mars J, Tang L, Hundt R, Skadron K, Soffa ML (2011) Bubble-up: increasing sensible co-locations for improved utilization in modern warehouse scale computers. In: Proceedings of the 44th annual IEEE/ACM international symposium on microarchitecture (MICRO)
Microsoft: Hyper-V. http://www.microsoft.com/server-cloud/
Mutlu O, Moscibroda T (2008) Parallelism-aware batch scheduling: enhancing both performance and fairness of shared dram systems. In: Proceedings of the 35th annual international symposium on computer architecture, ISCA ’08. IEEE Comput Soc, Washington, pp 63–74
Nesbit KJ, Laudon J, Smith JE (2007) Virtual private caches. In: Proceedings of the 34th annual international symposium on computer architecture, ISCA ’07. ACM, New York, pp 57–68
Qureshi MK, Jaleel A, Patt YN, Steely SC, Emer J (2007) Adaptive insertion policies for high performance caching. In: Proceedings of the 34th annual international symposium on computer architecture, ISCA ’07, pp 381–391
Qureshi MK, Patt YN (2006) Utility-based cache partitioning: a low-overhead, high-performance, runtime mechanism to partition shared caches. In: Proceedings of the 39th annual IEEE/ACM international symposium on microarchitecture, Orlando, FL, USA
Schuff DL, Kulkarni M, Pai VS (2010) Accelerating multicore reuse distance analysis with sampling and parallelization. In: Proceedings of the 19th international conference on parallel architectures and compilation techniques, PACT ’10, pp 53–64
Smaragdakis Y, Kaplan S, Wilson P (1999) Eelru: simple and effective adaptive page replacement. In: Proceedings of the 1999 ACM SIGMETRICS international conference on measurement and modeling of computer systems, SIGMETRICS ’99, pp 122–133
Standard Performance Evaluation Corporation: SPEC CPU2006. http://www.spec.org/cpu2006/
Tam DK, Azimi R, Soares LB, Stumm M (2009) Rapidmrc: approximating l2 miss rate curves on commodity systems for online optimizations. In: Proceedings of the 14th international conference on architectural support for programming languages and operating systems, ASPLOS ’09. ACM, New York, pp 121–132
Tilera Corporation: TILE-Gx Processor Family. http://www.tilera.com/products/processors/TILE-Gx_Family
Zhang J, Sivasubramaniam A, Wang Q, Riska A, Riedel E (2006) Storage performance virtualization via throughput and latency control. Transf Storage 2:283–308
Zhang X, Dwarkadas S, Shen K (2009) Towards practical page coloring-based multicore cache management. In: Proceedings of the 4th ACM European conference on computer systems (EuroSys)
Zhuravlev S, Blagodurov S, Fedorova A (2010) Addressing shared resource contention in multicore processors via scheduling. In: Proceedings of the fifteenth edition of ASPLOS on architectural support for programming languages and operating systems, ASPLOS ’10. ACM, New York, pp 129–142
Acknowledgements
This research was supported by Next-Generation Information Computing Development Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education, Science and Technology. (No. 2010-0020731) The ICT at Seoul National University provided research facilities for this study.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Kim, Sg., Eom, H. & Yeom, H.Y. Virtual machine consolidation based on interference modeling. J Supercomput 66, 1489–1506 (2013). https://doi.org/10.1007/s11227-013-0939-2
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11227-013-0939-2