skip to main content
research-article

Automatic virtual machine configuration for database workloads

Published: 15 February 2008 Publication History

Abstract

Virtual machine monitors are becoming popular tools for the deployment of database management systems and other enterprise software. In this article, we consider a common resource consolidation scenario in which several database management system instances, each running in a separate virtual machine, are sharing a common pool of physical computing resources. We address the problem of optimizing the performance of these database management systems by controlling the configurations of the virtual machines in which they run. These virtual machine configurations determine how the shared physical resources will be allocated to the different database system instances. We introduce a virtualization design advisor that uses information about the anticipated workloads of each of the database systems to recommend workload-specific configurations offline. Furthermore, runtime information collected after the deployment of the recommended configurations can be used to refine the recommendation and to handle changes in the workload. To estimate the effect of a particular resource allocation on workload performance, we use the query optimizer in a new what-if mode. We have implemented our approach using both PostgreSQL and DB2, and we have experimentally evaluated its effectiveness using DSS and OLTP workloads.

References

[1]
Agrawal, R., Chaudhuri, S., Das, A., and Narasayya, V. R. 2003. Automating layout of relational databases. In Proceedings of the International Conference on Data Engineering (ICDE).
[2]
Barham, P. T., Dragovic, B., Fraser, K., Hand, S., Harris, T. L., Ho, A., Neugebauer, R., Pratt, I., and Warfield, A. 2003. Xen and the art of virtualization. In Proceedings of the ACM Symposium on Operating Systems Principles (SOSP).
[3]
Bennani, M. and Menasce, D. A. 2005. Resource allocation for autonomic data centers using analytic performance models. In Proceedings of the IEEE International Conference on Autonomic Computing (ICAC).
[4]
Carey, M. J., Jauhari, R., and Livny, M. 1989. Priority in DBMS resource scheduling. In Proceedings of the International Conference on Very Large Data Bases (VLDB).
[5]
Clark, C., Fraser, K., Hand, S., Hansen, J. G., Jul, E., Limpach, C., Pratt, I., and Warfield, A. 2005. Live migration of virtual machines. In Proceedings of the Symposium on Networked Systems Design and Implementation (NSDI).
[6]
Dageville, B. and Zaït, M. 2002. SQL memory management in Oracle9i. In Proceedings of the International Conference on Very Large Data Bases (VLDB).
[7]
Davison, D. L. and Graefe, G. 1995. Dynamic resource brokering for multi-user query execution. In Proceedings of the ACM SIGMOD International Conference on Management of Data.
[8]
DBT3. OSDL Database Test Suite 3. http://sourceforge.net/projects/osdldbt.
[9]
Dias, K., Ramacher, M., Shaft, U., Venkataramani, V., and Wood, G. 2005. Automatic performance diagnosis and tuning in Oracle. In Proceedings of the Conference on Innovative Data Systems Research (CIDR).
[10]
Garofalakis, M. N. and Ioannidis, Y. E. 1996. Multi-dimensional resource scheduling for parallel queries. In Proceedings of the ACM SIGMOD International Conference on Management of Data.
[11]
Habib, I. 2008. Virtualization with KVM. Linux J. 2008, 166.
[12]
IBM Corporation. 2006. IBM DB2 v9 performance guide. ftp://ftp.software.ibm.com/ps/products/db2/info/vr9/pdf/letter/en_US/db2d3e90.pdf
[13]
Karve, A., Kimbrel, T., Pacifici, G., Spreitzer, M., Steinder, M., Sviridenko, M., and Tantawi, A. 2006. Dynamic placement for clustered Web applications. In Proceedings of the International Conference on WWW.
[14]
Khanna, G., Beaty, K., Kar, G., and Kochut, A. 2006. Application performance management in virtualized server environments. In Proceedings of the IEEE/IFIP Network Operations and Management Symposium (NOMS).
[15]
Liu, F., Zhao, Y., Wang, W., and Makaroff, D. 2004. Database server workload characterization in an E-Commerce environment. In Proceedings of the IEEE International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems (MASCOTS).
[16]
Llanos, D. R. 2006. TPCC-UVA: An open-source TPC-C implementation for global performance measurement of computer systems. ACM SIGMOD Rec. 35, 4.
[17]
Martin, P., Li, H.-Y., Zheng, M., Romanufa, K., and Powley, W. 2000. Dynamic reconfiguration algorithm: Dynamically tuning multiple buffer pools. In Proceedings of the International Conference on Database and Expert Systems Applications (DEXA).
[18]
Mehta, M. and DeWitt, D. J. 1993. Dynamic memory allocation for multiple-query workloads. In Proceedings of the International Conference on Very Large Data Bases (VLDB).
[19]
Microsoft Corporation. 2008. Microsoft Windows server 2008 R2 hyper-V. http://www.microsoft.com/windowsserver2008/en/us/hyperv-main.aspx.
[20]
Microsoft Corporation. 2009. Virtualization case studies: Consolidation. http://www.microsoft.com/virtualization/casestudies/consolidation/default.mspx.
[21]
Narayanan, D., Thereska, E., and Ailamaki, A. 2005. Continuous resource monitoring for self-predicting DBMS. In Proceedings of the IEEE International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems (MASCOTS).
[22]
Park, S.-M. and Humphrey, M. 2009. Self-Tuning virtual machines for predictable escience. In Proceedings of the IEEE/ACM International Symposium on Cluster Computing and the Grid (CCGRID).
[23]
Rosenblum, M. and Garfinkel, T. 2005. Virtual machine monitors: Current technology and future trends. IEEE Comput. 38, 5.
[24]
Ruth, P., Rhee, J., Xu, D., Kennell, R., and Goasguen, S. 2006. Autonomic live adaptation of virtual computational environments in a multi-domain infrastructure. In Proceedings of the IEEE International Conference on Autonomic Computing (ICAC).
[25]
Shivam, P., Demberel, A., Gunda, P., Irwin, D. E., Grit, L. E., Yumerefendi, A. R., Babu, S., and Chase, J. S. 2007. Automated and on-demand provisioning of virtual machines for database applications. In Proceedings of the ACM SIGMOD International Conference on Management of Data. Demonstration.
[26]
Smith, J. E. and Nair, R. 2005. The architecture of virtual machines. IEEE Comput. 38, 5.
[27]
Soror, A. A., Aboulnaga, A., and Salem, K. 2007. Database virtualization: A new frontier for database tuning and physical design. In Proceedings of the Workshop on Self-Managing Database Systems (SMDB).
[28]
Soror, A. A., Minhas, U. F., Aboulnaga, A., Salem, K., Kokosielis, P., and Kamath, S. 2008. Automatic virtual machine configuration for database workloads. In Proceedings of the ACM SIGMOD International Conference on Management of Data.
[29]
Steinder, M., Whalley, I., and Chess, D. 2008. Server virtualization in autonomic management of heterogeneous workloads. SIGOPS Oper. Syst. Rev. 42, 1.
[30]
Storm, A. J., Garcia-Arellano, C., Lightstone, S., Diao, Y., and Surendra, M. 2006. Adaptive self-tuning memory in DB2. In Proceedings of the International Conference on Very Large Data Bases (VLDB).
[31]
Tang, C., Steinder, M., Spreitzer, M., and Pacifici, G. 2007. A scalable application placement controller for enterprise data centers. In Proceedings of the International Conference on World Wide Web.
[32]
Tesauro, G., Das, R., Walsh, W. E., and Kephart, J. O. 2005. Utility-Function-Driven resource allocation in autonomic systems. In Proceedings of the IEEE International Conference on Autonomic Computing (ICAC).
[33]
VMware. VMware. http://www.vmware.com/.
[34]
Wang, X., Du, Z., Chen, Y., and Li, S. 2008. Virtualization-based autonomic resource management for multi-tier web applications in shared data center. J. Syst. Softw.
[35]
Wang, X., Lan, D., Wang, G., Fang, X., Ye, M., Chen, Y., and Wang, Q. 2007. Appliance-Based autonomic provisioning framework for virtualized outsourcing data center. In Proceedings of the IEEE International Conference on Autonomic Computing (ICAC).
[36]
Watson, J. 2008. Virtualbox: bits and bytes masquerading as machines. Linux J. 2008, 166.
[37]
Weikum, G., Mönkeberg, A., Hasse, C., and Zabback, P. 2002. Self-Tuning database technology and information services: From wishful thinking to viable engineering. In Proceedings of the International Conference on Very Large Data Bases (VLDB).
[38]
XenSource. XenSource. http://www.xensource.com/.
[39]
Yu, P. S., syan Chen, M., ulrich Heiss, H., and Lee, S. 1992. On workload characterization of relational database environments. IEEE Trans. Softw. Engin. 18.

Cited By

View all
  • (2021)SA2-MCD: Secured Architecture for Allocation of Virtual Machine in Multitenant Cloud DatabasesBig Data Research10.1016/j.bdr.2021.10018724(100187)Online publication date: May-2021
  • (2021)Application and Trend with Success Factor Linked to Large Scaled Data: A Case StudyInnovations in Information and Communication Technologies (IICT-2020)10.1007/978-3-030-66218-9_30(265-270)Online publication date: 16-Jul-2021
  • (2020)SGPM: a privacy protected approach of time-constrained graph pattern matching in cloudWorld Wide Web10.1007/s11280-020-00784-0Online publication date: 3-Feb-2020
  • Show More Cited By

Index Terms

  1. Automatic virtual machine configuration for database workloads

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Transactions on Database Systems
    ACM Transactions on Database Systems  Volume 35, Issue 1
    February 2010
    310 pages
    ISSN:0362-5915
    EISSN:1557-4644
    DOI:10.1145/1670243
    Issue’s Table of Contents
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Accepted: 01 October 2009
    Revised: 01 July 2009
    Received: 01 October 2008
    Published: 15 February 2008
    Published in TODS Volume 35, Issue 1

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Virtualization
    2. virtual machine configuration

    Qualifiers

    • Research-article
    • Research
    • Refereed

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)31
    • Downloads (Last 6 weeks)2
    Reflects downloads up to 13 Feb 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2021)SA2-MCD: Secured Architecture for Allocation of Virtual Machine in Multitenant Cloud DatabasesBig Data Research10.1016/j.bdr.2021.10018724(100187)Online publication date: May-2021
    • (2021)Application and Trend with Success Factor Linked to Large Scaled Data: A Case StudyInnovations in Information and Communication Technologies (IICT-2020)10.1007/978-3-030-66218-9_30(265-270)Online publication date: 16-Jul-2021
    • (2020)SGPM: a privacy protected approach of time-constrained graph pattern matching in cloudWorld Wide Web10.1007/s11280-020-00784-0Online publication date: 3-Feb-2020
    • (2019)MgCrabProceedings of the VLDB Endowment10.14778/3303753.330376412:5(597-610)Online publication date: 1-Jan-2019
    • (2019)Decision-Making Approaches for Performance QoS in Distributed Storage Systems: A SurveyIEEE Transactions on Parallel and Distributed Systems10.1109/TPDS.2019.2893940(1-1)Online publication date: 2019
    • (2019)Fulva: Efficient Live Migration for In-Memory Key-Value Stores with Zero Downtime2019 38th Symposium on Reliable Distributed Systems (SRDS)10.1109/SRDS47363.2019.00019(83-8309)Online publication date: Oct-2019
    • (2019)A Novel Auction-Based Query Pricing SchemaInternational Journal of Parallel Programming10.1007/s10766-017-0534-x47:4(759-780)Online publication date: 1-Aug-2019
    • (2016)EnsembleIEEE Transactions on Cloud Computing10.1109/TCC.2015.24696564:1(20-33)Online publication date: 1-Jan-2016
    • (2016)Virtual Network Mapping in Cloud Computing: A Graph Pattern Matching ApproachThe Computer Journal10.1093/comjnl/bxw063Online publication date: 4-Oct-2016
    • (2016)Multi-tier cloud infrastructure support for reliable global health awareness systemSimulation Modelling Practice and Theory10.1016/j.simpat.2016.06.00567(44-58)Online publication date: Sep-2016
    • Show More Cited By

    View Options

    Login options

    Full Access

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media