research-article

An End-to-End Automatic Cloud Database Tuning System Using Deep Reinforcement Learning

Authors:

Tianheng Cheng,

Zekang LiAuthors Info & Claims

SIGMOD '19: Proceedings of the 2019 International Conference on Management of Data

Pages 415 - 432

https://doi.org/10.1145/3299869.3300085

Published: 25 June 2019 Publication History

Abstract

Configuration tuning is vital to optimize the performance of database management system (DBMS). It becomes more tedious and urgent for cloud databases (CDB) due to the diverse database instances and query workloads, which make the database administrator (DBA) incompetent. Although there are some studies on automatic DBMS configuration tuning, they have several limitations. Firstly, they adopt a pipelined learning model but cannot optimize the overall performance in an end-to-end manner. Secondly, they rely on large-scale high-quality training samples which are hard to obtain. Thirdly, there are a large number of knobs that are in continuous space and have unseen dependencies, and they cannot recommend reasonable configurations in such high-dimensional continuous space. Lastly, in cloud environment, they can hardly cope with the changes of hardware configurations and workloads, and have poor adaptability. To address these challenges, we design an end-to-end automatic CDB tuning system, CDBTune, using deep reinforcement learning (RL). CDBTune utilizes the deep deterministic policy gradient method to find the optimal configurations in high-dimensional continuous space. CDBTune adopts a try-and-error strategy to learn knob settings with a limited number of samples to accomplish the initial training, which alleviates the difficulty of collecting massive high-quality samples. CDBTune adopts the reward-feedback mechanism in RL instead of traditional regression, which enables end-to-end learning and accelerates the convergence speed of our model and improves efficiency of online tuning. We conducted extensive experiments under 6 different workloads on real cloud databases to demonstrate the superiority of CDBTune. Experimental results showed that CDBTune had a good adaptability and significantly outperformed the state-of-the-art tuning tools and DBA experts.

References

[1]

Sanjay Agrawal, Nicolas Bruno, Surajit Chaudhuri, and Vivek R Narasayya. 2006. AutoAdmin: Self-Tuning Database SystemsTechnology. IEEE Data Eng. Bull., Vol. 29, 3 (2006), 7--15.

[2]

Sanjay Agrawal, Surajit Chaudhuri, Lubor Kollar, Arun Marathe, Vivek Narasayya, and Manoj Syamala. 2005. Database tuning advisor for microsoft sql server 2005. In ACM SIGMOD. ACM, 930--932.

Digital Library

[3]

Sanjay Agrawal, Vivek Narasayya, and Beverly Yang. 2004. Integrating vertical and horizontal partitioning into automated physical database design. In ACM SIGMOD. ACM, 359--370.

Digital Library

[4]

Dana Van Aken, Andrew Pavlo, Geoffrey J. Gordon, and Bohan Zhang. 2017. Automatic Database Management System Tuning Through Large-scale Machine Learning. In ACM SIGMOD . 1009--1024.

Digital Library

[5]

Debabrota Basu, Qian Lin, Hoang Tam Vo, Hoang Tam Vo, Zihong Yuan, and Pierre Senellart. 2016. Regularized Cost-Model Oblivious Database Tuning with Reinforcement Learning .Springer Berlin Heidelberg. 96--132 pages.

Digital Library

[6]

Peter Belknap, Benoit Dageville, Karl Dias, and Khaled Yagoub. 2009. Self-tuning for SQL performance in Oracle database 11g. In ICDE. IEEE, 1694--1700.

Digital Library

[7]

Phil Bernstein et almbox. 1998. The Asilomar report on database research. ACM Sigmod record, Vol. 27, 4 (1998), 74--80.

Digital Library

[8]

Nicolas Bruno and Surajit Chaudhuri. 2005. Automatic physical database tuning: a relaxation-based approach. In ACM SIGMOD. ACM, 227--238.

Digital Library

[9]

Surajit Chaudhuri and Vivek Narasayya. 1998. AutoAdmin "what-if" index analysis utility. In ACM SIGMOD. 367--378.

Digital Library

[10]

Surajit Chaudhuri and Vivek Narasayya. 2007. Self-Tuning Database Systems: A Decade of Progress. In VLDB. 3--14.

Digital Library

[11]

Surajit Chaudhuri and Gerhard Weikum. 2000. Rethinking Database System Architecture: Towards a Self-Tuning RISC-Style Database System. In VLDB. 1--10.

Digital Library

[12]

Biplob K Debnath, David J Lilja, and Mohamed F Mokbel. 2008. SARD: A statistical approach for ranking database tuning parameters. In ICDEW. IEEE, 11--18.

Digital Library

[13]

Karl Dias, Mark Ramacher, Uri Shaft, Venkateshwaran Venkataramani, and Graham Wood. 2005. Automatic Performance Diagnosis and Tuning in Oracle. In CIDR. 84--94.

[14]

Songyun Duan, Vamsidhar Thummala, and Shivnath Babu. 2009. Tuning database configuration parameters with iTuned. VLDB Endowment, Vol. 2, 1 (2009), 1246--1257.

Digital Library

[15]

Yoav Goldberg. 2015. A Primer on Neural Network Models for Natural Language Processing. Computer Science (2015).

[16]

Goetz Graefe and Harumi A. Kuno. 2010. Self-selecting, self-tuning, incrementally optimized indexes. In EDBT . 371--381.

Digital Library

[17]

Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep Residual Learning for Image Recognition. In CVPR. 770--778.

[18]

G. E. Hinton and R. R. Salakhutdinov. 2006. Reducing the dimensionality of data with neural networks. Science, Vol. 313, 5786 (2006), 504--507.

[19]

Stratos Idreos, Martin L. Kersten, and Stefan Manegold. 2007. Database Cracking. In CIDR. 68--78.

[20]

Stratos Idreos, Martin L. Kersten, and Stefan Manegold. 2009. Self-organizing tuple reconstruction in column-stores. In ACM SIGMOD . 297--308.

[21]

Stratos Idreos, Stefan Manegold, Harumi A. Kuno, and Goetz Graefe. 2011. Merging What's Cracked, Cracking What's Merged: Adaptive Indexing in Main-Memory Column-Stores. PVLDB, Vol. 4, 9 (2011), 585--597.

[22]

Stratos Idreos, Kostas Zoumpatianos, Brian Hentschel, Michael S. Kester, and Demi Guo. 2018. The Data Calculator: Data Structure Design and Cost Synthesis from First Principles and Learned Cost Models. In ACM SIGMOD. 535--550.

[23]

Ihab F Ilyas, Volker Markl, Peter Haas, Paul Brown, and Ashraf Aboulnaga. 2004. CORDS: automatic discovery of correlations and soft functional dependencies. In ACM SIGMOD. ACM, 647--658.

Digital Library

[24]

Alex Krizhevsky, Ilya Sutskever, and Geoffrey E. Hinton. 2012. ImageNet classification with deep convolutional neural networks. In NIPS . 1097--1105.

Digital Library

[25]

Sushil Kumar. 2003. Oracle database 10g: The self-managing database.

[26]

Eva Kwan, Sam Lightstone, Adam Storm, and Leanne Wu. 2002. Automatic configuration for IBM DB2 universal database. In Proc. of IBM Perf Technical Report .

[27]

Yann Lecun, Yoshua Bengio, and Geoffrey Hinton. 2015. Deep learning. Nature, Vol. 521, 7553 (2015), 436.

[28]

Sam S Lightstone and Bishwaranjan Bhattacharjee. 2004. Automated design of multidimensional clustering tables for relational databases. In VLDB. VLDB, 1170--1181.

Digital Library

[29]

Timothy P. Lillicrap, Jonathan J. Hunt, Alexander Pritzel, Nicolas Heess, Tom Erez, Yuval Tassa, David Silver, and Daan Wierstra. 2015. Continuous control with deep reinforcement learning. CoRR, Vol. abs/1509.02971 (2015).

[30]

Vasilis Maglogiannis, Dries Naudts, Adnan Shahid, and Ingrid Moerman. 2018. A Q-Learning Scheme for Fair Coexistence Between LTE and Wi-Fi in Unlicensed Spectrum. IEEE Access, Vol. 6 (2018), 27278--27293.

[31]

Ryan Marcus and Olga Papaemmanouil. 2018. Deep reinforcement learning for join order enumeration. arXiv preprint arXiv:1803.00055 (2018).

Digital Library

[32]

Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Alex Graves, Ioannis Antonoglou, Daan Wierstra, and Martin A. Riedmiller. 2013. Playing Atari with Deep Reinforcement Learning. CoRR, Vol. abs/1312.5602 (2013).

[33]

Dushyanth Narayanan, Eno Thereska, and Anastassia Ailamaki. 2005. Continuous resource monitoring for self-predicting DBMS. In null. IEEE, 239--248.

Digital Library

[34]

Jennifer Ortiz, Magdalena Balazinska, Johannes Gehrke, and S Sathiya Keerthi. 2018. Learning State Representations for Query Optimization with Deep Reinforcement Learning. arXiv preprint arXiv:1803.08604 (2018).

[35]

Andrew Pavlo, Gustavo Angulo, Joy Arulraj, Haibin Lin, Jiexi Lin, Lin Ma, Prashanth Menon, Todd C Mowry, Matthew Perron, Ian Quah, et almbox. 2017. Self-Driving Database Management Systems. In CIDR .

[36]

Jun Rao, Chun Zhang, Nimrod Megiddo, and Guy Lohman. 2002. Automating physical database design in a parallel database. In ACM SIGMOD. ACM, 558--569.

Digital Library

[37]

Stefan Richter, Jorge-Arnulfo Quiané -Ruiz, Stefan Schuh, and Jens Dittrich. 2014. Towards zero-overhead static and adaptive indexing in Hadoop. VLDB J., Vol. 23, 3 (2014), 469--494.

Digital Library

[38]

Tom Schaul, John Quan, Ioannis Antonoglou, and David Silver. 2015. Prioritized Experience Replay. Computer Science (2015).

[39]

Felix Martin Schuhknecht, Alekh Jindal, and Jens Dittrich. 2013. The Uncracked Pieces in Database Cracking. PVLDB, Vol. 7, 2 (2013), 97--108.

Digital Library

[40]

Ankur Sharma, Felix Martin Schuhknecht, and Jens Dittrich. 2018. The Case for Automatic Database Administration using Deep Reinforcement Learning. (2018).

[41]

Adam J Storm, Christian Garcia-Arellano, Sam S Lightstone, Yixin Diao, and Maheswaran Surendra. 2006. Adaptive self-tuning memory in DB2. In VLDB. VLDB, 1081--1092.

Digital Library

[42]

David G Sullivan, Margo I Seltzer, and Avi Pfeffer. 2004. Using probabilistic reasoning to automate software tuning. Vol. 32. ACM.

Digital Library

[43]

R S Sutton and A G Barto. 2005. Reinforcement Learning: An Introduction, Bradford Book. IEEE Transactions on Neural Networks, Vol. 16, 1 (2005), 285--286.

Digital Library

[44]

Richard S Sutton and Andrew G Barto. 2011. Reinforcement learning: An introduction. (2011).

Digital Library

[45]

Wenhu Tian, Pat Martin, and Wendy Powley. 2003. Techniques for automatically sizing multiple buffer pools in DB2. In Proceedings of the 2003 conference of the Centre for Advanced Studies on Collaborative research . IBM Press, 294--302.

Digital Library

[46]

Dinh Nguyen Tran, Phung Chinh Huynh, Yong C Tay, and Anthony KH Tung. 2008. A new approach to dynamic self-tuning of database buffers. TOS, Vol. 4, 1 (2008), 3.

Digital Library

[47]

Kostas Tzoumas, Timos Sellis, and Christian S Jensen. 2008. A reinforcement learning approach for adaptive query processing. History (2008).

[48]

Linnan Wang, Jinmian Ye, Yiyang Zhao, Wei Wu, Ang Li, Shuaiwen Leon Song, Zenglin Xu, and Tim Kraska. 2018. SuperNeurons: Dynamic GPU Memory Management for Training Deep Neural Networks. (2018).

[49]

Wei Wang, Meihui Zhang, Gang Chen, HV Jagadish, Beng Chin Ooi, and Kian-Lee Tan. 2016. Database meets deep learning: Challenges and opportunities. ACM SIGMOD Record, Vol. 45, 2 (2016), 17--22.

Digital Library

[50]

Gerhard Weikum, Christof Hasse, Axel Mönkeberg, and Peter Zabback. 1994. The COMFORT automatic tuning project. Information systems, Vol. 19, 5 (1994), 381--432.

Digital Library

[51]

Gerhard Weikum, Axel Moenkeberg, Christof Hasse, and Peter Zabback. 2002. Self-tuning database technology and information services: from wishful thinking to viable engineering. In VLDB. Elsevier, 20--31.

Digital Library

[52]

David Wiese, Gennadi Rabinovitch, Michael Reichert, and Stephan Arenswald. 2008. Autonomic tuning expert: a framework for best-practice oriented autonomic database tuning. In 2008 conference of the center for advanced studies on collaborative research: meeting of minds. ACM, 3.

Digital Library

[53]

Khaled Yagoub, Peter Belknap, Benoit Dageville, Karl Dias, Shantanu Joshi, and Hailing Yu. 2008. Oracle's SQL Performance Analyzer. IEEE Data Eng. Bull., Vol. 31, 1 (2008), 51--58.

[54]

Dong Young Yoon, Ning Niu, and Barzan Mozafari. 2016. Dbsherlock: A performance diagnostic tool for transactional databases. In ACM SIGMOD. ACM, 1599--1614.

Digital Library

[55]

Yuqing Zhu, Jianxun Liu, Mengying Guo, Yungang Bao, Wenlong Ma, Zhuoyue Liu, Kunpeng Song, and Yingchun Yang. 2017. Bestconfig: tapping the performance potential of systems via automatic configuration tuning. In SoCC. ACM, 338--350.

Digital Library

[56]

Daniel C Zilio. 1998. Physical Database Design Decision Algorithms and Concurrent Reorganization for Parallel Database Systems.

[57]

Daniel C Zilio, Jun Rao, Sam Lightstone, Guy Lohman, Adam Storm, Christian Garcia-Arellano, and Scott Fadden. 2004. DB2 design advisor: integrated automatic physical database design. In VLDB. VLDB, 1087--1097.

Digital Library

Cited By

Lai YZheng PJi CLi YZhang SZhang RWang ZDu Y(2025)Centrum: Model-based Database Auto-tuning with Minimal Distributional AssumptionsProceedings of the ACM on Management of Data10.1145/37096713:1(1-26)Online publication date: 11-Feb-2025
https://dl.acm.org/doi/10.1145/3709671
Chen SFan JWu BTang NDeng CWang PLi YTan JLi FZhou JDu X(2025)Automatic Database Configuration Debugging using Retrieval-Augmented Language ModelsProceedings of the ACM on Management of Data10.1145/37096633:1(1-27)Online publication date: 11-Feb-2025
https://dl.acm.org/doi/10.1145/3709663
Li YBao LHuang KWu C(2025)CSAT: Configuration structure-aware tuning for highly configurable software systemsJournal of Systems and Software10.1016/j.jss.2024.112316222(112316)Online publication date: Apr-2025
https://doi.org/10.1016/j.jss.2024.112316
Show More Cited By

Index Terms

An End-to-End Automatic Cloud Database Tuning System Using Deep Reinforcement Learning
1. Information systems
  1. Data management systems
    1. Database administration

Recommendations

Automatic Database Management System Tuning Through Large-scale Machine Learning
SIGMOD '17: Proceedings of the 2017 ACM International Conference on Management of Data

Database management system (DBMS) configuration tuning is an essential aspect of any data-intensive application effort. But this is historically a difficult task because DBMSs have hundreds of configuration "knobs" that control everything in the system, ...
${CDBTune}^{+}$ : An efficient deep reinforcement learning-based automatic cloud database tuning system
Abstract
Configuration tuning is vital to optimize the performance of a database management system (DBMS). It becomes more tedious and urgent for cloud databases (CDB) due to diverse database instances and query workloads, which make the job of a database ... $^{}$ $^{}$ $^{}$ $^{}$ $^{}$ $^{}$ $^{}$
ADSTS: Automatic Distributed Storage Tuning System Using Deep Reinforcement Learning
ICPP '22: Proceedings of the 51st International Conference on Parallel Processing

Modern distributed storage systems with the immense number of configurations, unpredictable workloads and difficult performance evaluation pose higher requirements to parameter tuning. Providing an automatic parameter tuning solution for distributed ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

SIGMOD '19: Proceedings of the 2019 International Conference on Management of Data

June 2019

2106 pages

ISBN:9781450356435

DOI:10.1145/3299869

General Chairs:
Peter Boncz
CWI & Vrije Universiteit Amsterdam, The Netherlands
,
Stefan Manegold
CWI & Universiteit Leiden, The Netherlands
,
Program Chairs:
Anastasia Ailamaki
EPFL, Switzerland
,
Amol Deshpande
University of Maryland, USA
,
Tim Kraska
MIT, USA

Copyright © 2019 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

SIGMOD: ACM Special Interest Group on Management of Data

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 25 June 2019

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

National Natural Science Foundation of China

Conference

SIGMOD/PODS '19

Sponsor:

SIGMOD

SIGMOD/PODS '19: International Conference on Management of Data

June 30 - July 5, 2019

Amsterdam, Netherlands

Acceptance Rates

SIGMOD '19 Paper Acceptance Rate 88 of 430 submissions, 20%;

Overall Acceptance Rate 785 of 4,003 submissions, 20%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

213
Total Citations
View Citations
3,215
Total Downloads

Downloads (Last 12 months)337
Downloads (Last 6 weeks)34

Reflects downloads up to 27 Feb 2025

Other Metrics

View Author Metrics

Citations

Cited By

Lai YZheng PJi CLi YZhang SZhang RWang ZDu Y(2025)Centrum: Model-based Database Auto-tuning with Minimal Distributional AssumptionsProceedings of the ACM on Management of Data10.1145/37096713:1(1-26)Online publication date: 11-Feb-2025
https://dl.acm.org/doi/10.1145/3709671
Chen SFan JWu BTang NDeng CWang PLi YTan JLi FZhou JDu X(2025)Automatic Database Configuration Debugging using Retrieval-Augmented Language ModelsProceedings of the ACM on Management of Data10.1145/37096633:1(1-27)Online publication date: 11-Feb-2025
https://dl.acm.org/doi/10.1145/3709663
Li YBao LHuang KWu C(2025)CSAT: Configuration structure-aware tuning for highly configurable software systemsJournal of Systems and Software10.1016/j.jss.2024.112316222(112316)Online publication date: Apr-2025
https://doi.org/10.1016/j.jss.2024.112316
Pei YZhu MZhu CSong WSun YLi LZhu H(2025)Meta Reinforcement Learning Based Dynamic Tuning for Blockchain Systems in Diverse Network EnvironmentsBlockchain: Research and Applications10.1016/j.bcra.2024.100261(100261)Online publication date: Jan-2025
https://doi.org/10.1016/j.bcra.2024.100261
Li CWang JShi JLiu LZhang S(2025)ADWTune: an adaptive dynamic workload tuning system with deep reinforcement learningComplex & Intelligent Systems10.1007/s40747-025-01801-311:4Online publication date: 28-Feb-2025
https://doi.org/10.1007/s40747-025-01801-3
Somashekar GTandon KKini AChang CHusak PBhagwan RDas MGandhi ANatarajan NVanbever LZhang I(2024)OPPerTuneProceedings of the 21st USENIX Symposium on Networked Systems Design and Implementation10.5555/3691825.3691886(1101-1120)Online publication date: 16-Apr-2024
https://dl.acm.org/doi/10.5555/3691825.3691886
Cuzzocrea ACiancarini P(2024)Serendipitous, Open Big Data Management and Analytics: The SeDaSOMA FrameworkModelling10.3390/modelling50300615:3(1173-1196)Online publication date: 4-Sep-2024
https://doi.org/10.3390/modelling5030061
Kroth BMatusevych SAlotaibi RZhu YGruenheid ATian Y(2024)MLOS in Action: Bridging the Gap Between Experimentation and Auto-Tuning in the CloudProceedings of the VLDB Endowment10.14778/3685800.368585217:12(4269-4272)Online publication date: 8-Nov-2024
https://doi.org/10.14778/3685800.3685852
Bianchi AChai ACorvinelli VGodfrey PSzlichta JZuzarte C(2024)Db2une: Tuning Under Pressure via Deep LearningProceedings of the VLDB Endowment10.14778/3685800.368581117:12(3855-3868)Online publication date: 8-Nov-2024
https://dl.acm.org/doi/10.14778/3685800.3685811
Li GTian WZhang JGrosman RLiu ZLi S(2024)GaussDB: A Cloud-Native Multi-Primary Database with Compute-Memory-Storage DisaggregationProceedings of the VLDB Endowment10.14778/3685800.368580617:12(3786-3798)Online publication date: 8-Nov-2024
https://dl.acm.org/doi/10.14778/3685800.3685806
Show More Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Figures

Tables

Media

View Table of Conten