skip to main content
10.1145/3299869.3300085acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
research-article

An End-to-End Automatic Cloud Database Tuning System Using Deep Reinforcement Learning

Published: 25 June 2019 Publication History

Abstract

Configuration tuning is vital to optimize the performance of database management system (DBMS). It becomes more tedious and urgent for cloud databases (CDB) due to the diverse database instances and query workloads, which make the database administrator (DBA) incompetent. Although there are some studies on automatic DBMS configuration tuning, they have several limitations. Firstly, they adopt a pipelined learning model but cannot optimize the overall performance in an end-to-end manner. Secondly, they rely on large-scale high-quality training samples which are hard to obtain. Thirdly, there are a large number of knobs that are in continuous space and have unseen dependencies, and they cannot recommend reasonable configurations in such high-dimensional continuous space. Lastly, in cloud environment, they can hardly cope with the changes of hardware configurations and workloads, and have poor adaptability. To address these challenges, we design an end-to-end automatic CDB tuning system, CDBTune, using deep reinforcement learning (RL). CDBTune utilizes the deep deterministic policy gradient method to find the optimal configurations in high-dimensional continuous space. CDBTune adopts a try-and-error strategy to learn knob settings with a limited number of samples to accomplish the initial training, which alleviates the difficulty of collecting massive high-quality samples. CDBTune adopts the reward-feedback mechanism in RL instead of traditional regression, which enables end-to-end learning and accelerates the convergence speed of our model and improves efficiency of online tuning. We conducted extensive experiments under 6 different workloads on real cloud databases to demonstrate the superiority of CDBTune. Experimental results showed that CDBTune had a good adaptability and significantly outperformed the state-of-the-art tuning tools and DBA experts.

References

[1]
Sanjay Agrawal, Nicolas Bruno, Surajit Chaudhuri, and Vivek R Narasayya. 2006. AutoAdmin: Self-Tuning Database SystemsTechnology. IEEE Data Eng. Bull., Vol. 29, 3 (2006), 7--15.
[2]
Sanjay Agrawal, Surajit Chaudhuri, Lubor Kollar, Arun Marathe, Vivek Narasayya, and Manoj Syamala. 2005. Database tuning advisor for microsoft sql server 2005. In ACM SIGMOD. ACM, 930--932.
[3]
Sanjay Agrawal, Vivek Narasayya, and Beverly Yang. 2004. Integrating vertical and horizontal partitioning into automated physical database design. In ACM SIGMOD. ACM, 359--370.
[4]
Dana Van Aken, Andrew Pavlo, Geoffrey J. Gordon, and Bohan Zhang. 2017. Automatic Database Management System Tuning Through Large-scale Machine Learning. In ACM SIGMOD . 1009--1024.
[5]
Debabrota Basu, Qian Lin, Hoang Tam Vo, Hoang Tam Vo, Zihong Yuan, and Pierre Senellart. 2016. Regularized Cost-Model Oblivious Database Tuning with Reinforcement Learning .Springer Berlin Heidelberg. 96--132 pages.
[6]
Peter Belknap, Benoit Dageville, Karl Dias, and Khaled Yagoub. 2009. Self-tuning for SQL performance in Oracle database 11g. In ICDE. IEEE, 1694--1700.
[7]
Phil Bernstein et almbox. 1998. The Asilomar report on database research. ACM Sigmod record, Vol. 27, 4 (1998), 74--80.
[8]
Nicolas Bruno and Surajit Chaudhuri. 2005. Automatic physical database tuning: a relaxation-based approach. In ACM SIGMOD. ACM, 227--238.
[9]
Surajit Chaudhuri and Vivek Narasayya. 1998. AutoAdmin "what-if" index analysis utility. In ACM SIGMOD. 367--378.
[10]
Surajit Chaudhuri and Vivek Narasayya. 2007. Self-Tuning Database Systems: A Decade of Progress. In VLDB. 3--14.
[11]
Surajit Chaudhuri and Gerhard Weikum. 2000. Rethinking Database System Architecture: Towards a Self-Tuning RISC-Style Database System. In VLDB. 1--10.
[12]
Biplob K Debnath, David J Lilja, and Mohamed F Mokbel. 2008. SARD: A statistical approach for ranking database tuning parameters. In ICDEW. IEEE, 11--18.
[13]
Karl Dias, Mark Ramacher, Uri Shaft, Venkateshwaran Venkataramani, and Graham Wood. 2005. Automatic Performance Diagnosis and Tuning in Oracle. In CIDR. 84--94.
[14]
Songyun Duan, Vamsidhar Thummala, and Shivnath Babu. 2009. Tuning database configuration parameters with iTuned. VLDB Endowment, Vol. 2, 1 (2009), 1246--1257.
[15]
Yoav Goldberg. 2015. A Primer on Neural Network Models for Natural Language Processing. Computer Science (2015).
[16]
Goetz Graefe and Harumi A. Kuno. 2010. Self-selecting, self-tuning, incrementally optimized indexes. In EDBT . 371--381.
[17]
Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep Residual Learning for Image Recognition. In CVPR. 770--778.
[18]
G. E. Hinton and R. R. Salakhutdinov. 2006. Reducing the dimensionality of data with neural networks. Science, Vol. 313, 5786 (2006), 504--507.
[19]
Stratos Idreos, Martin L. Kersten, and Stefan Manegold. 2007. Database Cracking. In CIDR. 68--78.
[20]
Stratos Idreos, Martin L. Kersten, and Stefan Manegold. 2009. Self-organizing tuple reconstruction in column-stores. In ACM SIGMOD . 297--308.
[21]
Stratos Idreos, Stefan Manegold, Harumi A. Kuno, and Goetz Graefe. 2011. Merging What's Cracked, Cracking What's Merged: Adaptive Indexing in Main-Memory Column-Stores. PVLDB, Vol. 4, 9 (2011), 585--597.
[22]
Stratos Idreos, Kostas Zoumpatianos, Brian Hentschel, Michael S. Kester, and Demi Guo. 2018. The Data Calculator: Data Structure Design and Cost Synthesis from First Principles and Learned Cost Models. In ACM SIGMOD. 535--550.
[23]
Ihab F Ilyas, Volker Markl, Peter Haas, Paul Brown, and Ashraf Aboulnaga. 2004. CORDS: automatic discovery of correlations and soft functional dependencies. In ACM SIGMOD. ACM, 647--658.
[24]
Alex Krizhevsky, Ilya Sutskever, and Geoffrey E. Hinton. 2012. ImageNet classification with deep convolutional neural networks. In NIPS . 1097--1105.
[25]
Sushil Kumar. 2003. Oracle database 10g: The self-managing database.
[26]
Eva Kwan, Sam Lightstone, Adam Storm, and Leanne Wu. 2002. Automatic configuration for IBM DB2 universal database. In Proc. of IBM Perf Technical Report .
[27]
Yann Lecun, Yoshua Bengio, and Geoffrey Hinton. 2015. Deep learning. Nature, Vol. 521, 7553 (2015), 436.
[28]
Sam S Lightstone and Bishwaranjan Bhattacharjee. 2004. Automated design of multidimensional clustering tables for relational databases. In VLDB. VLDB, 1170--1181.
[29]
Timothy P. Lillicrap, Jonathan J. Hunt, Alexander Pritzel, Nicolas Heess, Tom Erez, Yuval Tassa, David Silver, and Daan Wierstra. 2015. Continuous control with deep reinforcement learning. CoRR, Vol. abs/1509.02971 (2015).
[30]
Vasilis Maglogiannis, Dries Naudts, Adnan Shahid, and Ingrid Moerman. 2018. A Q-Learning Scheme for Fair Coexistence Between LTE and Wi-Fi in Unlicensed Spectrum. IEEE Access, Vol. 6 (2018), 27278--27293.
[31]
Ryan Marcus and Olga Papaemmanouil. 2018. Deep reinforcement learning for join order enumeration. arXiv preprint arXiv:1803.00055 (2018).
[32]
Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Alex Graves, Ioannis Antonoglou, Daan Wierstra, and Martin A. Riedmiller. 2013. Playing Atari with Deep Reinforcement Learning. CoRR, Vol. abs/1312.5602 (2013).
[33]
Dushyanth Narayanan, Eno Thereska, and Anastassia Ailamaki. 2005. Continuous resource monitoring for self-predicting DBMS. In null. IEEE, 239--248.
[34]
Jennifer Ortiz, Magdalena Balazinska, Johannes Gehrke, and S Sathiya Keerthi. 2018. Learning State Representations for Query Optimization with Deep Reinforcement Learning. arXiv preprint arXiv:1803.08604 (2018).
[35]
Andrew Pavlo, Gustavo Angulo, Joy Arulraj, Haibin Lin, Jiexi Lin, Lin Ma, Prashanth Menon, Todd C Mowry, Matthew Perron, Ian Quah, et almbox. 2017. Self-Driving Database Management Systems. In CIDR .
[36]
Jun Rao, Chun Zhang, Nimrod Megiddo, and Guy Lohman. 2002. Automating physical database design in a parallel database. In ACM SIGMOD. ACM, 558--569.
[37]
Stefan Richter, Jorge-Arnulfo Quiané -Ruiz, Stefan Schuh, and Jens Dittrich. 2014. Towards zero-overhead static and adaptive indexing in Hadoop. VLDB J., Vol. 23, 3 (2014), 469--494.
[38]
Tom Schaul, John Quan, Ioannis Antonoglou, and David Silver. 2015. Prioritized Experience Replay. Computer Science (2015).
[39]
Felix Martin Schuhknecht, Alekh Jindal, and Jens Dittrich. 2013. The Uncracked Pieces in Database Cracking. PVLDB, Vol. 7, 2 (2013), 97--108.
[40]
Ankur Sharma, Felix Martin Schuhknecht, and Jens Dittrich. 2018. The Case for Automatic Database Administration using Deep Reinforcement Learning. (2018).
[41]
Adam J Storm, Christian Garcia-Arellano, Sam S Lightstone, Yixin Diao, and Maheswaran Surendra. 2006. Adaptive self-tuning memory in DB2. In VLDB. VLDB, 1081--1092.
[42]
David G Sullivan, Margo I Seltzer, and Avi Pfeffer. 2004. Using probabilistic reasoning to automate software tuning. Vol. 32. ACM.
[43]
R S Sutton and A G Barto. 2005. Reinforcement Learning: An Introduction, Bradford Book. IEEE Transactions on Neural Networks, Vol. 16, 1 (2005), 285--286.
[44]
Richard S Sutton and Andrew G Barto. 2011. Reinforcement learning: An introduction. (2011).
[45]
Wenhu Tian, Pat Martin, and Wendy Powley. 2003. Techniques for automatically sizing multiple buffer pools in DB2. In Proceedings of the 2003 conference of the Centre for Advanced Studies on Collaborative research . IBM Press, 294--302.
[46]
Dinh Nguyen Tran, Phung Chinh Huynh, Yong C Tay, and Anthony KH Tung. 2008. A new approach to dynamic self-tuning of database buffers. TOS, Vol. 4, 1 (2008), 3.
[47]
Kostas Tzoumas, Timos Sellis, and Christian S Jensen. 2008. A reinforcement learning approach for adaptive query processing. History (2008).
[48]
Linnan Wang, Jinmian Ye, Yiyang Zhao, Wei Wu, Ang Li, Shuaiwen Leon Song, Zenglin Xu, and Tim Kraska. 2018. SuperNeurons: Dynamic GPU Memory Management for Training Deep Neural Networks. (2018).
[49]
Wei Wang, Meihui Zhang, Gang Chen, HV Jagadish, Beng Chin Ooi, and Kian-Lee Tan. 2016. Database meets deep learning: Challenges and opportunities. ACM SIGMOD Record, Vol. 45, 2 (2016), 17--22.
[50]
Gerhard Weikum, Christof Hasse, Axel Mönkeberg, and Peter Zabback. 1994. The COMFORT automatic tuning project. Information systems, Vol. 19, 5 (1994), 381--432.
[51]
Gerhard Weikum, Axel Moenkeberg, Christof Hasse, and Peter Zabback. 2002. Self-tuning database technology and information services: from wishful thinking to viable engineering. In VLDB. Elsevier, 20--31.
[52]
David Wiese, Gennadi Rabinovitch, Michael Reichert, and Stephan Arenswald. 2008. Autonomic tuning expert: a framework for best-practice oriented autonomic database tuning. In 2008 conference of the center for advanced studies on collaborative research: meeting of minds. ACM, 3.
[53]
Khaled Yagoub, Peter Belknap, Benoit Dageville, Karl Dias, Shantanu Joshi, and Hailing Yu. 2008. Oracle's SQL Performance Analyzer. IEEE Data Eng. Bull., Vol. 31, 1 (2008), 51--58.
[54]
Dong Young Yoon, Ning Niu, and Barzan Mozafari. 2016. Dbsherlock: A performance diagnostic tool for transactional databases. In ACM SIGMOD. ACM, 1599--1614.
[55]
Yuqing Zhu, Jianxun Liu, Mengying Guo, Yungang Bao, Wenlong Ma, Zhuoyue Liu, Kunpeng Song, and Yingchun Yang. 2017. Bestconfig: tapping the performance potential of systems via automatic configuration tuning. In SoCC. ACM, 338--350.
[56]
Daniel C Zilio. 1998. Physical Database Design Decision Algorithms and Concurrent Reorganization for Parallel Database Systems.
[57]
Daniel C Zilio, Jun Rao, Sam Lightstone, Guy Lohman, Adam Storm, Christian Garcia-Arellano, and Scott Fadden. 2004. DB2 design advisor: integrated automatic physical database design. In VLDB. VLDB, 1087--1097.

Cited By

View all
  • (2025)Centrum: Model-based Database Auto-tuning with Minimal Distributional AssumptionsProceedings of the ACM on Management of Data10.1145/37096713:1(1-26)Online publication date: 11-Feb-2025
  • (2025)Automatic Database Configuration Debugging using Retrieval-Augmented Language ModelsProceedings of the ACM on Management of Data10.1145/37096633:1(1-27)Online publication date: 11-Feb-2025
  • (2025)CSAT: Configuration structure-aware tuning for highly configurable software systemsJournal of Systems and Software10.1016/j.jss.2024.112316222(112316)Online publication date: Apr-2025
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
SIGMOD '19: Proceedings of the 2019 International Conference on Management of Data
June 2019
2106 pages
ISBN:9781450356435
DOI:10.1145/3299869
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 25 June 2019

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. cloud
  2. database tuning
  3. storage

Qualifiers

  • Research-article

Funding Sources

Conference

SIGMOD/PODS '19
Sponsor:
SIGMOD/PODS '19: International Conference on Management of Data
June 30 - July 5, 2019
Amsterdam, Netherlands

Acceptance Rates

SIGMOD '19 Paper Acceptance Rate 88 of 430 submissions, 20%;
Overall Acceptance Rate 785 of 4,003 submissions, 20%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)337
  • Downloads (Last 6 weeks)34
Reflects downloads up to 27 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2025)Centrum: Model-based Database Auto-tuning with Minimal Distributional AssumptionsProceedings of the ACM on Management of Data10.1145/37096713:1(1-26)Online publication date: 11-Feb-2025
  • (2025)Automatic Database Configuration Debugging using Retrieval-Augmented Language ModelsProceedings of the ACM on Management of Data10.1145/37096633:1(1-27)Online publication date: 11-Feb-2025
  • (2025)CSAT: Configuration structure-aware tuning for highly configurable software systemsJournal of Systems and Software10.1016/j.jss.2024.112316222(112316)Online publication date: Apr-2025
  • (2025)Meta Reinforcement Learning Based Dynamic Tuning for Blockchain Systems in Diverse Network EnvironmentsBlockchain: Research and Applications10.1016/j.bcra.2024.100261(100261)Online publication date: Jan-2025
  • (2025)ADWTune: an adaptive dynamic workload tuning system with deep reinforcement learningComplex & Intelligent Systems10.1007/s40747-025-01801-311:4Online publication date: 28-Feb-2025
  • (2024)OPPerTuneProceedings of the 21st USENIX Symposium on Networked Systems Design and Implementation10.5555/3691825.3691886(1101-1120)Online publication date: 16-Apr-2024
  • (2024)Serendipitous, Open Big Data Management and Analytics: The SeDaSOMA FrameworkModelling10.3390/modelling50300615:3(1173-1196)Online publication date: 4-Sep-2024
  • (2024)MLOS in Action: Bridging the Gap Between Experimentation and Auto-Tuning in the CloudProceedings of the VLDB Endowment10.14778/3685800.368585217:12(4269-4272)Online publication date: 8-Nov-2024
  • (2024)Db2une: Tuning Under Pressure via Deep LearningProceedings of the VLDB Endowment10.14778/3685800.368581117:12(3855-3868)Online publication date: 8-Nov-2024
  • (2024)GaussDB: A Cloud-Native Multi-Primary Database with Compute-Memory-Storage DisaggregationProceedings of the VLDB Endowment10.14778/3685800.368580617:12(3786-3798)Online publication date: 8-Nov-2024
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media