ABSTRACT
The database performance is one of the most important quality indicators that companies are looking for to choose their appropriate database management systems. Quantifying this performance is usually performed by the means of mathematical cost models. Due to the importance of these models, for each evolution of the database technology pushes researchers to revisit or to propose cost models in order to integrate new dimensions brought by that evolution. As a consequence, a huge number of cost models exists. To exploit them, we need to find their respective scientific papers. This situation is in contradiction with Era Sharing, because it reduces reuse of these cost models by researchers, students (from third world countries), etc. and even more it penalizes the reproduction of experiments that intensively use these cost models. In this paper, we propose a framework for cost models dedicated to query processing and optimization. We first propose a common repository, called, MetricStore, to store metrics of cost model units. Secondly, thanks to model-driven engineering facilities, the repository offers capabilities aiming at publishing, searching and reusing cost models through a suitable user interface. Tool support is fully available.
- D. Bausch, I. Petrov, et al. Making cost-based query optimization asymmetry-aware. In DaMoN, pages 24--32. ACM, 2012. Google ScholarDigital Library
- L. Bellatreche, S. Benkrid, et al. Verification of partitioning and allocation techniques on teradata dbms. In ICA3PP, pages 158--169. Springer, 2011. Google ScholarDigital Library
- L. Bellatreche, S. Cheikh, et al. How to exploit the device diversity and database interaction to propose a generic cost model? In Proceedings of the 17th IDEAS, pages 142--147. ACM, 2013. Google ScholarDigital Library
- S. Breß, I. Geist, E. Schallehn, M. Mory, and G. Saake. A framework for cost based optimization of hybrid cpu/gpu query plans in database systems. Control and Cybernetics, 41, 2012.Google Scholar
- T. Burns, E. Fong, and Other. Reference model for dbms standardization. SIGMOD Record, 15(1):19--58, 1986. Google ScholarDigital Library
- S. Chaudhuri and V. Narasayya. Self-tuning database systems: a decade of progress. In Proceedings of the 33rd international conference on VLDB, pages 3--14. VLDB Endowment, 2007. Google ScholarDigital Library
- S. Chaudhuri and V. R. Narasayya. Autoadmin 'what-if' index analysis utility. In ACM SIGMOD, pages 367--378, 1998. Google ScholarDigital Library
- G. Gardarin, F. Sha, and Z.-H. Tang. Calibrating the query optimizer cost model of iro-db, an object-oriented federated database system. In VLDB, volume 96, pages 3--6, 1996. Google ScholarDigital Library
- S. Kent. Model driven language engineering. Electr. Notes Theor. Comput. Sci., 72(4):6, 2003. Google ScholarCross Ref
- V. Leis, A. Gubichev, and Other. How good are query optimizers, really? PVLDB, 9(3):204--215, 2015. Google ScholarDigital Library
- A. Lübcke and Other. A framework for optimal selection of a storage architecture in rdbms. Datorzinātne un informācijas tehnologijas, page 65, 2010.Google Scholar
- C. Maier, D. Dash, et al. Parinda: an interactive physical designer for postgresql. In EDBT, pages 701--704. ACM, 2010. Google ScholarDigital Library
- S. Manegold, P. Boncz, and M. L. Kersten. Generic database cost models for hierarchical memory systems. In VLDB, pages 191--202, 2002. Google ScholarDigital Library
- B. Mozafari, E. Z. Y. Goh, and D. Y. Yoon. Cliffguard: A principled framework for finding robust database designs. In Proceedings of the 2015 ACM SIGMOD, pages 1167--1182. ACM, 2015. Google ScholarDigital Library
- A. Ouared, Y. Ouhammou, and I. Bellatreche. Costdl: a cost models description language for performance metrics in database. In Proceedings of the 21ST IEEE ICECCS. IEEE, 2016. Google ScholarCross Ref
- A. Ouared, Y. Ouhammou, and A. Roukh. A meta-advisor repository for database physical design. In International Conference on MEDI, pages 72--87. Springer, 2016. Google ScholarCross Ref
- A. Roukh, L. Bellatreche, et al. Eco-dmw: Eco-design methodology for data warehouses. In ACM DOLAP, pages 1--10. ACM, 2015. Google ScholarDigital Library
- P. G. Selinger, M. M. Astrahan, et al. Access path selection in a relational database management system. In ACM SIGMOD, pages 23--34. ACM, 1979. Google ScholarDigital Library
- R. Varadarajan, V. Bharathan, et al. Dbdesigner: A customizable physical design tool for vertica analytic database. In IEEE 30th ICDE, pages 1084--1095. IEEE, 2014. Google ScholarCross Ref
- A. Wolke and Other. Reproducible experiments on dynamic resource allocation in cloud data centers. ISJ, 52:83--95, 2016. Google ScholarDigital Library
- Z. Xu, Y. Tu, and X. Wang. PET: reducing database energy cost via query optimization. PVLDB, 5(12):1954--1957, 2012. Google ScholarDigital Library
- N. Zhang, J. Tatemura, et al. Towards cost-effective storage provisioning for dbmss. Proceedings of the VLDB Endowment, 5(4):274--285, 2011. Google ScholarDigital Library
Index Terms
MetricStore repository: on the leveraging of performance metrics in databases
Recommendations
Reuse Concepts and a Reuse Support Repository
ECBS '96: Proceedings of the IEEE Symposium and Workshop on Engineering of Computer Based SystemsIf we want to create a Reuse Environment for Software Engineers, we have to concentrate on both the technical drawbacks of such a complex Environment, as well as on the Software Engineers themselves. Reuse is not just a new mode or a simple development ...
A Repository to Support Software Process Reuse Based on Process Lines
SBQS '20: Proceedings of the XIX Brazilian Symposium on Software QualityDefining a software process is a complex activity, especially when defined from scratch. Thus, different reuse techniques have been proposed to reduce the effort, as well as increase the quality of the defined process, such as Software Process Line (...
A Cost Model for DBaaS Storage
DEXA 2016: Proceedings, Part I, 27th International Conference on Database and Expert Systems Applications - Volume 9827Cloud infrastructures employ hybrid storage systems that incorporate various types of devices flash memory solid-state and hard disk drives. Dealing with such heterogeneity makes the use of data placements strategies necessary. These strategies ...
Comments