ABSTRACT
This demonstration presents SharedDB, an implementation of a relational database system capable of executing all SQL operators by sharing computation and resources across all running queries. SharedDB sidesteps the traditional query-at-a-time approach and executes queries in batches. Unlike proposed multi-query optimization ideas, in SharedDB queries do not have to contain common subexpressions in order to be part of the same batch, which allows for a higher degree of sharing. By sharing as much as possible, SharedDB avoids repeating parts of computation that is common across all running queries. The goal of this demonstration is to show the ability of shared query execution to a) answer complex and diverse workloads, and b) reduce the interaction among concurrently executed queries that is observed in traditional systems and leads to performance deterioration and instabilities.
- G. Giannikis, G. Alonso, and D. Kossmann. SharedDB: Killing one Thousand Queries with one Stone. Proc. VLDB Endow., 5(6):526--537, Feb. 2012. Google ScholarDigital Library
- S. Harizopoulos and A. Ailamaki. StagedDB: Designing Database Servers for Modern Hardware. IEEE Data Eng. Bull., 28(2):11--16, 2005.Google Scholar
- T.-I. Salomie, I. E. Subasu, J. Giceva, and G. Alonso. Database Engines on Multicores, Why Parallelize when you can Distribute? In Proc. EuroSys, pages 17--30, 2011. Google ScholarDigital Library
- P. Unterbrunner, G. Giannikis, G. Alonso, D. Fauser, and D. Kossmann. Predictable Performance for Unpredictable Workloads. In Proc. VLDB, pages 706--717, 2009. Google ScholarDigital Library
Index Terms
Workload optimization using SharedDB
Recommendations
Shared workload optimization
As a result of increases in both the query load and the data managed, as well as changes in hardware architecture (multicore), the last years have seen a shift from query-at-a-time approaches towards shared work (SW) systems where queries are executed ...
Many-query join: efficient shared execution of relational joins on modern hardware
Database architectures typically process queries one at a time, executing concurrent queries in independent execution contexts. Often, such a design leads to unpredictable performance and poor scalability. One approach to circumvent the problem is to ...
AStream: Ad-hoc Shared Stream Processing
SIGMOD '19: Proceedings of the 2019 International Conference on Management of DataIn the last decade, many distributed stream processing engines (SPEs) were developed to perform continuous queries on massive online data. The central design principle of these engines is to handle queries that potentially run forever on data streams ...
Comments