Abstract
Database systems are widely used in heterogeneous applications. However, it is difficult to decide which database management system meets requirements of a certain application best. This observation is even more true for scientific and statistical data management, because new application and research fields are often first observed in this domain. New requirements are often implied to data management while discovering unknown research and applications areas. That is, heuristics and tools do not exist to select an optimal database management system. We develop a decision support framework to support application-performance analyses on database management systems. We use mappings and merge workload information to patterns. We present heuristics for performance estimation to select the optimal database management system for a given workload. We show, these heuristics improve our decision framework by complexity reduction without loss of accuracy. Finally, we evaluate our heuristics considering standard database benchmarks.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Daniel, J.: Abadi. Query execution in column-oriented database systems. PhD thesis, Cambridge, MA, USA, Adviser: Madden, Samuel (2008)
Astrahan, M.M., Blasgen, M.W., Chamberlin, D.D., Eswaran, K.P., Gray, J., Griffiths, P.P., Frank King III, W., Lorie, R.A., McJones, P.R., Mehl, J.W., Putzolu, G.R., Traiger, I.L., Wade, B.W., Watson, V.: System R: Relational Approach to Database Management. ACM Trans. Database Syst. 1(2), 97–137 (1976)
Armbrust, M., Fox, A., Griffith, R., Joseph, A.D., Katz, R.H., Konwinski, A., Lee, G., Patterson, D.A., Rabkin, A., Stoica, I., Zaharia, M.: Above the Clouds: A Berkeley View of Cloud Computing. Technical Report UCB/EECS-2009-28, EECS Department, University of California, Berkeley (February 2009)
Abadi, D.J., Madden, S.R., Hachem, N.: Column-stores vs. row-stores: How different are they really?. In: SIGMOD 2008, pp. 967–980 (2008)
Buyya, R., Yeo, C.S., Venugopal, S.: Market-Oriented Cloud Computing: Vision, Hype, and Reality for Delivering IT Services as Computing Utilities. In: HPCC 2008, pp. 5–13 (2008)
Codd, E.F., Codd, S.B., Salley, C.T.: Providing OLAP to User-Analysts: An IT Mandate. Ann ArborMichigan 24 (1993)
Chang, F., Dean, J., Ghemawat, S., Hsieh, W.C., Wallach, D.A., Burrows, M., Chandra, T., Fikes, A., Gruber, R.: Bigtable: A Distributed Storage System for Structured Data. In: OSDI 2006, pp. 205–218 (2006)
Chaudhuri, S., Narasayya, V.: Self-tuning database systems: A decade of progress. In: VLDB 2007, pp. 3–14 (2007)
Dean, J., Ghemawat, S.: MapReduce: Simplified Data Processing on Large Clusters. In: OSDI 2004, pp. 137–150 (2004)
Dean, J., Ghemawat, S.: MapReduce: simplified data processing on large clusters. Commun. ACM 51(1), 107–113 (2008)
DeCandia, G., Hastorun, D., Jampani, M., Kakulapati, G., Lakshman, A., Pilchin, A., Sivasubramanian, S., Vosshall, P., Vogels, W.: Dynamo: amazon’s highly available key-value store. In: SOSP 2007, pp. 205–220 (2007)
Ferraris, D.R.L.: TPCC-UVa: an open-source TPC-C implementation for global performance measurement of computer systems. SIGMOD Record 35(4), 6–15 (2006)
French, C.D.: Teaching an OLTP database kernel advanced datawarehousing techniques. In: ICDE 1997, pp. 194–198 (1997)
Foster, I.T., Zhao, Y., Raicu, I., Lu, S.: Cloud Computing and Grid Computing 360-Degree Compared. CoRR 2009, abs/0901.0131 (2009)
Graefe, G., DeWitt, D.J.: The EXODUS Optimizer Generator. In: SIGMOD 1987, pp. 160–172 (1987)
Holze, M., Gaidies, C., Ritter, N.: Consistent on-line classification of DBS workload events. In: CIKM 2009, pp. 1641–1644 (2009)
Idreos, S.: Database Cracking: Torwards Auto-tuning Database Kernels. PhD thesis (2010)
Kemper, A., Neumann, T.: HyPer: A hybrid OLTP&OLAP main memory database system based on virtual memory snapshots. In: ICDE 2011, pp. 195–206 (2011)
Korth, H.F., Silberschatz, A.: Database Research Faces the Information Explosion. Commun. ACM 40(2), 139–142 (1997)
Lübcke, A., Köppen, V., Saake, G.: A Decision Model to Select the Optimal Storage Architecture for Relational Databases. In: Proceedings of the Fifth IEEE International Conference on Research Challenges in Information Science, RCIS 2011, pp. 74–84 (2011)
Lübcke, A., Saake, G.: A Framework for Optimal Selection of a Storage Architecture in RDBMS. In: DB&IS 2010, pp. 65–76 (2010)
Lübcke, A., Schäler, M., Köppen, V., Saake, G.: Workload-based Heuristics for Evaluation of Physical Database Architectures. In: DB&IS 2012, pp. 3–10 (2012)
Lübcke, A.: Challenges in Workload Analyses for Column and Row Storess. In: Grundlagen von Datenbanken (2010)
Naydenova, I., Kaloyanova, K.: Sparsity Handling and Data Explosion in OLAP Systems. In: MCIS 2010, pp. 62–70 (2010)
Tamer Özsu, M., Valdurie, P.: Principles of Distributed Database Systems, 3rd edn. Springer (2011)
Plattner, H.: A common database approach for OLTP and OLAP using an in-memory column database. In: SIGMOD 2009, pp. 1–2. ACM (2009)
Raatikainen, K.E.E.: Cluster Analysis and Workload Classification. SIGMETRICS Performance Evaluation Review 20(4), 24–30 (1993)
Stonebraker, M., Abadi, D.J., Batkin, A., Chen, X., Cherniack, M., Ferreira, M., Lau, E., Lin, A., Madden, S., O’Neil, E.J., O’Neil, P.E., Rasin, A., Tran, N., Zdonik, S.B.: C-Store: A column-oriented DBMS. In: VLDB 2005, pp. 553–564. VLDB Endowment (2005)
Santos, R.J., Bernardino, J.: Real-time data warehouse loading methodology. In: IDEAS 2008, pp. 49–58 (2008)
Transaction Processing Performance Council. TPC BENCHMARKTM H. White Paper, Decision Support Standard Specification,Revision 2.11.0 (April 2010)
Vaisman, A.A., Mendelzon, A.O., Ruaro, W., Cymerman, S.G.: Supporting dimension updates in an OLAP server. Information Systems 29(2), 165–185 (2004)
Zhu, Y., An, L., Liu, S.: Data Updating and Query in Real-Time Data Warehouse System. In: CSSE 2008, pp. 1295–1297 (2008)
Zukowski, M., Nes, N., Boncz, P.A.: DSM vs. NSM: CPU performance tradeoffs in block-oriented query processing. In: DaMoN 2008, pp. 47–54 (2008)
Zilio, D.C., Rao, J., Lightstone, S., Lohman, G.M., Storm, A.J., Garcia-Arellano, C., Fadden, S.: DB2 Design Advisor: Integrated automatic physical database design. In: VLDB 2004, pp. 1087–1097. VLDB Endowment (2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Lübcke, A., Köppen, V., Saake, G. (2013). Heuristics-Based Workload Analysis for Relational DBMSs. In: Mayr, H.C., Kop, C., Liddle, S., Ginige, A. (eds) Information Systems: Methods, Models, and Applications. UNISCON 2012. Lecture Notes in Business Information Processing, vol 137. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-38370-0_3
Download citation
DOI: https://doi.org/10.1007/978-3-642-38370-0_3
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-38369-4
Online ISBN: 978-3-642-38370-0
eBook Packages: Computer ScienceComputer Science (R0)