Abstract
In this paper, we discuss a self-managed distributed column-store system that would adapt its physical design to changing workloads. The architectural novelties of column-stores hold great promise for the construction of an efficient self-managed database. First, we present a short survey of existing self-managed systems. Then, we offer some views on the organization of a self-managed distributed column-store system. We discuss its three core components: the alerter, the reorganization controller, and the set of physical design options (actions) available to such a system. We present possible approaches to each of these components and evaluate them. This study is a first step towards the creation of an adaptive distributed column-store system.
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Chernishev, G. (2015). Towards Self-management in a Distributed Column-Store System. In: Morzy, T., Valduriez, P., Bellatreche, L. (eds) New Trends in Databases and Information Systems. ADBIS 2015. Communications in Computer and Information Science, vol 539. Springer, Cham. https://doi.org/10.1007/978-3-319-23201-0_12
DOI: https://doi.org/10.1007/978-3-319-23201-0_12
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-23200-3
Online ISBN: 978-3-319-23201-0
eBook Packages: Computer Science (R0)