Optimizing Write Performance for Read Optimized Databases

Krueger, Jens; Grund, Martin; Tinnefeld, Christian; Plattner, Hasso; Zeier, Alexander; Faerber, Franz

doi:10.1007/978-3-642-12098-5_23

Jens Krueger²⁰,
Martin Grund²⁰,
Christian Tinnefeld²⁰,
Hasso Plattner²⁰,
Alexander Zeier²⁰ &
…
Franz Faerber²¹

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 5982))

Included in the following conference series:

International Conference on Database Systems for Advanced Applications

2258 Accesses
18 Citations

Abstract

Compression in column-oriented databases has been proven to offer both performance enhancements and reductions in storage consumption. This is especially true for read access as compressed data can directly be processed for query execution.Nevertheless, compression happens to be disadvantageous when it comes to write access due to unavoidable re-compression: write-access requires significantly more data to be read than involved in the particular operation, more tuples may have to be modified depending on the compression algorithm, and table-level locks have to be acquired instead of row-level locks as long as no second version of the data is stored. As an effect the duration of a single modification — both insert and update — limits both throughput and response time significantly. In this paper, we propose to use an additional write-optimized buffer to maintain the delta that in conjunction with the compressed main store represents the current state of the data. This buffer facilitates an uncompressed, column-oriented data structure. To address the mentioned disadvantages of data compression, we trade write-performance for query-performance and memory consumption by using the buffer as an intermediate storage for several modifications which are then populated as a bulk in a merge operation. Hereby, the overhead created by one single re-compression is shared among all recent modifications. We evaluated our implementation inside SAP’s in memory column store. We then analyze the different parameters influencing the merge process, and make a complexity analysis. Finally, we show optimizations regarding resource consumption and merge duration.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Compression-Aware In-Memory Query Processing: Vision, System Design and Beyond

Revisiting Data Compression in Column-Stores

A Framework of Write Optimization on Read-Optimized Out-of-Core Column-Store Databases

References

Abadi, D., Madden, S., Ferreira, M.: Integrating compression and execution in column-oriented database systems. In: SIGMOD 2006, pp. 671–682. ACM, New York (2006)
Chapter Google Scholar
Abadi, D.J.: Query Execution in Column-Oriented Database Systems. PhD thesis
Google Scholar
Boncz, P.A., Manegold, S., Kersten, M.L.: Database architecture optimized for the new bottleneck: Memory access. In: VLDB, pp. 54–65 (1999)
Google Scholar
Boncz, P.A., Zukowski, M., Nes, N.: Monetdb/x100: Hyper-pipelining query execution. In: CIDR, pp. 225–237 (2005)
Google Scholar
Brown, K.P., Mehta, M., Carey, M.J., Livny, M.: Towards automated performance tuning for complex workloads. In: VLDB, pp. 72–84 (1994)
Google Scholar
Chen, Z., Gehrke, J., Korn, F.: Query optimization in compressed database systems. In: SIGMOD 2001, pp. 271–282. ACM, New York (2001)
Chapter Google Scholar
Copeland, G.P., Khoshafian, S.: A decomposition storage model. In: SIGMOD Conference, pp. 268–279 (1985)
Google Scholar
French, C.D.: “one size fits all” database architectures do not work for DDS. In: SIGMOD Conference, pp. 449–450 (1995)
Google Scholar
French, C.D.: Teaching an OLTP database kernel advanced data warehousing techniques. In: ICDE, pp. 194–198 (1997)
Google Scholar
Harizopoulos, S., Liang, V., Abadi, D.J., Madden, S.: Performance tradeoffs in read-optimized databases. In: VLDB, pp. 487–498 (2006)
Google Scholar
Holloway, A.L., DeWitt, D.J.: Read-optimized databases, in depth. PVLDB 1(1), 502–513 (2008)
Google Scholar
Legler, T., Lehner, W., Ross, A.: Data mining with the SAP NetWeaver BI accelerator. In: VLDB 2006, pp. 1059–1068. VLDB Endowment (2006)
Google Scholar
Pang, H., Carey, M.J., Livny, M.: Multiclass query scheduling in real-time database systems. IEEE Trans. Knowl. Data Eng. 7(4), 533–551 (1995)
Article Google Scholar
Plattner, H.: A common database approach for OLTP and OLAP using an in-memory column database. In: SIGMOD Conference, pp. 1–2 (2009)
Google Scholar
Ramamurthy, R., DeWitt, D.J., Su, Q.: A case for fractured mirrors. In: VLDB, pp. 430–441 (2002)
Google Scholar
Rao, J., Ross, K.A.: Making b$^{\mbox{+}}$-trees cache conscious in main memory. In: SIGMOD Conference, pp. 475–486 (2000)
Google Scholar
Rappaport, R.L.: File structure design to facilitate on-line instantaneous updating. In: SIGMOD 1975, pp. 1–14. ACM, New York (1975)
Chapter Google Scholar
Severance, D.G., Lohman, G.M.: Differential files: their application to the maintenance of large databases. ACM Trans. Database Syst. 1(3), 256–267 (1976)
Article Google Scholar
Stonebraker, M., Abadi, D.J., Batkin, A., Chen, X., Cherniack, M., Ferreira, M., Lau, E., Lin, A., Madden, S., O’Neil, E.J., O’Neil, P.E., Rasin, A., Tran, N., Zdonik, S.B.: C-store: A column-oriented dbms. In: VLDB, pp. 553–564 (2005)
Google Scholar
Westmann, T., Kossmann, D., Helmer, S., Moerkotte, G.: The implementation and performance of compressed databases. SIGMOD Rec. 29(3), 55–67 (2000)
Article Google Scholar
Willhalm, T., Popovici, N., Boshmaf, Y., Plattner, H., Zeier, A., Schaffner, J.: Simd-scan: Ultra fast in-memory table scan using on-chip vector processing units. PVLDB 2(1), 385–394 (2009)
Google Scholar

Download references

Author information

Authors and Affiliations

Hasso–Plattner–Institut, August–Bebel–Str. 88, 14482, Potsdam, Germany
Jens Krueger, Martin Grund, Christian Tinnefeld, Hasso Plattner & Alexander Zeier
SAP AG, Dietmar-Hopp-Allee 16, 69190, Walldorf
Franz Faerber

Authors

Jens Krueger
View author publications
You can also search for this author in PubMed Google Scholar
Martin Grund
View author publications
You can also search for this author in PubMed Google Scholar
Christian Tinnefeld
View author publications
You can also search for this author in PubMed Google Scholar
Hasso Plattner
View author publications
You can also search for this author in PubMed Google Scholar
Alexander Zeier
View author publications
You can also search for this author in PubMed Google Scholar
Franz Faerber
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Graduate School of Systems and Information Engineering, University of Tsukuba, 305–8573, Tennodai, Tsukuba, Ibaraki, Japan
Hiroyuki Kitagawa
Information Technology Center, Nagoya University, 464-8601, Furo-cho, Chikusa-ku, Nagoya, Japan
Yoshiharu Ishikawa
Department of Computer Science, City University of Hong Kong, 83 Tat Chee Avenue, Kowloon, Hong Kong, China
Qing Li
Department of Information Science, Ochanomizu University, 2-1-1, Otsuka, Bunkyo-ku, 112-8610, Tokyo, Japan
Chiemi Watanabe

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Krueger, J., Grund, M., Tinnefeld, C., Plattner, H., Zeier, A., Faerber, F. (2010). Optimizing Write Performance for Read Optimized Databases. In: Kitagawa, H., Ishikawa, Y., Li, Q., Watanabe, C. (eds) Database Systems for Advanced Applications. DASFAA 2010. Lecture Notes in Computer Science, vol 5982. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-12098-5_23

Download citation

DOI: https://doi.org/10.1007/978-3-642-12098-5_23
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-12097-8
Online ISBN: 978-3-642-12098-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics