Abstract
As data volume grows rapidly and queries become more complex, database engine has to deal with larger amount of temporary data when performing operators such as sort and join. Generally, these temporary data are organized as small temporary files stored in HDD. Due to the poor access performance on HDD, processing time of I/O operations on these files has a direct impact on the response time of queries. Compared to HDD, solid-state drive (SSD) offers more random IOPS and comparable sequential bandwidth. So, using SSD to replace HDD may be an ideal pattern when dealing with temporary data. In this paper, we find out that query processing performing improvement is unsatisfactory if we store temporary files in SSD directly. Thus, we propose a SSD aware temporary data management policy called STDM. STDM takes both the I/O behaviors of temporary data and SSD physical characteristics into account. Temporary data are cached in memory at first, and then written to SSD in an append-only fashion. In this way, we reduce random writes and take full advantage of fast random reads on SSD. We implement a prototype based on PostgreSQL and evaluate its performance on TPC-H benchmark. Experiment result shows that STDM improves query processing performance dramatically compared to the traditional method for organizing temporary data.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Canim, M., Bhattacharjee, B., Mihaila, G.A., Lang, C.A., Ross, K.A.: An object placement advisor for db2 using solid state storage. In: PVLDB, pp. 1318–1329 (2009)
Agrawal, N., Prabhakaran, V., Wobber, T., Davis, J.D., Manasse, M.S., Panigrahy, R.: Design tradeoffs for ssd performance. In: USENIX Annual Technical Conference 2008, pp. 57–70 (2008)
Yang, Q., Ren, J.: I-cash: Intelligently coupled array of ssd and hdd. In: HPCA 2011, pp. 278–289 (2011)
Lee, S.-W., Moon, B.: Design of flash-based DBMS: An in-page logging approach. In: SIGMOD, pp. 55–66 (2007)
Shah, M., Harizopoulos, S., Wiener, J., Graefe, G.: Fast scans and joins using flash drives. In: Proc. of DaMoN Conf., pp. 17–24. ACM Press, New York (2008)
Tsirogiannis, D., Harizopoulos, S., Shah, M.A., Wiener, J.L., Graefe, G.: Query processing techniques for solid state drives. In: SIGMOD, pp. 59–72 (2009)
Shah, M.A., Harizopoulos, S., Wiener, J.L.: andG. Graefe. Fast scans and joins using flash drives. In: DaMoN, pp. 17–24 (2008)
Li, Y., On, S.T., Xu, J., Choi, B., Hu, H.: DigestJoin: Exploiting Fast Random Reads for Flash-based Joins. In: MDM, pp. 152–161 (2009)
Roh, H., Park, S., Kim, S., Shin, M., Lee, S.-W.: B+-tree index optimization by exploiting internal parallelism of flash-based solid state drives. Proceedings of the Very Large Data Base (VLDB) Endowment 5(4), 286–297 (2012)
Lai, W., Fan, Y., Meng, X.: Scan and Join Optimization by Exploiting Internal Parallelism of Flash-Based Solid State Drives. In: Wang, J., Xiong, H., Ishikawa, Y., Xu, J., Zhou, J. (eds.) WAIM 2013. LNCS, vol. 7923, pp. 381–392. Springer, Heidelberg (2013)
Luo, T., Lee, R., Mesnier, M.P., Chen, F., Zhang, X.: hStorage-DB: Heterogeneity aware data management to exploit the full capability of hybrid storage systems. In: PVLDB, pp. 1076–1087 (2012)
Lee, S.W., Moon, B., Park, C., Kim, J.M., Kim, S.W.: A case for ash memory ssd in enterprise database applications. In: SIGMOD Conference 2008, pp. 1075–1086 (2008)
Lee, S.-W., Moon, B., Park, C.: Advances in flash memory SSD technology for enterprise database applications. In: SIGMOD, pp. 863–870 (2009)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Guo, Z., Wang, J., Wang, C., Meng, X. (2014). SSD-Aware Temporary Data Management Policy for Improving Query Performance. In: Li, F., Li, G., Hwang, Sw., Yao, B., Zhang, Z. (eds) Web-Age Information Management. WAIM 2014. Lecture Notes in Computer Science, vol 8485. Springer, Cham. https://doi.org/10.1007/978-3-319-08010-9_44
Download citation
DOI: https://doi.org/10.1007/978-3-319-08010-9_44
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-08009-3
Online ISBN: 978-3-319-08010-9
eBook Packages: Computer ScienceComputer Science (R0)