Skip to main content

SSD-Aware Temporary Data Management Policy for Improving Query Performance

  • Conference paper
  • 5793 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 8485))

Abstract

As data volume grows rapidly and queries become more complex, database engine has to deal with larger amount of temporary data when performing operators such as sort and join. Generally, these temporary data are organized as small temporary files stored in HDD. Due to the poor access performance on HDD, processing time of I/O operations on these files has a direct impact on the response time of queries. Compared to HDD, solid-state drive (SSD) offers more random IOPS and comparable sequential bandwidth. So, using SSD to replace HDD may be an ideal pattern when dealing with temporary data. In this paper, we find out that query processing performing improvement is unsatisfactory if we store temporary files in SSD directly. Thus, we propose a SSD aware temporary data management policy called STDM. STDM takes both the I/O behaviors of temporary data and SSD physical characteristics into account. Temporary data are cached in memory at first, and then written to SSD in an append-only fashion. In this way, we reduce random writes and take full advantage of fast random reads on SSD. We implement a prototype based on PostgreSQL and evaluate its performance on TPC-H benchmark. Experiment result shows that STDM improves query processing performance dramatically compared to the traditional method for organizing temporary data.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Canim, M., Bhattacharjee, B., Mihaila, G.A., Lang, C.A., Ross, K.A.: An object placement advisor for db2 using solid state storage. In: PVLDB, pp. 1318–1329 (2009)

    Google Scholar 

  2. Agrawal, N., Prabhakaran, V., Wobber, T., Davis, J.D., Manasse, M.S., Panigrahy, R.: Design tradeoffs for ssd performance. In: USENIX Annual Technical Conference 2008, pp. 57–70 (2008)

    Google Scholar 

  3. Yang, Q., Ren, J.: I-cash: Intelligently coupled array of ssd and hdd. In: HPCA 2011, pp. 278–289 (2011)

    Google Scholar 

  4. Lee, S.-W., Moon, B.: Design of flash-based DBMS: An in-page logging approach. In: SIGMOD, pp. 55–66 (2007)

    Google Scholar 

  5. Shah, M., Harizopoulos, S., Wiener, J., Graefe, G.: Fast scans and joins using flash drives. In: Proc. of DaMoN Conf., pp. 17–24. ACM Press, New York (2008)

    Google Scholar 

  6. Tsirogiannis, D., Harizopoulos, S., Shah, M.A., Wiener, J.L., Graefe, G.: Query processing techniques for solid state drives. In: SIGMOD, pp. 59–72 (2009)

    Google Scholar 

  7. Shah, M.A., Harizopoulos, S., Wiener, J.L.: andG. Graefe. Fast scans and joins using flash drives. In: DaMoN, pp. 17–24 (2008)

    Google Scholar 

  8. Li, Y., On, S.T., Xu, J., Choi, B., Hu, H.: DigestJoin: Exploiting Fast Random Reads for Flash-based Joins. In: MDM, pp. 152–161 (2009)

    Google Scholar 

  9. Roh, H., Park, S., Kim, S., Shin, M., Lee, S.-W.: B+-tree index optimization by exploiting internal parallelism of flash-based solid state drives. Proceedings of the Very Large Data Base (VLDB) Endowment 5(4), 286–297 (2012)

    Google Scholar 

  10. Lai, W., Fan, Y., Meng, X.: Scan and Join Optimization by Exploiting Internal Parallelism of Flash-Based Solid State Drives. In: Wang, J., Xiong, H., Ishikawa, Y., Xu, J., Zhou, J. (eds.) WAIM 2013. LNCS, vol. 7923, pp. 381–392. Springer, Heidelberg (2013)

    Chapter  Google Scholar 

  11. Luo, T., Lee, R., Mesnier, M.P., Chen, F., Zhang, X.: hStorage-DB: Heterogeneity aware data management to exploit the full capability of hybrid storage systems. In: PVLDB, pp. 1076–1087 (2012)

    Google Scholar 

  12. Lee, S.W., Moon, B., Park, C., Kim, J.M., Kim, S.W.: A case for ash memory ssd in enterprise database applications. In: SIGMOD Conference 2008, pp. 1075–1086 (2008)

    Google Scholar 

  13. Lee, S.-W., Moon, B., Park, C.: Advances in flash memory SSD technology for enterprise database applications. In: SIGMOD, pp. 863–870 (2009)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Guo, Z., Wang, J., Wang, C., Meng, X. (2014). SSD-Aware Temporary Data Management Policy for Improving Query Performance. In: Li, F., Li, G., Hwang, Sw., Yao, B., Zhang, Z. (eds) Web-Age Information Management. WAIM 2014. Lecture Notes in Computer Science, vol 8485. Springer, Cham. https://doi.org/10.1007/978-3-319-08010-9_44

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-08010-9_44

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-08009-3

  • Online ISBN: 978-3-319-08010-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics