skip to main content
10.1145/2463676.2465295acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
research-article

Query processing on smart SSDs: opportunities and challenges

Published: 22 June 2013 Publication History

Abstract

Data storage devices are getting "smarter." Smart Flash storage devices (a.k.a. "Smart SSD") are on the horizon and will package CPU processing and DRAM storage inside a Smart SSD, and make that available to run user programs inside a Smart SSD. The focus of this paper is on exploring the opportunities and challenges associated with exploiting this functionality of Smart SSDs for relational analytic query processing. We have implemented an initial prototype of Microsoft SQL Server running on a Samsung Smart SSD. Our results demonstrate that significant performance and energy gains can be achieved by pushing selected query processing components inside the Smart SSDs. We also identify various changes that SSD device manufacturers can make to increase the benefits of using Smart SSDs for data processing applications, and also suggest possible research opportunities for the database community.

References

[1]
A Technical Overview of the Oracle Exadata Database Machine and Exadata Storage Server. White Paper, Oracle Corp, 2012.
[2]
D. Agrawal, D. Ganesan, R. K. Sitaraman, Y. Diao, and S. Singh. Lazy-Adaptive Tree: An Optimized Index Structure for Flash Devices. PVLDB, 2009.
[3]
A. Ailamaki, D. J. DeWitt, M. D. Hill, and M. Skounakis. Weaving Relations for Cache Performance. In VLDB, 2001.
[4]
ARM Developer Suite. http://infocenter.arm.com/help/topic/com.arm.doc.dui0068b/DUI0060.pdf
[5]
B. Bhattacharjee, C. Lang, G. A. Mihaila, K. A. Ross, and M. Banikazemi. Enhancing Recovery Using an SSD Buffer Pool Extension. In DaMoN, 2011.
[6]
S. Boboila, Y. Kim, S. S. Vazhkudai, P. Desnoyers, and G. M. Shipman. Active Flash: Out-of-core Data Analytics on Flash Storage. In MSST, 2012.
[7]
M. Canim, G. A. Mihaila, B. Bhattacharjee, K. A. Ross, and C. A. Lang. SSD Bufferpool Extensions for Database Systems. In VLDB, 2010.
[8]
S. Cho, C. Park, H. Oh, S. Kim, Y. Yi, and G. Ganger. Active Disk Meets Flash: A Case for Intelligent SSDs. Carnegie Mellon University Technical Report, 2011.
[9]
J. Do and J. M. Patel. Join Processing for Flash SSDs: Remembering Past Lessons. In DaMoN, 2009.
[10]
J. Do, D. Zhang, J. M. Patel, and D. J. DeWitt. Fast Peak-to-Peak Behavior with SSD Buffer Pool. In ICDE, 2013.
[11]
J. Do, D. Zhang, J. M. Patel, and D. J. DeWitt, J. F. Naughton, and A. Halverson. Turbocharging DBMS Buffer Pool Using SSDs. In SIGMOD, 2011.
[12]
P. Francisco. The Netezza Data Appliance Architecture: A Platform for High Performance Data Warehousing and Analytics. IBM Redbook, 2011.
[13]
J. Gray. Tape is Dead, Disk is Tape, Flash is Disk, RAM Locality is King, 2006. http://research.microsoft.com/en-us/um/people/gray/talks/ Flash_is_Good.ppt
[14]
IBM XIV Storage System. http://www.ibm.com/systems/storage/disk/xiv/index.html
[15]
Iometer. http://www.iometer.org
[16]
K. Keeton, D. A. Patterson, and J. M. Hellerstein. A Case for Intelligent Disks (IDISKs). In SIGMOD Record, vol. 27, 1998.
[17]
S. Kim, H. Oh, C. Park, S. Cho, and S-W. Lee. Fast, Energy Efficient Scan inside Flash Memory SSDs. In ADMS, 2011
[18]
I. Koltsidas and S. Viglas. Data Management over Flash Memory. SIGMOD Tutorial, 2011.
[19]
I. Koltsidas and S. Viglas. Designing a Flash-Aware Two-Level Cache. In ADBIS, 2011.
[20]
S.-W. Lee and B. Moon. Design of Flash-Based DBMS: An In-Page Logging Approach. In SIGMOD, 2007.
[21]
Y. Li, B. He, R. J. Yang, Q. Luo, and K. Yi. Tree Indexing on Solid State Drives. PVLDB, 2010.
[22]
LSI, SAS 9211-4i HBA. http://www.lsi.com/channel/products/storagecomponents/Pages/LSISAS9211-4i.aspx
[23]
Microsoft SQL Server 2012. http://www.microsoft.com/sqlserver
[24]
R. Mueller, J. Teubner, and G. Alonso. Data Processing on FPGAs. PVLDB, 2009.
[25]
R. Mueller and J. Teubner. FPGA: What's in it for a Database? In SIGMOD, 2009.
[26]
Power and Temperature Measurement Setup Guide. http://spec.org/power/docs/SPEC-Power_Measurement_Setup_Guide.pdf
[27]
E. Riedel, C. Faloutsos, and D. F. Nagle. Active Disk Architecture for Databases. Carnegie Mellon University Technical Report, 2000.
[28]
E. Riedel, G. A. Gibson, and C. Faloutsos. Active Storage for Large-Scale Data Mining and Multimedia. In VLDB, 1998.
[29]
Teradata. Virtual Storage. http://www.teradata.com/t/brochures/Teradata-Virtual-Storage-eb5944
[30]
TPC Benchmark H (TPC-H). http://www.tpc.org/tpch
[31]
Trace32, Lauterbach Development Tools. http://www.lauterbach.com
[32]
D. Tsirogiannis, S. Harizopoulos, M. A. Shah, J. L. Wiener, and G. Graefe. Query Processing Techniques for Solid State Drives. In SIGMOD, 2009.
[33]
C.-H. Wu, T.-W. Kuo, and L.-P. Chang. An Efficient B-tree Layer Implementation for Flash-Memory Storage Systems. ACM TECS, 2007.

Cited By

View all
  • (2025)Fusion: An Analytics Object Store Optimized for Query PushdownProceedings of the 30th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 110.1145/3669940.3707234(540-556)Online publication date: 3-Feb-2025
  • (2025)StorStack: A full-stack design for in-storage file systemsJournal of Systems Architecture10.1016/j.sysarc.2025.103348160(103348)Online publication date: Mar-2025
  • (2024)OmniCacheProceedings of the 22nd USENIX Conference on File and Storage Technologies10.5555/3650697.3650700(35-50)Online publication date: 27-Feb-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
SIGMOD '13: Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data
June 2013
1322 pages
ISBN:9781450320375
DOI:10.1145/2463676
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 22 June 2013

Permissions

Request permissions for this article.

Check for updates

Author Tag

  1. smart ssd

Qualifiers

  • Research-article

Conference

SIGMOD/PODS'13
Sponsor:

Acceptance Rates

SIGMOD '13 Paper Acceptance Rate 76 of 372 submissions, 20%;
Overall Acceptance Rate 785 of 4,003 submissions, 20%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)204
  • Downloads (Last 6 weeks)27
Reflects downloads up to 16 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2025)Fusion: An Analytics Object Store Optimized for Query PushdownProceedings of the 30th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 110.1145/3669940.3707234(540-556)Online publication date: 3-Feb-2025
  • (2025)StorStack: A full-stack design for in-storage file systemsJournal of Systems Architecture10.1016/j.sysarc.2025.103348160(103348)Online publication date: Mar-2025
  • (2024)OmniCacheProceedings of the 22nd USENIX Conference on File and Storage Technologies10.5555/3650697.3650700(35-50)Online publication date: 27-Feb-2024
  • (2024)Adaptive DRAM Cache Division for Computational Solid-state Drives2024 Design, Automation & Test in Europe Conference & Exhibition (DATE)10.23919/DATE58400.2024.10546745(1-6)Online publication date: 25-Mar-2024
  • (2024)Storage Abstractions for SSDs: The Past, Present, and FutureACM Transactions on Storage10.1145/370899221:1(1-44)Online publication date: 30-Dec-2024
  • (2024)SmartGraph: A Framework for Graph Processing in Computational StorageProceedings of the 2024 ACM Symposium on Cloud Computing10.1145/3698038.3698538(737-754)Online publication date: 20-Nov-2024
  • (2024)Context-aware Prefetching for Near-Storage AcceleratorsProceedings of the 16th ACM Workshop on Hot Topics in Storage and File Systems10.1145/3655038.3665956(131-136)Online publication date: 8-Jul-2024
  • (2024)Sting: Near-storage accelerator framework for scalable triangle counting and beyondProceedings of the 61st ACM/IEEE Design Automation Conference10.1145/3649329.3658265(1-6)Online publication date: 23-Jun-2024
  • (2024)SIMDified Data Processing - Foundations, Abstraction, and Advanced TechniquesCompanion of the 2024 International Conference on Management of Data10.1145/3626246.3654694(613-621)Online publication date: 9-Jun-2024
  • (2024)Asynchronous Compaction Acceleration Scheme for Near-data Processing-enabled LSM-tree-based KV StoresACM Transactions on Embedded Computing Systems10.1145/362609723:6(1-33)Online publication date: 11-Sep-2024
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media