ABSTRACT
Data reduction technologies have proven effective at curbing the ever-growing demand for storage capacity, but they also introduce new complexity into the system I/O stack that can easily invalidate well-known best practices. In this paper, we conduct an extensive set of experiments on an enterprise all-flash storage (AFS) system equipped with an open-source data reduction module, RedHat VDO, and report novel observations on the performance gap between the state-of-the-art and the optimal AFS stack with integrated data reduction. We then propose cross-layer optimizations to enhance AFS performance, ranging from new optimal hardware RAID configurations to modifications of the enterprise storage stack tailored to the major bottlenecks observed. By implementing all proposed optimizations in an enterprise AFS, we show up to 12.5x speedup over the baseline AFS with integrated data reduction, and up to 57x performance/cost improvement over an optimized AFS (with no data reduction) for 100% write, high-reduction workload scenarios.
- Mohammadamin Ajdari, Patrick Raaf, Mostafa Kishani, Reza Salkhordeh, Hossein Asadi, and André Brinkmann. 2022. An Enterprise-Grade Open-Source Data Reduction Architecture for All-Flash Storage Systems. Proceedings of the ACM on Measurement and Analysis of Computing Systems (POMACS), Vol. 6, 2 (2022), 27.
- Louis Imershein. 2018. Open Source Data Reduction for High Performance Flash Storage. In Flash Memory Summit, 2018, Santa Clara, CA, USA. https://www.flashmemorysummit.com/English/Collaterals/Proceedings/2018/20180808_SOFT-202-1_Imershein.pdf