DOI: 10.1145/3538643.3539743
research-article

Compaction-aware zone allocation for LSM based key-value store on ZNS SSDs

Published: 27 June 2022

Abstract

Unlike traditional block-based SSDs, Zoned Namespace (ZNS) SSDs expose storage through the zoned block interface, which eliminates the need for in-device garbage collection (GC) and hands that responsibility to applications. Applications on the host can therefore perform GC efficiently by making application-aware data placement decisions. RocksDB on ZNS SSDs uses ZenFS, a user-level file system, and its Lifetime-based Zone Allocation algorithm (LIZA) to place data with similar invalidation times (lifetimes) in the same zone, which minimizes the overhead of copying valid data when a zone is reclaimed. However, LIZA predicts the lifetime of each SSTable from its level in the hierarchical structure of the LSM-tree, and because these predictions are inaccurate, it is very inefficient at reducing write amplification (WA). Instead, based on our observation that the deletion time of SSTables in the LSM-tree is determined solely by the compaction process, we propose a novel Compaction-Aware Zone Allocation algorithm (CAZA), which places newly created SSTables so that they can be deleted together when they are merged in the future. CAZA is implemented in RocksDB's ZenFS, and our extensive evaluations show that CAZA significantly reduces the WA overhead compared to LIZA.
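
The contrast between the two placement policies can be illustrated with a small sketch. This is not the authors' implementation or the ZenFS code: the `SSTable` and `Zone` types, the function names, and the specific rule used for the compaction-aware case (prefer the zone holding the most adjacent-level SSTables whose key ranges overlap the new one, on the assumption that overlapping SSTables are the ones likely to be compacted, and hence deleted, together) are all hypothetical and chosen only to make the idea concrete.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class SSTable:
    """Hypothetical SSTable descriptor: LSM-tree level plus inclusive key range."""
    name: str
    level: int
    min_key: str
    max_key: str


@dataclass
class Zone:
    """Hypothetical zone that records which SSTables were written to it."""
    zone_id: int
    tables: list = field(default_factory=list)


def overlaps(a, b):
    """True if the inclusive key ranges of two SSTables intersect."""
    return a.min_key <= b.max_key and b.min_key <= a.max_key


def first_empty(zones):
    """Fallback used by both policies: the first zone with no data, or None."""
    return next((z for z in zones if not z.tables), None)


def pick_zone_by_level(zones, new_table):
    """Level-as-lifetime policy: reuse a zone that already holds same-level tables."""
    for zone in zones:
        if any(t.level == new_table.level for t in zone.tables):
            return zone
    return first_empty(zones)


def pick_zone_by_compaction(zones, new_table):
    """Compaction-aware policy (assumed rule): prefer the zone with the most
    adjacent-level tables whose key ranges overlap the new table."""
    def score(zone):
        return sum(
            1
            for t in zone.tables
            if abs(t.level - new_table.level) == 1 and overlaps(t, new_table)
        )

    best = max(zones, key=score)
    return best if score(best) > 0 else first_empty(zones)


zones = [
    Zone(0, [SSTable("a", 1, "a", "f")]),  # same level as the new table, disjoint keys
    Zone(1, [SSTable("b", 2, "m", "z")]),  # next level, overlapping keys
    Zone(2),                               # empty zone
]
new_table = SSTable("n", 1, "p", "t")

level_choice = pick_zone_by_level(zones, new_table)
compaction_choice = pick_zone_by_compaction(zones, new_table)
print(level_choice.zone_id, compaction_choice.zone_id)
```

In this toy setup the level-based policy groups the new SSTable with an unrelated same-level SSTable, while the compaction-aware policy groups it with the next-level SSTable it overlaps. If those two are later merged by one compaction, both become invalid at the same time and the zone can be reclaimed without copying valid data.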

    Published In

    HotStorage '22: Proceedings of the 14th ACM Workshop on Hot Topics in Storage and File Systems
    June 2022
    141 pages
    ISBN:9781450393997
    DOI:10.1145/3538643
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    In-Cooperation

    • USENIX Assoc

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Author Tags

    1. ZNS SSD
    2. key-value store
    3. log-structured merge-tree

    Qualifiers

    • Research-article

    Funding Sources

    • National Research Foundation of Korea (NRF) by the Korea government (MSIT)

    Conference

    HotStorage '22
    Acceptance Rates

    HotStorage '22 paper acceptance rate: 19 of 47 submissions (40%)
    Overall acceptance rate: 34 of 87 submissions (39%)

    Article Metrics

    • Downloads (last 12 months): 190
    • Downloads (last 6 weeks): 13
    Reflects downloads up to 17 Feb 2025

    Cited By

    • (2024) Storage Abstractions for SSDs: The Past, Present, and Future. ACM Transactions on Storage 21:1 (1-44). DOI: 10.1145/3708992. Online publication date: 30-Dec-2024
    • (2024) BIZA: Design of Self-Governing Block-Interface ZNS AFA for Endurance and Performance. Proceedings of the ACM SIGOPS 30th Symposium on Operating Systems Principles (313-329). DOI: 10.1145/3694715.3695953. Online publication date: 4-Nov-2024
    • (2024) Optimizing Garbage Collection for ZNS SSDs via In-storage Data Migration and Address Remapping. ACM Transactions on Architecture and Code Optimization 21:4 (1-25). DOI: 10.1145/3689336. Online publication date: 20-Nov-2024
    • (2024) Exploring I/O Management Performance in ZNS with ConfZNS++. Proceedings of the 17th ACM International Systems and Storage Conference (162-177). DOI: 10.1145/3688351.3689160. Online publication date: 16-Sep-2024
    • (2024) An Adaptive Zone-Grouping Scheme Enabling General-Purpose File Systems on ZNS SSDs. Proceedings of the 17th ACM International Systems and Storage Conference (132-145). DOI: 10.1145/3688351.3689151. Online publication date: 16-Sep-2024
    • (2024) Hi-ZNS: High Space Efficiency and Zero-Copy LSM-Tree Based Stores on ZNS SSDs. Proceedings of the 53rd International Conference on Parallel Processing (1217-1226). DOI: 10.1145/3673038.3673096. Online publication date: 12-Aug-2024
    • (2024) Can ZNS SSDs be Better Storage Devices for Persistent Cache? Proceedings of the 16th ACM Workshop on Hot Topics in Storage and File Systems (55-62). DOI: 10.1145/3655038.3665946. Online publication date: 8-Jul-2024
    • (2024) Balloon-ZNS: Constructing High-Capacity and Low-Cost ZNS SSDs with Built-in Compression. Proceedings of the 61st ACM/IEEE Design Automation Conference (1-6). DOI: 10.1145/3649329.3657368. Online publication date: 23-Jun-2024
    • (2024) Fair-ZNS: Enhancing Fairness in ZNS SSDs Through Self-Balancing I/O Scheduling. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems 43:7 (2012-2022). DOI: 10.1109/TCAD.2022.3232997. Online publication date: Jul-2024
    • (2024) Land of Oz: Resolving Orderless Writes in Zoned Namespace SSDs. IEEE Transactions on Computers 73:11 (2520-2533). DOI: 10.1109/TC.2024.3441866. Online publication date: 1-Nov-2024
