research-article

Public Access

Towards low-latency I/O services for mixed workloads using ultra-low latency SSDs

Authors:

Liting HuAuthors Info & Claims

ICS '22: Proceedings of the 36th ACM International Conference on Supercomputing

Article No.: 13, Pages 1 - 12

https://doi.org/10.1145/3524059.3532378

Published: 28 June 2022 Publication History

Abstract

Low-latency I/O services are essential for latency-sensitive workloads when they co-run with throughput-oriented workloads in cloud data centers. Although advanced SSDs such as Intel Optane SSDs can offer ultra-low latency at the device layer, I/O interference among various workloads through the I/O stack can still significantly enlarge I/O latency. It is still an open problem to best utilize ultra-low latency SSDs in cloud computing environments.

In this paper, we analyze the entire I/O stack and reveal that I/O interference is mainly attributed to resource contention in the SSD device, transactions commit in the file system, and costly process scheduling. To address these problems, we propose FastResponse, a holistic approach to use ultra-low latency SSDs for latency-sensitive workloads. First, we propose a new I/O scheduler at the block layer to throttle I/O requests of throughput-oriented workloads, and thus reduce the resource contention in the SSD device. Second, we develop a fine-grained journaling scheme to reduce the latency of transaction at the file system layer. Third, we redesign Completely Fair Scheduler (CFS) to promote the priority of latency-sensitive processes. We implement FastResponse in Linux kernel and evaluate it with several mixed workloads. Compared with the vanilla Linux and the state-of-the-art SelectISR, FastResponse can reduce the average response time of latency-sensitive workloads by 18--70% and 10--67%, respectively, and reduce the 99.9th percentile response time by 58--80% and 52--78%, respectively. Meanwhile, the performance degradation for throughput-oriented workloads is less than 6%.

References

[1]

Călin Iorgulescu, Reza Azimi, Youngjin Kwon, Sameh Elnikety, Manoj Syamala, Vivek Narasayya, Jack Zhang, and Junhua Wang. Perfiso: Performance Isolation for Commercial Latency-Sensitive Services. In Proceedings of 2018 USENIX Annual Technical Conference (ATC '18), pages 519--532, 2018.

[2]

Luiz Barroso, Mike Marty, David Patterson, and Parthasarathy Ranganathan. Attack of the Killer Microseconds. Communications of the ACM, 60(4):48--54, 2017.

Digital Library

[3]

Amy Ousterhout, Joshua Fried, Jonathan Behrens, Adam Belay, and Hari Balakrishnan. Shenango: Achieving High CPU Efficiency for Latency-Sensitive Datacenter Workloads. In Proceedings of 16th USENIX Symposium on Networked Systems Design and Implementation (NSDI '19), pages 361--378, 2019.

[4]

Shaohong Li, Xi Wang, Xiao Zhang, Vasileios Kontorinis, Sreekumar Kodakara, David Lo, and Parthasarathy Ranganathan. Thunderbolt: Throughput-Optimized, Quality-of-Service-Aware Power Capping at Scale. In Proceedings of 14th USENIX Symposium on Operating Systems Design and Implementation (OSDI '20), pages 1241--1255, 2020.

[5]

Mingzhe Hao, Levent Toksoz, Nanqinqin Li, Edward Edberg Halim, Henry Hoffmann, and Haryadi S. Gunawi. LinnOS: Predictability on Unpredictable Flash Storage with a Light Neural Network. In Proceedings of 14th USENIX Symposium on Operating Systems Design and Implementation (OSDI '20), pages 173--190, 2020.

[6]

Joshua Fried, Zhenyuan Ruan, Amy Ousterhout, and Adam Belay. Caladan: Mitigating Interference at Microsecond Timescales. In Proceedings of 14th USENIX Symposium on Operating Systems Design and Implementation (OSDI'20), pages 281--297, 2020.

[7]

Yu Gan, Yanqi Zhang, Kelvin Hu, Dailun Cheng, Yuan He, Christina Delimitrou, and Christina Delimitrou. Seer: Leveraging Big Data to Navigate the Complexity of Performance Debugging in Cloud Microservices. In Proceedings of the 24th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS '19), pages 19--33, 2019.

[8]

Intel Optane SSD 900P Series. https://www.intel.com/content/www/us/en/products/docs/memory-storage/solid-state-drives/optane-ssd-900p-brief.html, 2017.

[9]

Samsung Z-NAND SSD. https://www.samsung.com/us/labs/pdfs/collateral/Samsung_Z-NAND_Technology_Brief_v5.pdf, 2017.

[10]

XL-Flash 3D NAND. https://www.anandtech.com/show/13183/toshiba-announces-xlflash-lowlatency-3d-nand, 2018.

[11]

Jie Zhang, Miryeong Kwon, Donghyun Gouk, Sungjoon Koh, Changlim Lee, Mohammad Alian, Myoungjun Chun, Mahmut Taylan Kandemir, Nam Sung Kim, Jihong Kim, and Myoungsoo Jung. FLASHSHARE: Punching Through Server Storage Stack from Kernel to Firmware for Ultra-Low Latency SSDs. In Proceedings of 13th USENIX Symposium on Operating Systems Design and Implementation (OSDI '18), pages 477--492, 2018.

[12]

Jiahao Liu, Fang Wang, and Dan Feng. CostPI: Cost-Effective Performance Isolation for Shared NVMe SSDs. In Proceedings of the 48th International Conference on Parallel Processing (ICPP '19), pages 1--10, 2019.

[13]

Sangwook Kim, Hwanju Kim, Joonwon Lee, and Jinkyu Jeong. Enlightening the I/O Path: A Holistic Approach for Application Performance. In Proceedings of 15th USENIX Conference on File and Storage Technologies (FAST '17), pages 345--358, 2017.

[14]

Sangwook Shane Hahn, Sungjin Lee, Inhyuk Yee, Donguk Ryu, and Jihong Kim. FastTrack: Foreground App-Aware I/O Management for Improving User Experience of Android Smartphones. In Proceedings of 2018 USENIX Annual Technical Conference (ATC '18), pages 15--28, 2018.

[15]

Suli Yang, Tyler Harter, Nishant Agrawal, Salini Selvaraj Kowsalya, Anand Krishnamurthy, Samer Al-Kiswany, Rini T. Kaushik, Andrea C. Arpaci-Dusseau, and Remzi H. Arpaci-Dusseau. Split-Level I/O Scheduling. In Proceedings of the 25th Symposium on Operating Systems Principles (SOSP '15), pages 474--489, 2015.

Digital Library

[16]

Till Miemietz, Hannes Weisbach, Michael Roitzsch, and Hermann Härtig. K2: Work-Constraining Scheduling of NVMe-Attached Storage. In Proceedings of 2019 IEEE Real-Time Systems Symposium (RTSS '19), pages 56--68, 2019.

[17]

Jiwon Woo, Minwoo Ahn, Gyusun Lee, and Jinkyu Jeong. D2FQ: Device-Direct Fair Queueing for NVMe SSDs. In Proceedings of 19th USENIX Conference on File and Storage Technologies (FAST '21), pages 403--415, 2021.

[18]

Arash Tavakkol, Mohammad Sadrosadati, Saugata Ghose, Jeremie Kim, Yixin Luo, Yaohua Wang, Nika Mansouri Ghiasi, Lois Orosa, Juan Gómez-Luna, and Onur Mutlu. FLIN: Enabling Fairness and Enhancing Performance in Modern NVMe Solid State Drives. In Proceedings of 2018 ACM/IEEE 45th Annual International Symposium on Computer Architecture (ISCA '18), pages 397--410, 2018.

[19]

FastResponse. https://github.com/CGCL-codes/FastResponse, 2022.

[20]

RocksDB. https://github.com/facebook/rocksdb, 2017.

[21]

Shengsheng Huang, Jie Huang, Jinquan Dai, Tao Xie, and Bo Huang. The Hi-Bench Benchmark Suite: Characterization of the MapReduce-Based Data Analysis. In Proceedings of 2010 IEEE 26th International Conference on Data Engineering Workshops (ICDEW '10), pages 41--51, 2010.

[22]

FIO: Flexible I/O Tester. https://github.com/axboe/fio, 2017.

[23]

Paolo Valente and Arianna Avanzini. Evolution of the BFQ Storage-I/O Scheduler. In Proceedings of 2015 Mobile Systems Technologies Workshop (MST '15), pages 15--20, 2015.

[24]

Block layer introduction. https://lwn.net/Articles/738449/, 2017.

[25]

Gyusun Lee, Seokha Shin, Wonsuk Song, Tae Jun Ham, Jae W. Lee, and Jinkyu Jeong. Asynchronous I/O Stack: A Low-Latency Kernel I/O Stack for Ultra-Low Latency SSDs. In Proceedings of 2019 USENIX Annual Technical Conference (ATC '19), pages 603--616, 2019.

[26]

Adrian M. Caulfield, Arup De, Joel Coburn, Todor I. Mollow, Rajesh K. Gupta, and Steven Swanson. Moneta: A High-Performance Storage Array Architecture for Next-Generation, Non-Volatile Memories. In Proceedings of 2010 43rd Annual IEEE/ACM International Symposium on Microarchitecture (MICRO '10), pages 385--395, 2010.

Digital Library

[27]

Amy Tai, Igor Smolyar, Michael Wei, and Dan Tsafrir. Optimizing Storage Performance with Calibrated Interrupts. In Proceedings of the 15th USENIX Symposium on Operating Systems Design and Implementation (OSDI '21), pages 129--145, 2021.

[28]

Jaehyun Hwang, Midhul Vuppalapati, Simon Peter, and Rachit Agarwal. Rearchitecting Linux Storage Stack for μs Latency and High Throughput. In Proceedings of the 15th USENIX Symposium on Operating Systems Design and Implementation (OSDI '21), pages 113--128, 2021.

[29]

Dbbench. https://github.com/facebook/rocksdb/wiki/Benchmarking-tools, 2017.

[30]

David Lo, Liqun Cheng, Rama Govindaraju, Luiz André Barroso, and Christos Kozyrakis. Towards Energy Proportionality for Large-Scale Latency-Critical Workloads. In Proceedings of 2014 ACM/IEEE 41st International Symposium on Computer Architecture (ISCA '14), pages 301--312, 2014.

[31]

Lei Wang, Jianfeng Zhan, Chunjie Luo, Yuqing Zhu, Qiang Yang, Yongqiang He, Wanling Gao, Shujie Zhang, Chen Zheng, Gang Lu, Kent Zhan, Xiaona Li, and Bizhu Qiu. Bigdatabench: A Big Data Benchmark Suite from Internet Services. In Proceedings of 2014 IEEE 20th International Symposium on High Performance Computer Architecture (HPCA '14), pages 488--499, 2014.

[32]

Thanumalayan Sankaranarayana Pillai, Ramnatthan Alagappan, Lanyue Lu, Vijay Chidambaram, Andrea C. Arpaci-Dusseau, and Remzi H. Arpaci-Dusseau. Application Crash Consistency and Performance with CCFS. ACM Transactions on Storage, 13(3):1 -- 29, 2017.

Digital Library

[33]

Daejun Park and Dongkun Shin. iJournaling: Fine-Grained Journaling for Improving the Latency of Fsync System Call. In Proceedings of 2017 USENIX Annual Technical Conference (ATC '17), pages 787--798, 2017.

[34]

Mohammad Hedayati, Kai Shen, Michael L. Scott, and Mike Marty. Multi-Queue Fair Queuing. In Proceedings of 2019 USENIX Annual Technical Conference (ATC '19), pages 301--314, 2019.

[35]

Kanchan Joshi, Kaushal Yadav, and Praval Choudhary. Enabling NVMe WRR Support in Linux Block Layer. In Proceedings of 9th USENIX Workshop on Hot Topics in Storage and File Systems (HotStorage '17), 2017.

[36]

Jisoo Yang, Dave B. Minturn, and Frank Hady. When Poll is Better than Interrupt. In Proceedings of the 10th USENIX Conference on File and Storage Technologies, (FAST '12), pages 25--31, 2012.

[37]

Sooman Jeong, Kisung Lee, Seongjin Lee, Seoungbum Son, and Youjip Won. I/O Stack Optimization for Smartphones. In Proceedings of 2013 USENIX Annual Technical Conference (ATC '13), pages 309--320, 2013.

[38]

Sungjoon Koh, Changrim Lee, Miryeong Kwon, and Myoungsoo Jung. Exploring System Challenges of Ultra-Low Latency Solid State Drives. In Proceedings of 10th USENIX Workshop on Hot Topics in Storage and File Systems (HotStorage '18), 2018.

[39]

I/O Latency Optimization with Polling. https://events.static.linuxfound.org/sites/events/files/slides/lemoal-nvme-polling-vault-2017-final_0.pdf, 2017.

[40]

Assaf Eisenman, Darryl Gardner, Islam AbdelRahman, Jens Axboe, Siying Dong, Kim Hazelwood, Chris Petersen, Asaf Cidon, and Sachin Katti. Reducing DRAM Footprint with NVM in Facebook. In Proceedings of the Thirteenth EuroSys Conference (EuroSys '18), pages 1--13, 2018.

Cited By

Park YPark JPark JKhan AKim KPark SKim Y(2024)OctoFAS: A Two-Level Fair Scheduler That Increases Fairness in Network-Based Key-Value StorageElectronics10.3390/electronics1303061913:3(619)Online publication date: 1-Feb-2024
https://doi.org/10.3390/electronics13030619
Zhou YZhou JLu KZhan LXu PWu PChen SLiu XWan J(2024)A Contract-aware and Cost-effective LSM Store for Cloud Storage with Low Latency SpikesACM Transactions on Storage10.1145/364385120:2(1-27)Online publication date: 4-Apr-2024
https://dl.acm.org/doi/10.1145/3643851
Zhou YWang FShi ZFeng D(2024)CoFS: A Collaboration-Aware Fairness Scheme for NVMe SSD in Cloud Storage SystemIEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems10.1109/TCAD.2024.341297043:12(4490-4504)Online publication date: Dec-2024
https://doi.org/10.1109/TCAD.2024.3412970
Show More Cited By

Index Terms

Towards low-latency I/O services for mixed workloads using ultra-low latency SSDs
1. General and reference
  1. Cross-computing tools and techniques
    1. Performance
2. Software and its engineering
  1. Software organization and properties
    1. Contextual software domains
      1. Operating systems

Recommendations

Reparo: A Fast RAID Recovery Scheme for Ultra-large SSDs
A recent ultra-large SSD (e.g., a 32-TB SSD) provides many benefits in building cost-efficient enterprise storage systems. Owing to its large capacity, however, when such SSDs fail in a RAID storage system, a long rebuild overhead is inevitable for RAID ...
SILK+ Preventing Latency Spikes in Log-Structured Merge Key-Value Stores Running Heterogeneous Workloads
Section: Best of ATC 2019 and Regular Paper

Log-Structured Merge Key-Value stores (LSM KVs) are designed to offer good write performance, by capturing client writes in memory, and only later flushing them to storage. Writes are later compacted into a tree-like data structure on disk to improve ...
High performance & low latency in solid-state drives through redundancy
INFLOW '13: Proceedings of the 1st Workshop on Interactions of NVM/FLASH with Operating Systems and Workloads

Solid-state drives are becoming increasingly popular in enterprise storage systems, playing the role of large caches and permanent storage. Although SSDs provide faster random access than hard-drives, their performance under read/write workloads is ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

ICS '22: Proceedings of the 36th ACM International Conference on Supercomputing

June 2022

514 pages

ISBN:9781450392815

DOI:10.1145/3524059

General Chairs:
Lawrence Rauchwerger
University of Illinois at Urbana-Champaign
,
Kirk Cameron
Virginia Tech
,
Program Chairs:
Dimitrios S. Nikolopoulos
Virginia Tech
,
Dionisios Pnevmatikatos
National Technical University of Athens

Copyright © 2022 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

SIGARCH: ACM Special Interest Group on Computer Architecture

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 28 June 2022

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

Conference

ICS '22

Sponsor:

SIGARCH

ICS '22: 2022 International Conference on Supercomputing

June 28 - 30, 2022

Virtual Event

Acceptance Rates

Overall Acceptance Rate 629 of 2,180 submissions, 29%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

5
Total Citations
View Citations
846
Total Downloads

Downloads (Last 12 months)378
Downloads (Last 6 weeks)54

Reflects downloads up to 27 Feb 2025

Other Metrics

View Author Metrics

Citations

Cited By

Park YPark JPark JKhan AKim KPark SKim Y(2024)OctoFAS: A Two-Level Fair Scheduler That Increases Fairness in Network-Based Key-Value StorageElectronics10.3390/electronics1303061913:3(619)Online publication date: 1-Feb-2024
https://doi.org/10.3390/electronics13030619
Zhou YZhou JLu KZhan LXu PWu PChen SLiu XWan J(2024)A Contract-aware and Cost-effective LSM Store for Cloud Storage with Low Latency SpikesACM Transactions on Storage10.1145/364385120:2(1-27)Online publication date: 4-Apr-2024
https://dl.acm.org/doi/10.1145/3643851
Zhou YWang FShi ZFeng D(2024)CoFS: A Collaboration-Aware Fairness Scheme for NVMe SSD in Cloud Storage SystemIEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems10.1109/TCAD.2024.341297043:12(4490-4504)Online publication date: Dec-2024
https://doi.org/10.1109/TCAD.2024.3412970
Wu ZChen YChang YKuo T(2023)APP: Enabling Soft Real-Time Execution on Densely-Populated Hybrid Memory System2023 60th ACM/IEEE Design Automation Conference (DAC)10.1109/DAC56929.2023.10247672(1-6)Online publication date: 9-Jul-2023
https://doi.org/10.1109/DAC56929.2023.10247672
Oliveira GGhose SGómez-Luna JBoroumand ASavery ARao SQazi SGrignou GThakur RShiu EMutlu O(2023)Extending Memory Capacity in Modern Consumer Systems With Emerging Non-Volatile Memory: Experimental Analysis and Characterization Using the Intel Optane SSDIEEE Access10.1109/ACCESS.2023.331788411(105843-105871)Online publication date: 2023
https://doi.org/10.1109/ACCESS.2023.3317884

View Options

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Figures

Tables

Media

View Table of Conten