skip to main content
10.1145/3524059.3532378acmconferencesArticle/Chapter ViewAbstractPublication PagesicsConference Proceedingsconference-collections
research-article
Public Access

Towards low-latency I/O services for mixed workloads using ultra-low latency SSDs

Published: 28 June 2022 Publication History

Abstract

Low-latency I/O services are essential for latency-sensitive workloads when they co-run with throughput-oriented workloads in cloud data centers. Although advanced SSDs such as Intel Optane SSDs can offer ultra-low latency at the device layer, I/O interference among various workloads through the I/O stack can still significantly enlarge I/O latency. It is still an open problem to best utilize ultra-low latency SSDs in cloud computing environments.
In this paper, we analyze the entire I/O stack and reveal that I/O interference is mainly attributed to resource contention in the SSD device, transactions commit in the file system, and costly process scheduling. To address these problems, we propose FastResponse, a holistic approach to use ultra-low latency SSDs for latency-sensitive workloads. First, we propose a new I/O scheduler at the block layer to throttle I/O requests of throughput-oriented workloads, and thus reduce the resource contention in the SSD device. Second, we develop a fine-grained journaling scheme to reduce the latency of transaction at the file system layer. Third, we redesign Completely Fair Scheduler (CFS) to promote the priority of latency-sensitive processes. We implement FastResponse in Linux kernel and evaluate it with several mixed workloads. Compared with the vanilla Linux and the state-of-the-art SelectISR, FastResponse can reduce the average response time of latency-sensitive workloads by 18--70% and 10--67%, respectively, and reduce the 99.9th percentile response time by 58--80% and 52--78%, respectively. Meanwhile, the performance degradation for throughput-oriented workloads is less than 6%.

References

[1]
Călin Iorgulescu, Reza Azimi, Youngjin Kwon, Sameh Elnikety, Manoj Syamala, Vivek Narasayya, Jack Zhang, and Junhua Wang. Perfiso: Performance Isolation for Commercial Latency-Sensitive Services. In Proceedings of 2018 USENIX Annual Technical Conference (ATC '18), pages 519--532, 2018.
[2]
Luiz Barroso, Mike Marty, David Patterson, and Parthasarathy Ranganathan. Attack of the Killer Microseconds. Communications of the ACM, 60(4):48--54, 2017.
[3]
Amy Ousterhout, Joshua Fried, Jonathan Behrens, Adam Belay, and Hari Balakrishnan. Shenango: Achieving High CPU Efficiency for Latency-Sensitive Datacenter Workloads. In Proceedings of 16th USENIX Symposium on Networked Systems Design and Implementation (NSDI '19), pages 361--378, 2019.
[4]
Shaohong Li, Xi Wang, Xiao Zhang, Vasileios Kontorinis, Sreekumar Kodakara, David Lo, and Parthasarathy Ranganathan. Thunderbolt: Throughput-Optimized, Quality-of-Service-Aware Power Capping at Scale. In Proceedings of 14th USENIX Symposium on Operating Systems Design and Implementation (OSDI '20), pages 1241--1255, 2020.
[5]
Mingzhe Hao, Levent Toksoz, Nanqinqin Li, Edward Edberg Halim, Henry Hoffmann, and Haryadi S. Gunawi. LinnOS: Predictability on Unpredictable Flash Storage with a Light Neural Network. In Proceedings of 14th USENIX Symposium on Operating Systems Design and Implementation (OSDI '20), pages 173--190, 2020.
[6]
Joshua Fried, Zhenyuan Ruan, Amy Ousterhout, and Adam Belay. Caladan: Mitigating Interference at Microsecond Timescales. In Proceedings of 14th USENIX Symposium on Operating Systems Design and Implementation (OSDI'20), pages 281--297, 2020.
[7]
Yu Gan, Yanqi Zhang, Kelvin Hu, Dailun Cheng, Yuan He, Christina Delimitrou, and Christina Delimitrou. Seer: Leveraging Big Data to Navigate the Complexity of Performance Debugging in Cloud Microservices. In Proceedings of the 24th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS '19), pages 19--33, 2019.
[8]
Intel Optane SSD 900P Series. https://www.intel.com/content/www/us/en/products/docs/memory-storage/solid-state-drives/optane-ssd-900p-brief.html, 2017.
[9]
Samsung Z-NAND SSD. https://www.samsung.com/us/labs/pdfs/collateral/Samsung_Z-NAND_Technology_Brief_v5.pdf, 2017.
[10]
XL-Flash 3D NAND. https://www.anandtech.com/show/13183/toshiba-announces-xlflash-lowlatency-3d-nand, 2018.
[11]
Jie Zhang, Miryeong Kwon, Donghyun Gouk, Sungjoon Koh, Changlim Lee, Mohammad Alian, Myoungjun Chun, Mahmut Taylan Kandemir, Nam Sung Kim, Jihong Kim, and Myoungsoo Jung. FLASHSHARE: Punching Through Server Storage Stack from Kernel to Firmware for Ultra-Low Latency SSDs. In Proceedings of 13th USENIX Symposium on Operating Systems Design and Implementation (OSDI '18), pages 477--492, 2018.
[12]
Jiahao Liu, Fang Wang, and Dan Feng. CostPI: Cost-Effective Performance Isolation for Shared NVMe SSDs. In Proceedings of the 48th International Conference on Parallel Processing (ICPP '19), pages 1--10, 2019.
[13]
Sangwook Kim, Hwanju Kim, Joonwon Lee, and Jinkyu Jeong. Enlightening the I/O Path: A Holistic Approach for Application Performance. In Proceedings of 15th USENIX Conference on File and Storage Technologies (FAST '17), pages 345--358, 2017.
[14]
Sangwook Shane Hahn, Sungjin Lee, Inhyuk Yee, Donguk Ryu, and Jihong Kim. FastTrack: Foreground App-Aware I/O Management for Improving User Experience of Android Smartphones. In Proceedings of 2018 USENIX Annual Technical Conference (ATC '18), pages 15--28, 2018.
[15]
Suli Yang, Tyler Harter, Nishant Agrawal, Salini Selvaraj Kowsalya, Anand Krishnamurthy, Samer Al-Kiswany, Rini T. Kaushik, Andrea C. Arpaci-Dusseau, and Remzi H. Arpaci-Dusseau. Split-Level I/O Scheduling. In Proceedings of the 25th Symposium on Operating Systems Principles (SOSP '15), pages 474--489, 2015.
[16]
Till Miemietz, Hannes Weisbach, Michael Roitzsch, and Hermann Härtig. K2: Work-Constraining Scheduling of NVMe-Attached Storage. In Proceedings of 2019 IEEE Real-Time Systems Symposium (RTSS '19), pages 56--68, 2019.
[17]
Jiwon Woo, Minwoo Ahn, Gyusun Lee, and Jinkyu Jeong. D2FQ: Device-Direct Fair Queueing for NVMe SSDs. In Proceedings of 19th USENIX Conference on File and Storage Technologies (FAST '21), pages 403--415, 2021.
[18]
Arash Tavakkol, Mohammad Sadrosadati, Saugata Ghose, Jeremie Kim, Yixin Luo, Yaohua Wang, Nika Mansouri Ghiasi, Lois Orosa, Juan Gómez-Luna, and Onur Mutlu. FLIN: Enabling Fairness and Enhancing Performance in Modern NVMe Solid State Drives. In Proceedings of 2018 ACM/IEEE 45th Annual International Symposium on Computer Architecture (ISCA '18), pages 397--410, 2018.
[19]
FastResponse. https://github.com/CGCL-codes/FastResponse, 2022.
[20]
RocksDB. https://github.com/facebook/rocksdb, 2017.
[21]
Shengsheng Huang, Jie Huang, Jinquan Dai, Tao Xie, and Bo Huang. The Hi-Bench Benchmark Suite: Characterization of the MapReduce-Based Data Analysis. In Proceedings of 2010 IEEE 26th International Conference on Data Engineering Workshops (ICDEW '10), pages 41--51, 2010.
[22]
FIO: Flexible I/O Tester. https://github.com/axboe/fio, 2017.
[23]
Paolo Valente and Arianna Avanzini. Evolution of the BFQ Storage-I/O Scheduler. In Proceedings of 2015 Mobile Systems Technologies Workshop (MST '15), pages 15--20, 2015.
[24]
Block layer introduction. https://lwn.net/Articles/738449/, 2017.
[25]
Gyusun Lee, Seokha Shin, Wonsuk Song, Tae Jun Ham, Jae W. Lee, and Jinkyu Jeong. Asynchronous I/O Stack: A Low-Latency Kernel I/O Stack for Ultra-Low Latency SSDs. In Proceedings of 2019 USENIX Annual Technical Conference (ATC '19), pages 603--616, 2019.
[26]
Adrian M. Caulfield, Arup De, Joel Coburn, Todor I. Mollow, Rajesh K. Gupta, and Steven Swanson. Moneta: A High-Performance Storage Array Architecture for Next-Generation, Non-Volatile Memories. In Proceedings of 2010 43rd Annual IEEE/ACM International Symposium on Microarchitecture (MICRO '10), pages 385--395, 2010.
[27]
Amy Tai, Igor Smolyar, Michael Wei, and Dan Tsafrir. Optimizing Storage Performance with Calibrated Interrupts. In Proceedings of the 15th USENIX Symposium on Operating Systems Design and Implementation (OSDI '21), pages 129--145, 2021.
[28]
Jaehyun Hwang, Midhul Vuppalapati, Simon Peter, and Rachit Agarwal. Rearchitecting Linux Storage Stack for μs Latency and High Throughput. In Proceedings of the 15th USENIX Symposium on Operating Systems Design and Implementation (OSDI '21), pages 113--128, 2021.
[29]
Dbbench. https://github.com/facebook/rocksdb/wiki/Benchmarking-tools, 2017.
[30]
David Lo, Liqun Cheng, Rama Govindaraju, Luiz André Barroso, and Christos Kozyrakis. Towards Energy Proportionality for Large-Scale Latency-Critical Workloads. In Proceedings of 2014 ACM/IEEE 41st International Symposium on Computer Architecture (ISCA '14), pages 301--312, 2014.
[31]
Lei Wang, Jianfeng Zhan, Chunjie Luo, Yuqing Zhu, Qiang Yang, Yongqiang He, Wanling Gao, Shujie Zhang, Chen Zheng, Gang Lu, Kent Zhan, Xiaona Li, and Bizhu Qiu. Bigdatabench: A Big Data Benchmark Suite from Internet Services. In Proceedings of 2014 IEEE 20th International Symposium on High Performance Computer Architecture (HPCA '14), pages 488--499, 2014.
[32]
Thanumalayan Sankaranarayana Pillai, Ramnatthan Alagappan, Lanyue Lu, Vijay Chidambaram, Andrea C. Arpaci-Dusseau, and Remzi H. Arpaci-Dusseau. Application Crash Consistency and Performance with CCFS. ACM Transactions on Storage, 13(3):1 -- 29, 2017.
[33]
Daejun Park and Dongkun Shin. iJournaling: Fine-Grained Journaling for Improving the Latency of Fsync System Call. In Proceedings of 2017 USENIX Annual Technical Conference (ATC '17), pages 787--798, 2017.
[34]
Mohammad Hedayati, Kai Shen, Michael L. Scott, and Mike Marty. Multi-Queue Fair Queuing. In Proceedings of 2019 USENIX Annual Technical Conference (ATC '19), pages 301--314, 2019.
[35]
Kanchan Joshi, Kaushal Yadav, and Praval Choudhary. Enabling NVMe WRR Support in Linux Block Layer. In Proceedings of 9th USENIX Workshop on Hot Topics in Storage and File Systems (HotStorage '17), 2017.
[36]
Jisoo Yang, Dave B. Minturn, and Frank Hady. When Poll is Better than Interrupt. In Proceedings of the 10th USENIX Conference on File and Storage Technologies, (FAST '12), pages 25--31, 2012.
[37]
Sooman Jeong, Kisung Lee, Seongjin Lee, Seoungbum Son, and Youjip Won. I/O Stack Optimization for Smartphones. In Proceedings of 2013 USENIX Annual Technical Conference (ATC '13), pages 309--320, 2013.
[38]
Sungjoon Koh, Changrim Lee, Miryeong Kwon, and Myoungsoo Jung. Exploring System Challenges of Ultra-Low Latency Solid State Drives. In Proceedings of 10th USENIX Workshop on Hot Topics in Storage and File Systems (HotStorage '18), 2018.
[39]
I/O Latency Optimization with Polling. https://events.static.linuxfound.org/sites/events/files/slides/lemoal-nvme-polling-vault-2017-final_0.pdf, 2017.
[40]
Assaf Eisenman, Darryl Gardner, Islam AbdelRahman, Jens Axboe, Siying Dong, Kim Hazelwood, Chris Petersen, Asaf Cidon, and Sachin Katti. Reducing DRAM Footprint with NVM in Facebook. In Proceedings of the Thirteenth EuroSys Conference (EuroSys '18), pages 1--13, 2018.

Cited By

View all
  • (2024)OctoFAS: A Two-Level Fair Scheduler That Increases Fairness in Network-Based Key-Value StorageElectronics10.3390/electronics1303061913:3(619)Online publication date: 1-Feb-2024
  • (2024)A Contract-aware and Cost-effective LSM Store for Cloud Storage with Low Latency SpikesACM Transactions on Storage10.1145/364385120:2(1-27)Online publication date: 4-Apr-2024
  • (2024)CoFS: A Collaboration-Aware Fairness Scheme for NVMe SSD in Cloud Storage SystemIEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems10.1109/TCAD.2024.341297043:12(4490-4504)Online publication date: Dec-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
ICS '22: Proceedings of the 36th ACM International Conference on Supercomputing
June 2022
514 pages
ISBN:9781450392815
DOI:10.1145/3524059
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 28 June 2022

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. I/O interference
  2. I/O scheduling
  3. storage system
  4. ultra low-latency SSD

Qualifiers

  • Research-article

Funding Sources

Conference

ICS '22
Sponsor:

Acceptance Rates

Overall Acceptance Rate 629 of 2,180 submissions, 29%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)378
  • Downloads (Last 6 weeks)54
Reflects downloads up to 27 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2024)OctoFAS: A Two-Level Fair Scheduler That Increases Fairness in Network-Based Key-Value StorageElectronics10.3390/electronics1303061913:3(619)Online publication date: 1-Feb-2024
  • (2024)A Contract-aware and Cost-effective LSM Store for Cloud Storage with Low Latency SpikesACM Transactions on Storage10.1145/364385120:2(1-27)Online publication date: 4-Apr-2024
  • (2024)CoFS: A Collaboration-Aware Fairness Scheme for NVMe SSD in Cloud Storage SystemIEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems10.1109/TCAD.2024.341297043:12(4490-4504)Online publication date: Dec-2024
  • (2023)APP: Enabling Soft Real-Time Execution on Densely-Populated Hybrid Memory System2023 60th ACM/IEEE Design Automation Conference (DAC)10.1109/DAC56929.2023.10247672(1-6)Online publication date: 9-Jul-2023
  • (2023)Extending Memory Capacity in Modern Consumer Systems With Emerging Non-Volatile Memory: Experimental Analysis and Characterization Using the Intel Optane SSDIEEE Access10.1109/ACCESS.2023.331788411(105843-105871)Online publication date: 2023

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media