skip to main content
research-article
Public Access

Customizable SLO and Its Near-Precise Enforcement for Storage Bandwidth

Published: 16 February 2017 Publication History

Abstract

Cloud service is being adopted as a utility for large numbers of tenants by renting Virtual Machines (VMs). But for cloud storage, unpredictable IO characteristics make accurate Service-Level-Objective (SLO) enforcement challenging. As a result, it has been very difficult to support simple-to-use and technology-agnostic SLO specifying a particular value for a specific metric (e.g., storage bandwidth). This is because the quality of SLO enforcement depends on performance error and fluctuation that measure the precision of SLO enforcement. High precision of SLO enforcement is critical for user-oriented performance customization and user experiences. To address this challenge, this article presents V-Cup, a framework for VM-oriented customizable SLO and its near-precise enforcement. It consists of multiple auto-tuners, each of which exports an interface for a tenant to customize the desired storage bandwidth for a VM and enable the storage bandwidth of the VM to converge on the target value with a predictable precision. We design and implement V-Cup in the Xen hypervisor based on the fair sharing scheduler for VM-level resource management. Our V-Cup prototype evaluation shows that it achieves satisfying performance guarantees through near-precise SLO enforcement.

References

[1]
Amazon EC2. 2015. Amazon EC2 website. Retrieved from http://aws.amazon.com/ec2.
[2]
Amazon EC2 SLA. 2015. Amazon EC2 SLA. Retrieved from http://aws.amazon.com/cn/ec2-sla/.
[3]
J. Axboe. 2004. Linux block IO—Present and future. In Proceedings of the Ottawa Linux Symposium.
[4]
P. Barham, B. Dragovic, K. Fraser, S. Hand, T. Harris, A. Ho, R. Neugebauer, I. Pratt, and A. Warfield. 2003. Xen and the art of virtualization. In Proceedings of the ACM Symposium on Operating Systems Principles (SOSP).
[5]
J. C. R. Bennett and H. Zhang. 1996. WF2Q: Worst-case fair weighted fair queueing. In Proceedings of the IEEE Conference on Computer Communications (INFOCOM).
[6]
R. Bradford, E. Kotsovinos, A. Feldmann, and H. Schioberg. 2007. Live wide-area migration of virtual machines including local persistent state. In Proceedings of the International Conference on Virtual Execution Environments (VEE).
[7]
D. D. Chambliss, G. A. Alvarez, P. Pandey, D. Jadav, J. Xu, R. Menon, and T. P. Lee. 2003. Performance virtualization for large-scale storage systems. In Proceedings of the 22nd International Symposium on Reliable Distributed Systems (SRDS).
[8]
B. F. Cooper, A. Silberstein, E. Tam, R. Ramakrishnan, and R. Sears. 2010. Benchmarking cloud serving systems with YCSB. In Proceedings of the ACM Symposium on Cloud Computing (SoCC).
[9]
A. Dan and D. Sitaram. 1996. A generalized interval caching policy for mixed interactive and long video environments. In Proceedings of Multimedia Computing and Networking Conference.
[10]
K. J. Duda and D. R. Cheriton. 1999. Borrowed-virtual-time (BVT) scheduling: Supporting latency-sensitive threads in a general-purpose scheduler. In Proceedings of the ACM Symposium on Operating Systems Principles (SOSP).
[11]
FIO. 2015. FIO. Retrieved from http://freecode.com/projects/fio.
[12]
G. F. Franklin, J. D. Powell, and M. Workman. 1998. Digital Control of Dynamic Systems. Addison-Wesley.
[13]
K. Fraser, S. Hand, R. Neugebauer, I. Pratt, A. Warfield, and M. Williamson. 2004. Safe hardware access with the Xen virtual machine monitor. In Proceedings of the 1st Workshop on Operating System and Architectural Support for the On Demand IT InfraStructure (OASIS).
[14]
S. Golestani. 1994. A self-clocked fair queueing scheme for broadband applications. In Proceedings of the IEEE Conference on Computer Communications (INFOCOM).
[15]
Google Compute Engine. 2015. Google Compute Engine. Retrieved from https://cloud.google.com/products/compute-engine/.
[16]
P. Goyal, H. M. Vin, and H. Cheng. 1997. Start-time fair queuing: A scheduling algorithm for integrated services packet switching networks. IEEE/ACM Transactions on Networking 55 (1997), 690--704.
[17]
A. Gulati, I. Ahmad, and C. A. Waldspurger. 2009. PARDA: Proportional allocation of resources for distributed storage access. In Proccedings of the Conference on File and Storage Technologies (FAST).
[18]
A. Gulati, A. Merchant, and P. J. Varman. 2010. mClock: Handling throughput variability for hypervisor IO scheduling. In Proceedings of the Symposium on Operating Systems Design and Implementation (OSDI).
[19]
A. Gulati, G. Shanmuganathan, X. Zhang, and P. Varman. 2012. Demand based hierarchical QoS using storage resource pools. In Proceedings of the USENIX Annual Technical Conference (ATC).
[20]
J. L. Hellerstein, Y. Diao, S. Parekh, and D. M. Tilbury. 2004. Feedback Control of Computing Systems. John Wiley 8 Sons, Inc., Hoboken, New Jersey.
[21]
W. Jin, J. S. Chase, and J. Kaur. 2004. Interposed proportional sharing for a storage service utility. ACM SIGMETRICS Performance Evaluation Review 32, 1 (2004), 37--48.
[22]
J. Kang, B. Zhang, T. Wo, C. Hu, and J. Huai. 2014. MultiLanes: Providing virtualized storage for OS-level virtualization on many cores. In Proccedings of the Conference on File and Storage Technologies (FAST).
[23]
M. Karlsson, C. Karamanolis, and X. Zhu. 2005. Triage: Performance differentiation for storage systems using adaptive control. ACM Transactions on Storage (TOS) 1, 4 (2005), 457--480.
[24]
J. Kim, D. Lee, and S. H. Noh. 2015. Towards SLO complying SSDs through OPS isolation. In Proceedings of the Conference on File and Storage Technologies (FAST).
[25]
A. Kivity, Y. Kamay, D. Laor, U. Lublin, and A. Liguori. 2007. kvm: The Linux virtual machine monitor. In Proceedings of the 2007 Linux Symposium.
[26]
J. Liu, S. G. Rao, H. Zhang, and B. Li. 2008. Opportunities and challenges of peer-to-peer internet video broadcast. Proceedings of the IEEE 96, 1 (2008), 11--24.
[27]
Z. Lu, J. Wu, Y. Huang, L. Chen, and D. Deng. 2012. CPDID: A novel CDN-P2P dynamic interactive delivery scheme for live streaming. In Proceedings of IEEE 18th International Conference on Parallel and Distributed Systems (ICPADS).
[28]
C. R. Lumb, A. Merchant, and G. A. Alvarez. 2003. Façade: Virtual storage devices with performance guarantees. In Proccedings of the Conference on File and Storage Technologies (FAST).
[29]
LXC. 2016. LXC. Retrieved from https://linuxcontainers.org/lxc/.
[30]
K. Mansley, G. Law, D. Riddoch, G. Barzini, N. Turton, and S. Pope. 2007. Getting 10 Gb/s from Xen: Safe and fast device access from unprivileged domains. In Proceedings of the 2007 Conference on Parallel Processing. 224--233.
[31]
A. Mashtizadeh, E. Celebi, T. Garfinkel, and M. Cai. 2011. The design and evolution of live storage migration in VMware ESX. In Proceedings of the USENIX Annual Technical Conference (ATC).
[32]
R. Mcdougall. 2015. A prototype model-based workload for file systems, work in progress. Retrieved from http://solarisinternals.com/si/tools/filebench/filebench nasconf.pdf.
[33]
MongoDB. 2015. MongoDB. Retrieved from http://www.mongodb.org/.
[34]
R. Nathuji, A. Kansal, and A. Ghaffarkhah. 2010. Q-clouds: Managing performance interference effects for QoS-aware clouds. In Proceedings of the 3rd European Conference on Computer Systems (EuroSys).
[35]
J. Nieh and M. S. Lam. 2003. A SMART scheduler for multimedia applications. ACM Transactions on Computer Systems (TOCS) 21, 2 (2003), 117--163.
[36]
A. Povzner, T. Kaldewey, S.Brandt, R. Golding, T. M. Wong, and C. Maltzahn. 2008. Efficient guaranteed disk request scheduling with Fahrrad. In Proceedings of the 3rd European Conference on Computer Systems (EuroSys).
[37]
R. Russell. 2008. virtio: Towards a de-facto standard for virtual I/O devices. SIGOPS Operating Systems Review 42, 5 (2008), 95--103.
[38]
M. Ryu and U. Ramachandran. 2013. FlashStream: A multi-tiered storage architecture for adaptive HTTP streaming. In Proceedings of the 21st ACM International Conference on Multimedia. 313--322.
[39]
M. Shreedhar and G. Varghese. 1987. Efficient fair queuing using deficit round-robin. IEEE/ACM Transactions on Networking 4, 3 (1987), 375--385.
[40]
D. Shue, M. J. Freedman, and A. Shaikh. 2012. Performance isolation and fairness for multi-tenant cloud storage. In Proceedings of the Symposium on Operating Systems Design and Implementation (OSDI).
[41]
S. Soltesz, H. Pötzl, M. E. Fiuczynski, A. Bavier, and L. Peterson. 2007. Container-based operating system virtualization: A scalable, high-performance alternative to hypervisors. In Proceedings of the 3rd European Conference on Computer Systems (EuroSys).
[42]
J. Sugerman, G. Venkitachalam, and B. H. Lim. 2001. Virtualizing I/O devices on VMware workstation’s hosted virtual machine monitor. In Proceedings of the USENIX Annual Technical Conference (ATC).
[43]
S. Suri, G. Varghese, and G. Chandramenon. 1997. Leap forward virtual clock: A new fair queueing scheme with guaranteed delay and throughput fairness. In Proceedings of the IEEE Conference on Computer Communications (INFOCOM).
[44]
E. Thereska, H. Ballani, G. O’Shea, T. Karagiannis, A. Rowstron, T. Talpey, R. Black, and T. Zhu. 2013. IOFlow: A software-defined storage architecture. In Proceedings of the ACM Symposium on Operating Systems Principles (SOSP).
[45]
VMware Infrastructure. 2016. VMware, Inc. Introduction to VMware Infrastructure. Retrieved from http://www.vmware.com/support/pubs/.
[46]
M. Wachs, M. Abd-El-Malek, E. Thereska, and G. R. Ganger. 2007. Argon: Performance insulation for shared storage servers. In Proccedings of the Conference on File and Storage Technologies (FAST).
[47]
A. Wang, S. Venkataraman, S. Alspaugh, R. Katz, and I. Stoica. 2012. Cake: Enabling high-level SLOs on shared storage systems. In Proceedings of the ACM Symposium on Cloud Computing (SoCC).
[48]
Windows Azure. 2015. Windows Azure. Retrieved from http://www.windowsazure.com/.
[49]
J. C. Wu and S. A. Brandt. 2006. The design and implementation of Aqua: An adaptive quality of service aware object-based storage device. In Proceedings of the IEEE Conference on Mass Storage Systems and Technologies (MSST).
[50]
S. Wu, H. Jiang, D. Feng, L. Tian, and B. Mao. 2009. WorkOut: I/O workload outsourcing for boosting RAID reconstruction performance. In Proccedings of the Conference on File and Storage Technologies (FAST).
[51]
S. Wu, H. Jiang, and B. Mao. 2012. IDO: Intelligent data outsourcing with improved RAID reconstruction performance in large-scale data centers. In Proceedings of the USENIX Large Installation System Administration (LISA).
[52]
Y. Wu, C. Wu, B. Li, X. Qiu, and F. C. Lau. 2011. CloudMedia: When cloud on demand meets video on demand. In Proceedings of the IEEE International Conference on Distributed Computing Systems (ICDCS).
[53]
J. Zhang, A. Riska, A. Sivasubramaniam, Q. Wang, and E. Riedel. 2006. Storage performance virtualization via throughput and latency control. ACM Transactions on Storage (TOS) 2, 3 (2006), 283--308.

Cited By

View all
  • (2023)Autothrottle: Satisfying Network Performance Requirements for ContainersIEEE Transactions on Cloud Computing10.1109/TCC.2022.318639711:2(2096-2109)Online publication date: 1-Apr-2023
  • (2022)E-Government Cybersecurity Modeling in the Context of Software-Defined NetworksCybersecurity Measures for E-Government Frameworks10.4018/978-1-7998-9624-1.ch001(1-21)Online publication date: 11-Mar-2022
  • (2022)Near-memory Computing on FPGAs with 3D-stacked Memories: Applications, Architectures, and OptimizationsACM Transactions on Reconfigurable Technology and Systems10.1145/354765816:1(1-32)Online publication date: 18-Jul-2022
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Transactions on Storage
ACM Transactions on Storage  Volume 13, Issue 1
Special Issue on USENIX FAST 2016 and Regular Papers
February 2017
201 pages
ISSN:1553-3077
EISSN:1553-3093
DOI:10.1145/3054178
  • Editor:
  • Sam H. Noh
Issue’s Table of Contents
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 16 February 2017
Accepted: 01 September 2016
Revised: 01 July 2016
Received: 01 July 2015
Published in TOS Volume 13, Issue 1

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Cloud storage
  2. end-to-end control
  3. service-level objective
  4. storage management

Qualifiers

  • Research-article
  • Research
  • Refereed

Funding Sources

  • NSFC
  • National High Technology Research and Development Program of China
  • US NSF

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)73
  • Downloads (Last 6 weeks)11
Reflects downloads up to 02 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2023)Autothrottle: Satisfying Network Performance Requirements for ContainersIEEE Transactions on Cloud Computing10.1109/TCC.2022.318639711:2(2096-2109)Online publication date: 1-Apr-2023
  • (2022)E-Government Cybersecurity Modeling in the Context of Software-Defined NetworksCybersecurity Measures for E-Government Frameworks10.4018/978-1-7998-9624-1.ch001(1-21)Online publication date: 11-Mar-2022
  • (2022)Near-memory Computing on FPGAs with 3D-stacked Memories: Applications, Architectures, and OptimizationsACM Transactions on Reconfigurable Technology and Systems10.1145/354765816:1(1-32)Online publication date: 18-Jul-2022
  • (2021)Hybrid Precoding Algorithm for Millimeter-Wave Massive MIMO Systems with Subconnection StructuresWireless Communications & Mobile Computing10.1155/2021/55329392021Online publication date: 1-Jan-2021
  • (2021)A Large-scale Analysis of Hundreds of In-memory Key-value Cache Clusters at TwitterACM Transactions on Storage10.1145/346852117:3(1-35)Online publication date: 16-Aug-2021
  • (2021)Energy-collision-aware Minimum Latency Aggregation Scheduling for Energy-harvesting Sensor NetworksACM Transactions on Sensor Networks10.1145/346101317:4(1-34)Online publication date: 16-Jul-2021
  • (2021)Joint Deployment Strategy of Battery-Free Sensor Networks with Coverage GuaranteeACM Transactions on Sensor Networks10.1145/345712317:4(1-29)Online publication date: 22-Jul-2021
  • (2020)Austere flash caching with deduplication and compressionProceedings of the 2020 USENIX Conference on Usenix Annual Technical Conference10.5555/3489146.3489195(713-726)Online publication date: 15-Jul-2020
  • (2019)Energy-Efficient Broadcast Scheduling Algorithm in Duty-Cycled Multihop Wireless NetworksWireless Communications & Mobile Computing10.1155/2019/50641092019Online publication date: 6-Feb-2019
  • (2019)PINE:Optimizing Performance Isolation in Container EnvironmentsIEEE Access10.1109/ACCESS.2019.2900451(1-1)Online publication date: 2019
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Full Access

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media