DOI: 10.1145/1998582.1998638

Maestro: quality-of-service in large disk arrays

Published: 14 June 2011


Provisioning storage in disk arrays is a difficult problem because many applications with different workload characteristics and priorities share resources provided by the array. Currently, storage in disk arrays is statically partitioned, leading to difficult choices between over-provisioning to meet peak demands and resource sharing to meet efficiency targets. In this paper, we present Maestro, a feedback controller that can manage resources on large disk arrays to provide performance differentiation among multiple applications. Maestro monitors the performance of each application and dynamically allocates the array resources so that diverse performance requirements can be met without static partitioning. It supports multiple performance metrics (e.g., latency and throughput) and application priorities so that important applications receive better performance in case of resource contention. By ensuring that high-priority applications sharing storage with other applications obtain the performance levels they require, Maestro makes it possible to use storage resources efficiently. We evaluate Maestro using both synthetic and real-world workloads on a large, commercial disk array. Our experiments indicate that Maestro can reliably adjust the allocation of disk array resources to achieve application performance targets.
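The monitor-and-reallocate loop described in the abstract can be illustrated with a minimal Python sketch. This is not Maestro's controller: the abstract does not give the control law, so the proportional update rule, the `App` record, the `control_step` function, and the `gain` and `min_share` parameters are all hypothetical choices made for illustration. The sketch assumes latency is the metric, that a larger resource share lowers latency, and that higher-priority applications are served first when the requested shares exceed the array's capacity.

```python
from dataclasses import dataclass


@dataclass
class App:
    """Hypothetical per-application record (not from the paper)."""
    name: str
    target_latency_ms: float  # performance target for this application
    priority: int             # larger value = more important
    share: float              # current fraction of array resources (0..1)


def control_step(apps, measured_latency_ms, gain=0.5, min_share=0.05):
    """Run one feedback iteration and return the new share per application.

    Assumed update rule: scale each share by the relative latency error,
    then grant shares in priority order until total capacity (1.0) is used.
    """
    if min_share * len(apps) > 1.0:
        raise ValueError("min_share is too large for this many applications")

    # Step 1: compute the share each application would like to have.
    desired = {}
    for app in apps:
        error = (measured_latency_ms[app.name] - app.target_latency_ms) \
            / app.target_latency_ms
        # Above target (error > 0) grows the share; below target shrinks it.
        desired[app.name] = max(min_share, app.share * (1.0 + gain * error))

    # Step 2: under contention, satisfy high-priority applications first,
    # while reserving min_share for every application still waiting.
    remaining = 1.0
    ordered = sorted(apps, key=lambda a: a.priority, reverse=True)
    for i, app in enumerate(ordered):
        reserve = min_share * (len(ordered) - i - 1)
        app.share = min(desired[app.name], remaining - reserve)
        remaining -= app.share

    return {app.name: app.share for app in apps}
```

For instance, given two applications that each hold half of the array, a high-priority one measured at twice its latency target and a low-priority one measured at half its target, one call to `control_step` with the default parameters moves the shares to 0.75 and 0.25. Repeating the step each monitoring interval forms the feedback loop.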




Published In

ICAC '11: Proceedings of the 8th ACM international conference on Autonomic computing
June 2011
278 pages
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]



Association for Computing Machinery

New York, NY, United States


Author Tags

  1. control theory
  2. disk arrays
  3. feedback control
  4. quality of service
  5. storage systems


  • Research-article



Cited By

  • (2024) PhD Forum: Towards Metastable-Failure-Free Distributed Transaction Systems. 2024 43rd International Symposium on Reliable Distributed Systems (SRDS), pp. 318-321. DOI: 10.1109/SRDS64841.2024.00038. Online publication date: 30-Sep-2024
  • (2024) MSF-Model: Queuing-Based Analysis and Prediction of Metastable Failures in Replicated Storage Systems. 2024 43rd International Symposium on Reliable Distributed Systems (SRDS), pp. 12-22. DOI: 10.1109/SRDS64841.2024.00013. Online publication date: 30-Sep-2024
  • (2021) Metastable failures in distributed systems. Proceedings of the Workshop on Hot Topics in Operating Systems, pp. 221-227. DOI: 10.1145/3458336.3465286. Online publication date: 1-Jun-2021
  • (2020) GIFT. Proceedings of the 18th USENIX Conference on File and Storage Technologies, pp. 103-120. DOI: 10.5555/3386691.3386702. Online publication date: 24-Feb-2020
  • (2019) Fair-EDF. Proceedings of the 11th USENIX Conference on Hot Topics in Storage and File Systems, pp. 6-6. DOI: 10.5555/3357062.3357070. Online publication date: 8-Jul-2019
  • (2019) Latency Fairness Scheduling for Shared Storage Systems. 2019 IEEE International Conference on Networking, Architecture and Storage (NAS), pp. 1-8. DOI: 10.1109/NAS.2019.8834732. Online publication date: Aug-2019
  • (2019) Evaluation of I/O Performance Regulating Function with a Virtual Machine. Advances in Networked-based Information Systems, pp. 641-649. DOI: 10.1007/978-3-030-29029-0_63. Online publication date: 15-Aug-2019
  • (2018) Analyzing, Modeling, and Provisioning QoS for NVMe SSDs. 2018 IEEE/ACM 11th International Conference on Utility and Cloud Computing (UCC), pp. 247-256. DOI: 10.1109/UCC.2018.00033. Online publication date: Dec-2018
  • (2017) ReFlex. ACM SIGARCH Computer Architecture News 45:1, pp. 345-359. DOI: 10.1145/3093337.3037732. Online publication date: 4-Apr-2017
  • (2017) ReFlex. ACM SIGPLAN Notices 52:4, pp. 345-359. DOI: 10.1145/3093336.3037732. Online publication date: 4-Apr-2017
