Article

Automatic configuration of internet services

Authors:

Ricardo Bianchini,

Thu D. Nguyen

EuroSys '07: Proceedings of the 2nd ACM SIGOPS/EuroSys European Conference on Computer Systems 2007

Pages 219 - 229

https://doi.org/10.1145/1272996.1273020

Published: 21 March 2007

Abstract

Recent research has found that operators frequently misconfigure Internet services, causing various availability and performance problems. In this paper, we propose a software infrastructure that eliminates several types of misconfiguration by automating the generation of configuration files in Internet services, even as the services evolve. The infrastructure comprises a custom scripting language, configuration file templates, communicating runtime monitors, and heuristic algorithms to detect dependencies between configuration parameters and select ideal configurations. To demonstrate our infrastructure experimentally, we apply it to a realistic online auction service. Our results show that the infrastructure can simplify operation significantly while eliminating 58% of the misconfigurations found in a previous study of the same service. Furthermore, our results show that the infrastructure can efficiently determine the configuration parameters that lead to high performance as the service evolves through a hardware upgrade and the scheduled maintenance of a few nodes.
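
The idea of generating configuration files from templates, rather than editing them by hand, can be illustrated with a minimal sketch. This is not the paper's custom scripting language or its template format; it uses Python's standard `string.Template`, and the parameter names (`worker.list`, `max_clients`), the node names, and the `service` dictionary layout are all hypothetical. The point it shows is that when every node's file is rendered from one service description, a change such as adding an application server propagates consistently to every dependent file.

```python
from string import Template

# Hypothetical template for a web-server config file. The parameter names
# are illustrative only and are not taken from the paper.
WEB_TEMPLATE = Template(
    "worker.list=$workers\n"
    "max_clients=$max_clients\n"
)


def render_web_config(app_servers, max_clients):
    """Render one web-server config from the current list of app servers."""
    if not app_servers:
        raise ValueError("at least one application server is required")
    return WEB_TEMPLATE.substitute(
        workers=",".join(sorted(app_servers)),  # sorted for stable output
        max_clients=max_clients,
    )


def regenerate(service):
    """Regenerate the config of every web node from one service description."""
    return {
        node: render_web_config(service["app"], service["max_clients"])
        for node in service["web"]
    }


# Assumed service description: two web nodes in front of two app servers.
service = {"web": ["web1", "web2"], "app": ["app2", "app1"], "max_clients": 150}
configs = regenerate(service)

# The service evolves: a new app server is added, and all files are rebuilt.
service["app"].append("app3")
updated = regenerate(service)
```

Because no file is edited by hand, the two web nodes cannot disagree about the set of application servers, which is the kind of inconsistency a template-driven approach is meant to rule out.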

Index Terms

Automatic configuration of internet services
1. Social and professional topics
  1. Professional topics
    1. Management of computing and information systems

Published In

EuroSys '07: Proceedings of the 2nd ACM SIGOPS/EuroSys European Conference on Computer Systems 2007

March 2007

431 pages

ISBN:9781595936363

DOI:10.1145/1272996

ACM SIGOPS Operating Systems Review Volume 41, Issue 3
EuroSys'07 Conference Proceedings
June 2007
386 pages
ISSN:0163-5980
DOI:10.1145/1272998

Copyright © 2007 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

SIGOPS: ACM Special Interest Group on Operating Systems

Publisher

Association for Computing Machinery

New York, NY, United States

Qualifiers

Article

Conference

EuroSys07

Sponsor:

SIGOPS

EuroSys07: Eurosys 2007 Conference

March 21 - 23, 2007

Lisbon, Portugal

Acceptance Rates

Overall Acceptance Rate 241 of 1,308 submissions, 18%
