ABSTRACT
Job schedulers for grids and clouds can offer great generality and configurability, but they typically do so at the cost of increased complexity for administrators. In this paper, we present Wallaby, an open-source, scalable configuration service for compute resources managed by the Condor high-throughput computing system. Wallaby offers several notable advantages over similar systems: it lets administrators write declarative specifications of user-visible functionality on groups of nodes instead of low-level configuration file fragments; it presents a high-level semantic model of Condor features and their interactions and dependencies; it validates configurations before pushing them to nodes; it supports version control, "undo," and configuration differencing; and it includes a networked API that enables extensions and advanced functionality. Wallaby allows administrators to extend pools to include more physical, virtual, or cloud nodes with minimal explicit configuration. Finally, it is scalable, supporting pools consisting of thousands of nodes with hundreds of configuration parameters each.