ABSTRACT
ZooKeeper provides scalable, highly available coordination services for distributed applications. In this paper, we evaluate the use of ZooKeeper in a distributed stream computing system called System S to provide a resilient name service, dynamic configuration management, and system state management. The evaluation sheds light on the advantages of using ZooKeeper in these contexts as well as its limitations. We also describe the design changes we made to the handling of named objects in System S to overcome these limitations. We present detailed experimental results, which we believe will be beneficial to the community.
Index Terms
- An evaluation of zookeeper for high availability in system S