Abstract
Gossip protocols and services provide a means by which failures can be detected in large, distributed systems in an asynchronous manner without the limits associated with reliable multicasting for group communications. Extending the gossip protocol such that a system reaches consensus on detected faults can be performed via a flat structure, or it can be hierarchically distributed across cooperating layers of nodes. In this paper, the performance of gossip services employing flat and hierarchical schemes is analyzed on an experimental testbed in terms of consensus time, resource utilization and scalability. Performance associated with a hierarchically arranged gossip scheme is analyzed with varying group sizes and is shown to scale well. Resource utilization of the gossip-style failure detection and consensus service is measured in terms of network bandwidth utilization and CPU utilization. Analytical models are developed for resource utilization and performance projections are made for large system sizes.
Similar content being viewed by others
References
R. van Renesse, R. Minsky and M. Heyden, A gossip-style failure detection service, in: Proc. of IFP International Conf. on Distributed Systems Platforms and Open Distributed Processing Middleware'98, Lake District, England, 15–;18 September 1998.
M. Burns, A. George and B. Wallace, Simulative performance analysis of gossip failure detection for scalable distributed systems, Cluster Computing 2(3) (1999) 207-217.
S. Ranganathan, A. George, R. Todd and M. Chidester, Gossip-style failure detection and distributed consensus for scalable heterogeneous clusters, Cluster Computing 4(3) (2001) 197-209.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Sistla, K., George, A.D. & Todd, R.W. Experimental Analysis of a Gossip-Based Service for Scalable, Distributed Failure Detection and Consensus. Cluster Computing 6, 237–251 (2003). https://doi.org/10.1023/A:1023592621046
Issue Date:
DOI: https://doi.org/10.1023/A:1023592621046