Experimental Analysis of a Gossip-Based Service for Scalable, Distributed Failure Detection and Consensus

Cluster Computing

Abstract

Gossip protocols and services provide a means by which failures can be detected in large, distributed systems in an asynchronous manner without the limits associated with reliable multicasting for group communications. Extending the gossip protocol such that a system reaches consensus on detected faults can be performed via a flat structure, or it can be hierarchically distributed across cooperating layers of nodes. In this paper, the performance of gossip services employing flat and hierarchical schemes is analyzed on an experimental testbed in terms of consensus time, resource utilization and scalability. Performance associated with a hierarchically arranged gossip scheme is analyzed with varying group sizes and is shown to scale well. Resource utilization of the gossip-style failure detection and consensus service is measured in terms of network bandwidth utilization and CPU utilization. Analytical models are developed for resource utilization and performance projections are made for large system sizes.
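The abstract does not spell out the protocol itself, so the following is only a minimal sketch of heartbeat-counter gossip in general, assuming a flat scheme. The class `GossipNode`, the timeout `t_fail`, the round-based clock, and the `simulate` driver are all hypothetical names introduced for illustration; they are not the service measured in the paper. The consensus check is also simplified: the simulator inspects every live node directly instead of having the nodes exchange their suspicions.

```python
import random


class GossipNode:
    """Hypothetical node for illustration; not the service from the paper."""

    def __init__(self, node_id, t_fail=5):
        self.node_id = node_id
        self.t_fail = t_fail              # assumed timeout, in gossip rounds
        self.heartbeats = {node_id: 0}    # highest counter seen per member
        self.last_seen = {node_id: 0}     # round in which that counter last rose

    def tick(self, now):
        """Advance our own heartbeat counter at the start of a round."""
        self.heartbeats[self.node_id] += 1
        self.last_seen[self.node_id] = now

    def merge(self, remote_heartbeats, now):
        """Keep the larger counter for each member in a received list."""
        for member, beat in remote_heartbeats.items():
            if beat > self.heartbeats.get(member, -1):
                self.heartbeats[member] = beat
                self.last_seen[member] = now

    def suspects(self, now):
        """Members (other than ourselves) whose counter has been stale too long."""
        return {
            m for m, seen in self.last_seen.items()
            if m != self.node_id and now - seen > self.t_fail
        }


def simulate(n=8, crashed=3, crash_round=5, rounds=200, seed=1):
    """Return the first round in which every live node suspects `crashed`."""
    rng = random.Random(seed)
    nodes = [GossipNode(i) for i in range(n)]
    for node in nodes:                    # bootstrap: full membership is known
        for other in range(n):
            node.heartbeats.setdefault(other, 0)
            node.last_seen.setdefault(other, 0)

    for now in range(1, rounds + 1):
        alive = [x for x in nodes
                 if not (x.node_id == crashed and now >= crash_round)]
        for node in alive:
            node.tick(now)
        for node in alive:
            target = rng.choice([x for x in nodes if x is not node])
            if target in alive:           # gossip sent to a crashed node is lost
                target.merge(dict(node.heartbeats), now)
        # Simplified flat consensus: all live nodes agree on the suspicion.
        if all(crashed in node.suspects(now) for node in alive):
            return now
    return None


if __name__ == "__main__":
    print("consensus round:", simulate())
```

Because each node sends one list to one random peer per round, no reliable multicast is needed; the cost is that agreement takes several rounds after the crash, which is the consensus time the abstract refers to.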

Author information

Corresponding author

Correspondence to Alan D. George.

About this article

Cite this article

Sistla, K., George, A.D. & Todd, R.W. Experimental Analysis of a Gossip-Based Service for Scalable, Distributed Failure Detection and Consensus. Cluster Computing 6, 237–251 (2003). https://doi.org/10.1023/A:1023592621046

  • DOI: https://doi.org/10.1023/A:1023592621046
