Skip to main content

Group Communication in Partitionable Distributed Systems

  • Chapter
  • First Online:
Advances in Distributed Systems

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1752))

Abstract

We give a formal specification and an implementation for a partitionable group communication service in asynchronous distributed systems. Our specification is motivated by the requirements for building “partition-aware” applications that can continue operating without blocking in multiple concurrent partitions and reconfigure themselves dynamically when partitions merge. The specified service guarantees liveness and excludes trivial solutions; it constitutes a useful basis for building realistic partition-aware applications; and it is implementable in practical asynchronous distributed systems where certain stability conditions hold.

Portions reprinted, with permission, from IEEE Trans. on Software Engineering.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Y. Amir, L.E. Moser, P.M. Melliar-Smith, D.A. Agarwal, and P. Ciarfella. The Totem Single-Ring Ordering and Membership Protocol. ACM Transactions on Computer Systems, 13(4):311–342, November 1995.

    Article  Google Scholar 

  2. Y. Amir and J. Stanton. The Spread Wide-Aread Group Communication System. Technical report, Center of Networking and Distributed Systems, Johns Hopkins University, Baltimore, Mariland, April 1998.

    Google Scholar 

  3. E. Anceaume, B. Charron-Bost, P. Minet, and S. Toueg. On the Formal Specification of Group Membership Services. Technical Report TR95-1534, Department of Computer Science, Cornell University, August 1995.

    Google Scholar 

  4. T. Anker, G. Chockler, D. Dolev, and I. Keidar. Scalable Group Membership Services for Novel Applications. In Proceedings of the DIMACS Workshop on Networks in Distributed Computing, pages 23–42. American Mathematical Society, 1998.

    Google Scholar 

  5. Ö. Babaoğlu, R. Davoli, L.A. Giachini, and M.G. Baker. RELACS: A Communications Infrastructure for Constructing Reliable Applications in Large-Scale Distributed Systems. In Proceedings of the 28th Hawaii International Conference on System Sciences (HICS), pages 612–621, Maui, Hawaii, January 1995.

    Google Scholar 

  6. Ö. Babaoğlu, R. Davoli, and A. Montresor. Group Communication in Partitionable Systems: Specification and Algorithms. Technical Report UBLCS-98-1, Department of Computer Science, University of Bologna, April 1998.

    Google Scholar 

  7. Ö. Babaoğlu, R. Davoli, A. Montresor, and R. Segala. System Support for Partition-Aware Network Applications. In Proceedings of the 18th International Conference on Distributed Computing Systems (ICDCS), pages 184–191, Amsterdam, The Netherlands, May 1998.

    Google Scholar 

  8. K. Birman. The Process Group Approach to Reliable Distributed Computing. Communications of the ACM, 36(12):36–53, December 1993.

    Article  Google Scholar 

  9. K. Birman and R. van Renesse. Reliable Distributed Computing with the ISIS Toolkit. IEEE Computer Society Press, 1994.

    Google Scholar 

  10. T. Chandra, V. Hadzilacos, S. Toueg, and B. Charron-Bost. On the Impossibility of Group Membership. In Proceedings of the 15th ACM Symposium on Principles of Distributed Computing (PODC), pages 322–330, May 1996. Also available as technical report TR95-1533, Department of Computer Science, Cornell University.

    Google Scholar 

  11. T.D. Chandra and S. Toueg. Unreliable Failure Detectors for Reliable Distributed Systems. Journal of the ACM, 43(1):225–267, March 1996.

    Article  MATH  MathSciNet  Google Scholar 

  12. D. Dolev, R. Friedman, I. Keidar, and D. Malki. Failure Detectors in Omission Failure Environments. In Proceedings of the 16th ACM Symposium on Principles of Distributed Computing (PODC), Santa Barbara, California, August 1997. Also available as Technical Report TR96-1608.

    Google Scholar 

  13. D. Dolev and D. Malki. The Transis Approach to High Availability Cluster Communication. Communications of the ACM, 39(4), April 1996.

    Google Scholar 

  14. D. Dolev, D. Malki, and R. Strong. An Asynchronous Membership Protocol that Tolerates Partitions. Technical Report CS94-6, Institute of Computer Science, The Hebrew University of Jerusalem, 1994.

    Google Scholar 

  15. D. Dolev, D. Malki, and R. Strong. A Framework for Partitionable Membership Service. Technical Report CS95-4, Institute of Computer Science, The Hebrew University of Jerusalem, 1995.

    Google Scholar 

  16. D. Dolev, D. Malki, and R. Strong. A Framework for Partitionable Membership Service. In Proceedings of the 15th ACM Symposium on Principles of Distributed Computing (PODC), May 1996.

    Google Scholar 

  17. P.E. Ezhilchelvan, R.A. Macêdo, and S.K. Shrivastava. Newtop: A Fault-Tolerant Group Communication Protocol. In Proceedings of the 15th International Conference on Distributed Computing Systems (ICDCS), Vancouver, BC, Canada, June 1995.

    Google Scholar 

  18. A. Fekete, N. Lynch, and A. Shvartsman. Specifying and Using a Partitionable Group Communication Service. In Proceedings of the 16th ACM Symposium on Principles of Distributed Computing (PODC), Santa Barbara, California, August 1997.

    Google Scholar 

  19. M.J. Fischer, N.A. Lynch, and M.S. Patterson. Impossibility of Distributed Consensus with one Faulty Process. Journal of the ACM, 32(2):374–382, April 1985.

    Article  MATH  Google Scholar 

  20. R. Friedman and R. Van Renesse. Strong and Weak Virtual Synchrony in Horus. Technical Report TR95-1537, Department of Computer Science, Cornell University, March 1995.

    Google Scholar 

  21. M. Hayden. The Ensenble System. PhD thesis, Department of Computer Science, Cornell University, January 1998.

    Google Scholar 

  22. F. Kaashoek and A. Tanenbaum. Group Communication in the Amoeba Distributed Operating System. In Proceedings of the 12th IEEE Symp. on Reliable Distributed Systems, pages 222–230, Arlington, TX, May 1991.

    Google Scholar 

  23. R. Khazan, A. Fekete, and N. Lynch. Multicast Group Communication as a Base for a Load-Balancing Replicated Data Service. In Proceedings of the 12th Symposium on Distributed Computing, August 1998.

    Google Scholar 

  24. C. Malloth. Conception and Implementation of a Toolkit for Building Fault-Tolerant Distributed Applications in Large-Scale Networks. PhD thesis, Ecole Polytechnique Fédérale de Lausanne, 1996.

    Google Scholar 

  25. C. Malloth and A. Schiper. View Synchronous Communication in Large Scale Networks. In Proceedings of the 2nd Open Workshop of the ESPRIT Project Broadcast, Grenoble, France, July 1995.

    Google Scholar 

  26. A. Montresor. A Dependable Registry Service for the Jgroup Distributed Object Model. In Proceedings of the 3rd European Reasearch Seminar on Advances in Distributed Systems (ERSADS), Madeira, Portugal, April 1999.

    Google Scholar 

  27. A. Montresor. The Jgroup Reliable Distributed Object Model. In Proceedings of the 2nd IFIP International Working Conference on Distributed Applications and Systems (DAIS), Helsinki, Finland, June 1999.

    Google Scholar 

  28. L. Moser, P. Melliar-Smith, D. Agarwal, R. Budhia, and C. Lingley-Papadopoulos. Totem: A Fault-Tolerant Group Communication System. Communications of the ACM, 39(4), April 1996.

    Google Scholar 

  29. L.E. Moser, Y. Amir, P.M. Melliar-Smith, and D.A. Agarwal. Extended Virtual Synchrony. In Proceedings of the 14th International Conference on Distributed Computing Systems (ICDCS), Poznan, Poland, June 1994.

    Google Scholar 

  30. G. Neiger. A New Look at Membership Services. In Proceedings of the 15th ACM Symposium on Principles of Distributed Computing (PODC), May 1996.

    Google Scholar 

  31. R. De Prisco, A. Fekete, N. Lynch, and A. Shvartsman. A Dynamic View-Oriented Group Communication Service. In Proceedings of the 17th ACM Symposium on Principles of Distributed Computing (PODC), June 1998.

    Google Scholar 

  32. A. Ricciardi and K. Birman. Using Process Groups to Implement Failure Detection in Asynchronous Environments. In Proceedings of the 10th ACM Symposium on Principles of Distributed Computing (PODC), pages 341–352, August 1991.

    Google Scholar 

  33. A. Schiper and A. Ricciardi. Virtually-synchronous Communication Based on a Weak Failure Suspector. In Proceedings of the 23rd International Symposium on Fault-Tolerant Computing (FTCS), pages 534–543, June 1993.

    Google Scholar 

  34. R. van Renesse, K. Birman, R. Cooper, B. Glade, and P. Stephenson. The Horus System. In K. Birman and R. van Renesse, editors, Reliable Distributed Computing with the Isis Toolkit, pages 133–147. IEEE Computer Society Press, 1993.

    Google Scholar 

  35. R. van Renesse, K.P. Birman, and S. Maffeis. Horus: A Flexible Group Communication System. Communications of the ACM, 39(4):76–83, April 1996.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2000 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Babaoğlu, Ö., Davoli, R., Montresor, A. (2000). Group Communication in Partitionable Distributed Systems. In: Krakowiak, S., Shrivastava, S. (eds) Advances in Distributed Systems. Lecture Notes in Computer Science, vol 1752. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-46475-1_3

Download citation

  • DOI: https://doi.org/10.1007/3-540-46475-1_3

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-67196-1

  • Online ISBN: 978-3-540-46475-4

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics