Abstract
We give a formal specification and an implementation for a partitionable group communication service in asynchronous distributed systems. Our specification is motivated by the requirements for building “partition-aware” applications that can continue operating without blocking in multiple concurrent partitions and reconfigure themselves dynamically when partitions merge. The specified service guarantees liveness and excludes trivial solutions; it constitutes a useful basis for building realistic partition-aware applications; and it is implementable in practical asynchronous distributed systems where certain stability conditions hold.
Portions reprinted, with permission, from IEEE Trans. on Software Engineering.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Y. Amir, L.E. Moser, P.M. Melliar-Smith, D.A. Agarwal, and P. Ciarfella. The Totem Single-Ring Ordering and Membership Protocol. ACM Transactions on Computer Systems, 13(4):311–342, November 1995.
Y. Amir and J. Stanton. The Spread Wide-Aread Group Communication System. Technical report, Center of Networking and Distributed Systems, Johns Hopkins University, Baltimore, Mariland, April 1998.
E. Anceaume, B. Charron-Bost, P. Minet, and S. Toueg. On the Formal Specification of Group Membership Services. Technical Report TR95-1534, Department of Computer Science, Cornell University, August 1995.
T. Anker, G. Chockler, D. Dolev, and I. Keidar. Scalable Group Membership Services for Novel Applications. In Proceedings of the DIMACS Workshop on Networks in Distributed Computing, pages 23–42. American Mathematical Society, 1998.
Ö. Babaoğlu, R. Davoli, L.A. Giachini, and M.G. Baker. RELACS: A Communications Infrastructure for Constructing Reliable Applications in Large-Scale Distributed Systems. In Proceedings of the 28th Hawaii International Conference on System Sciences (HICS), pages 612–621, Maui, Hawaii, January 1995.
Ö. Babaoğlu, R. Davoli, and A. Montresor. Group Communication in Partitionable Systems: Specification and Algorithms. Technical Report UBLCS-98-1, Department of Computer Science, University of Bologna, April 1998.
Ö. Babaoğlu, R. Davoli, A. Montresor, and R. Segala. System Support for Partition-Aware Network Applications. In Proceedings of the 18th International Conference on Distributed Computing Systems (ICDCS), pages 184–191, Amsterdam, The Netherlands, May 1998.
K. Birman. The Process Group Approach to Reliable Distributed Computing. Communications of the ACM, 36(12):36–53, December 1993.
K. Birman and R. van Renesse. Reliable Distributed Computing with the ISIS Toolkit. IEEE Computer Society Press, 1994.
T. Chandra, V. Hadzilacos, S. Toueg, and B. Charron-Bost. On the Impossibility of Group Membership. In Proceedings of the 15th ACM Symposium on Principles of Distributed Computing (PODC), pages 322–330, May 1996. Also available as technical report TR95-1533, Department of Computer Science, Cornell University.
T.D. Chandra and S. Toueg. Unreliable Failure Detectors for Reliable Distributed Systems. Journal of the ACM, 43(1):225–267, March 1996.
D. Dolev, R. Friedman, I. Keidar, and D. Malki. Failure Detectors in Omission Failure Environments. In Proceedings of the 16th ACM Symposium on Principles of Distributed Computing (PODC), Santa Barbara, California, August 1997. Also available as Technical Report TR96-1608.
D. Dolev and D. Malki. The Transis Approach to High Availability Cluster Communication. Communications of the ACM, 39(4), April 1996.
D. Dolev, D. Malki, and R. Strong. An Asynchronous Membership Protocol that Tolerates Partitions. Technical Report CS94-6, Institute of Computer Science, The Hebrew University of Jerusalem, 1994.
D. Dolev, D. Malki, and R. Strong. A Framework for Partitionable Membership Service. Technical Report CS95-4, Institute of Computer Science, The Hebrew University of Jerusalem, 1995.
D. Dolev, D. Malki, and R. Strong. A Framework for Partitionable Membership Service. In Proceedings of the 15th ACM Symposium on Principles of Distributed Computing (PODC), May 1996.
P.E. Ezhilchelvan, R.A. Macêdo, and S.K. Shrivastava. Newtop: A Fault-Tolerant Group Communication Protocol. In Proceedings of the 15th International Conference on Distributed Computing Systems (ICDCS), Vancouver, BC, Canada, June 1995.
A. Fekete, N. Lynch, and A. Shvartsman. Specifying and Using a Partitionable Group Communication Service. In Proceedings of the 16th ACM Symposium on Principles of Distributed Computing (PODC), Santa Barbara, California, August 1997.
M.J. Fischer, N.A. Lynch, and M.S. Patterson. Impossibility of Distributed Consensus with one Faulty Process. Journal of the ACM, 32(2):374–382, April 1985.
R. Friedman and R. Van Renesse. Strong and Weak Virtual Synchrony in Horus. Technical Report TR95-1537, Department of Computer Science, Cornell University, March 1995.
M. Hayden. The Ensenble System. PhD thesis, Department of Computer Science, Cornell University, January 1998.
F. Kaashoek and A. Tanenbaum. Group Communication in the Amoeba Distributed Operating System. In Proceedings of the 12th IEEE Symp. on Reliable Distributed Systems, pages 222–230, Arlington, TX, May 1991.
R. Khazan, A. Fekete, and N. Lynch. Multicast Group Communication as a Base for a Load-Balancing Replicated Data Service. In Proceedings of the 12th Symposium on Distributed Computing, August 1998.
C. Malloth. Conception and Implementation of a Toolkit for Building Fault-Tolerant Distributed Applications in Large-Scale Networks. PhD thesis, Ecole Polytechnique Fédérale de Lausanne, 1996.
C. Malloth and A. Schiper. View Synchronous Communication in Large Scale Networks. In Proceedings of the 2nd Open Workshop of the ESPRIT Project Broadcast, Grenoble, France, July 1995.
A. Montresor. A Dependable Registry Service for the Jgroup Distributed Object Model. In Proceedings of the 3rd European Reasearch Seminar on Advances in Distributed Systems (ERSADS), Madeira, Portugal, April 1999.
A. Montresor. The Jgroup Reliable Distributed Object Model. In Proceedings of the 2nd IFIP International Working Conference on Distributed Applications and Systems (DAIS), Helsinki, Finland, June 1999.
L. Moser, P. Melliar-Smith, D. Agarwal, R. Budhia, and C. Lingley-Papadopoulos. Totem: A Fault-Tolerant Group Communication System. Communications of the ACM, 39(4), April 1996.
L.E. Moser, Y. Amir, P.M. Melliar-Smith, and D.A. Agarwal. Extended Virtual Synchrony. In Proceedings of the 14th International Conference on Distributed Computing Systems (ICDCS), Poznan, Poland, June 1994.
G. Neiger. A New Look at Membership Services. In Proceedings of the 15th ACM Symposium on Principles of Distributed Computing (PODC), May 1996.
R. De Prisco, A. Fekete, N. Lynch, and A. Shvartsman. A Dynamic View-Oriented Group Communication Service. In Proceedings of the 17th ACM Symposium on Principles of Distributed Computing (PODC), June 1998.
A. Ricciardi and K. Birman. Using Process Groups to Implement Failure Detection in Asynchronous Environments. In Proceedings of the 10th ACM Symposium on Principles of Distributed Computing (PODC), pages 341–352, August 1991.
A. Schiper and A. Ricciardi. Virtually-synchronous Communication Based on a Weak Failure Suspector. In Proceedings of the 23rd International Symposium on Fault-Tolerant Computing (FTCS), pages 534–543, June 1993.
R. van Renesse, K. Birman, R. Cooper, B. Glade, and P. Stephenson. The Horus System. In K. Birman and R. van Renesse, editors, Reliable Distributed Computing with the Isis Toolkit, pages 133–147. IEEE Computer Society Press, 1993.
R. van Renesse, K.P. Birman, and S. Maffeis. Horus: A Flexible Group Communication System. Communications of the ACM, 39(4):76–83, April 1996.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2000 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Babaoğlu, Ö., Davoli, R., Montresor, A. (2000). Group Communication in Partitionable Distributed Systems. In: Krakowiak, S., Shrivastava, S. (eds) Advances in Distributed Systems. Lecture Notes in Computer Science, vol 1752. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-46475-1_3
Download citation
DOI: https://doi.org/10.1007/3-540-46475-1_3
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-67196-1
Online ISBN: 978-3-540-46475-4
eBook Packages: Springer Book Archive