Abstract
Fault tolerance is a key aspect of the development of distributed systems, but it is poorly supported in large-scale systems because of the cost of traditional techniques. This paper revisits RBP, a Total Order Broadcast protocol known for its efficiency, which has several interesting characteristics for scalable systems. However, we found a membership flaw in RBP that can lead to inconsistencies among correct processes. Hence, we propose iRBP, an improvement to the RBP algorithm that not only circumvents the membership weaknesses by using recent membership techniques, but also improves its scalability.
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Barchet-Estefanel, L.A. (2003). iRBP – A Fault Tolerant Total Order Broadcast for Large Scale Systems. In: Kosch, H., Böszörményi, L., Hellwagner, H. (eds) Euro-Par 2003 Parallel Processing. Euro-Par 2003. Lecture Notes in Computer Science, vol 2790. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-45209-6_90
DOI: https://doi.org/10.1007/978-3-540-45209-6_90
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40788-1
Online ISBN: 978-3-540-45209-6
eBook Packages: Springer Book Archive