Skip to main content

Topology-Aware Algorithms for Large-Scale Communication

  • Chapter
  • First Online:
Advances in Distributed Systems

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1752))

Abstract

When designing communication protocols there is always a tradeoff between generality and performance. This chapter reports one approach to achieve right balance between these two aspects, using a network model that can be applied to the majority of existing large-scale networks based on reliable high-speed local-area networks interconnected by slower long-haul connections. The approach consists in making visible relevant topological aspects of the uderlying network infrastructure to the protocol designer, and is illustrated by several algorithms that use topology information to achieve improved performance.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Y. Amir, D. Dolev, S. Kramer, and D. Malki. Transis: A communication subsystem for high-availability. In Digest of Papers, The 22nd Int. Symp. on Fault-Tolerant Computing Systems, pages 76–84. IEEE, 1993.

    Google Scholar 

  2. K. Birman and T. Joseph. Reliable communication in the presence of failures. ACM, Transactions on Computer Systems, 5(1), February 1987.

    Google Scholar 

  3. K. Birman, A. Schiper, and P. Stephenson. Lightweight causal and atomic group multicast. ACM, Transactions on Computer Systems, 9(3), August 1991.

    Google Scholar 

  4. T. Chandra and S. Toueg. Unreliable failure detectors for reliable distributed systems. Journal of the ACM, 34(1):225–267, 1996.

    Article  MathSciNet  Google Scholar 

  5. J. Chang and N. Maxemchuck. Reliable broadcast protocols. ACM, Transactions on Computer Systems, 2(3), August 1984.

    Google Scholar 

  6. B. Charron-Bost. Concerning the size of logical clocks in distributed systems. Information Processing Letters, 39(1):11–16, July 1991.

    Article  MATH  MathSciNet  Google Scholar 

  7. D. Dolev, S. Kramer, and D. Malki. Early delivery totally ordered multicast in asynchronous environments. In Digest of Papers, The 23th Int. Symp. on Fault-Tolerant Computing, pages 544–553. IEEE, 1993.

    Google Scholar 

  8. Simon Even. Graph Algorithms. Computer Science Press, 1979.

    Google Scholar 

  9. P. Ezhilchelvan, R. Macedo, and S. Shrivastava. Newtop: A fault-tolerant group communication protocol. In Proceedings of the 15th Int. Conf. on Distributed Computing Systems, pages 296–306. IEEE, 1995.

    Google Scholar 

  10. A. Heybey. The network simulator version 2.1. Technical report, M.I.T., September 1990.

    Google Scholar 

  11. P. John. Statistical Methods in Engineering and Quality Assurance. John Wiley & Sons Inc, 1990.

    Google Scholar 

  12. M. Kaashoek and A. Tanenbaum. Group communication in the Amoeba distributed operating system. In Proceedings of the 11th Int. Conf. on Distributed Computing Systems, pages 222–230. IEEE, 1991.

    Google Scholar 

  13. R. Ladin, B. Liskov, L. Shrira, and S. Ghemawat. Lazy replication: Exploiting the semantics of distributed services. In Proceedings of the Ninth Annual ACM Symp. of Principles of Distributed Computing, pages 43–57, 1990.

    Google Scholar 

  14. L. Lamport. Time, clocks and the ordering of events in a distributed system. Communications of the ACM, 21(7):558–565, July 1978.

    Article  MATH  Google Scholar 

  15. W. Lloyd and P. Kearns. Bounding sequence numbers in distributed systems: a general approach. In Proceedings of the 10th Int. Conf. on Distributed Computing Systems, pages 312–319, Paris, France, May 1990. IEEE.

    Google Scholar 

  16. S. Meldal, S. Sankar, and J. Vera. Exploiting locality in maitaining potential causality. In Procedings of the 10th ACM SIGACT-SIGOPS Symp. on Principles of Distributed Computing, pages 231–239, 1991.

    Google Scholar 

  17. David Mills. Network time protocol (version 2): Specification and implementation. Technical Report RFC 1119, DARPA Network Working Group, September 1989.

    Google Scholar 

  18. L. Moser, P. Melliar-Smith, A. Agarwal, R. Budhia, C. Lingley-Ppadopoulos, and T. Archambault. The Totem system. In Digest of Papers of the 25th Int. Symp. on Fault-Tolerant Computing Systems, pages 61–66. IEEE, June 1995.

    Google Scholar 

  19. L. Peterson, N. Buchholz, and R. Schlichting. Preserving and using context information in interprocess communication. ACM Transactions on Computer Systems, 7(3):217–146, August 1989.

    Article  Google Scholar 

  20. M. Raynal, A. Schiper, and S. Toueg. The causal ordering abstraction and a simple way to implement it. Information processing letters, 39(6):343–350, September 1991.

    Article  MATH  MathSciNet  Google Scholar 

  21. L. Rodrigues, H. Fonseca, and P. Veríssimo. Totally ordered multicast in large-scale systems. In Proceedings of the 16th IEEE Int. Conf. on Distributed Computing Systems, pages 503–510, Hong Kong, May 1996. (extended report available).

    Google Scholar 

  22. L. Rodrigues and P. Veríssimo. xAMp: a multi-primitive group communications service. In Proceedings of the 11th Symp. on Reliable Distributed Systems, pages 112–121. IEEE, 1992.

    Google Scholar 

  23. L. Rodrigues and P. Veríssimo. Causal separators for large-scale multicast communication. In Proceedings of the 15th IEEE Int. Conf. on Distributed Computing Systems, pages 83–91, Vancouver, British Columbia, Canada, May 1995. (extended report available).

    Google Scholar 

  24. A. Schiper, J. Eggli, and A. Sandoz. A New Algorithm to Implement Causal Ordering. In Proceedings of the 3rd Int Workshop on Distributed Algorithms, volume LNCS 392, pages 219–232, Nice-France, September 1989. Springer Verlag.

    Google Scholar 

  25. F. Schneider. Implementing fault-tolerant services using the state machine approach: a tutorial. ACM Computing Surveys, 22(4):290–319, December 1990.

    Article  Google Scholar 

  26. A. Schwarz and F. Mattern. Detecting Causal Relationships in Distributed Computations: In search of the Holy Grail. Technical report, Departement of Computer Science, University of Kaiserlautern, 1991.

    Google Scholar 

  27. M. Singhal and Kshemkalyani A. An Efficient Implementation of vector clocks. Technical report, Ohio State University, October 1990.

    Google Scholar 

  28. P. Stephenson. Fast Causal Multicast. PhD thesis, Cornell Univ., February 1991.

    Google Scholar 

  29. R. van Renesse, Ken Birman, and S. Maffeis. Horus: A flexible group communications system. Communications of the ACM, 39(4):76–83, April 1996.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2000 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Rodrigues, L., Veríssimo, P. (2000). Topology-Aware Algorithms for Large-Scale Communication. In: Krakowiak, S., Shrivastava, S. (eds) Advances in Distributed Systems. Lecture Notes in Computer Science, vol 1752. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-46475-1_6

Download citation

  • DOI: https://doi.org/10.1007/3-540-46475-1_6

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-67196-1

  • Online ISBN: 978-3-540-46475-4

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics