skip to main content
10.1145/3590140.3629122acmconferencesArticle/Chapter ViewAbstractPublication PagesmiddlewareConference Proceedingsconference-collections
research-article

FlexCast: Genuine Overlay-based Atomic Multicast

Published: 27 November 2023 Publication History

Abstract

Atomic multicast is a communication abstraction where messages are propagated to groups of processes with reliability and order guarantees. Atomic multicast is at the core of strongly consistent storage and transactional systems. This paper presents FlexCast, the first genuine overlay-based atomic multicast protocol. Genuineness captures the essence of atomic multicast in that only the sender of a message and the message's destinations coordinate to order the message, leading to efficient protocols. Overlay-based protocols restrict how process groups can communicate. Limiting communication leads to simpler protocols and reduces the amount of information each process must keep about the rest of the system. FlexCast implements genuine atomic multicast using a complete DAG overlay. We experimentally evaluate FlexCast in a geographically distributed environment using gTPC-C, a variation of the TPC-C benchmark that takes into account geographical distribution and locality. We show that, by exploiting genuineness and workload locality, FlexCast outperforms well-established atomic multicast protocols without the inherent communication overhead of state-of-the-art non-genuine multicast protocols.

References

[1]
T. Ahmed-Nacer, P. Sutra, and D. Conan. 2016. The Convoy Effect in Atomic Multicast. In 2016 IEEE 35th Symposium on Reliable Distributed Systems Workshops (SRDSW). IEEE Computer Society, Los Alamitos, CA, USA, 67--72. https://doi.org/10.1109/SRDSW.2016.22
[2]
E. Batista, P. Coelho, E. Alchieri, F. Dotti, and F. Pedone. 2023. FlexCast: genuine overlay-based atomic multicast. https://arxiv.org/abs/2309.14074
[3]
Kenneth P. Birman and Thomas A. Joseph. 1987. Reliable Communication in the Presence of Failures. ACM Trans. Comput. Syst. 5, 1 (jan 1987), 47--76. https://doi.org/10.1145/7351.7478
[4]
Cloudping. 2022. AWS Latency Monitoring Website. https://www.cloudping.co/grid
[5]
Paulo Coelho, Tarcisio Ceolin Junior, Alysson Bessani, Fernando Dotti, and Fernando Pedone. 2018. Byzantine Fault-Tolerant Atomic Multicast. In 2018 48th Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN). 39--50. https://doi.org/10.1109/DSN.2018.00017
[6]
Paulo Coelho, Nicolas Schiper, and Fernando Pedone. 2017. Fast Atomic Multicast. In DSN.
[7]
James C Corbett, Jeffrey Dean, Michael Epstein, Andrew Fikes, Christopher Frost, JJ Furman, Sanjay Ghemawat, Andrey Gubarev, Christopher Heiser, Peter Hochschild, et al. 2012. Spanner: Google's globally-distributed database. In OSDI.
[8]
Transaction Processing Performance Council. 1996. TPC benchmark C Standard Specification. http://www.tpc.org/tpcc/spec/tpcc_current.pdf (1996).
[9]
James Cowling and Barbara Liskov. 2012. Granola: Low-Overhead Distributed Transaction Coordination. In Proceedings of the 2012 USENIX Annual Technical Conference. USENIX, Boston, MA, USA.
[10]
Xavier Défago, André Schiper, and Péter Urbán. 2004. Total order broadcast and multicast algorithms: Taxonomy and survey. ACM Comput. Surv. 36, 4 (2004).
[11]
C. Delporte-Gallet and H. Fauconnier. 2000. Fault-tolerant genuine atomic multicast to multiple groups. In Proceedings of the 12th International Conference on Principles of Distributed Systems (OPODIS). 107--122.
[12]
Dmitry Duplyakin, Robert Ricci, Aleksander Maricq, Gary Wong, Jonathon Duerig, Eric Eide, Leigh Stoller, Mike Hibler, David Johnson, Kirk Webb, Aditya Akella, Kuangching Wang, Glenn Ricart, Larry Landweber, Chip Elliott, Michael Zink, Emmanuel Cecchet, Snigdhaswin Kar, and Prabodh Mishra. 2019. The Design and Operation of CloudLab. In Proceedings of the USENIX Annual Technical Conference (ATC). 1--14. https://www.flux.utah.edu/paper/duplyakin-atc19
[13]
Cynthia Dwork, Nancy Lynch, and Larry Stockmeyer. 1988. Consensus in the presence of partial synchrony. J. ACM 35, 2 (1988), 288--323. https://doi.org/10.1145/42282.42283
[14]
M.J. Fischer, N. A. Lynch, and M. S. Paterson. 1985. Impossibility of distributed consensus with one faulty processor. J.ACM 32, 2 (1985), 374--382.
[15]
Jr. Fritzke, U., P. Ingels, A. Mostefaoui, and M. Raynal. 1998. Fault-Tolerant Total Order Multicast to Asynchronous Groups. In Proceedings of the The 17th IEEE Symposium on Reliable Distributed Systems. 228--234.
[16]
H. Garcia-Molina and A. Spauster. 1989. Message ordering in a multicast environment. In [1989] Proceedings. The 9th International Conference on Distributed Computing Systems. 354--361. https://doi.org/10.1109/ICDCS.1989.37965
[17]
Alexey Gotsman, Anatole Lefort, and Gregory Chockler. 2019. White-Box Atomic Multicast. In 2019 49th Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN). IEEE, 176--187.
[18]
R. Guerraoui and A. Schiper. 2001. Genuine atomic multicast in asynchronous distributed systems. Theor. Comput. Sci. 254, 1-2 (2001), 297--316.
[19]
Vassos Hadzilacos and Sam Toueg. 1994. A Modular Approach to Fault-Tolerant Broadcasts and Related Problems. Technical Report. USA.
[20]
Fabian Kuhn and Rogert Wattenhofer. 2004. Dynamic Analysis of the Arrow Distributed Protocol. In Proceedings of the Sixteenth Annual ACM Symposium on Parallelism in Algorithms and Architectures (Barcelona, Spain) (SPAA '04). Association for Computing Machinery, New York, NY, USA, 294--301. https://doi.org/10.1145/1007912.1007962
[21]
L. Lamport. 1998. The part-time parliament. ACM Transactions on Computer Systems 16, 2 (May 1998), 133--169.
[22]
Long Hoang Le, Mojtaba Eslahi-Kelorazi, Paulo R. Coelho, and Fernando Pedone. 2021. RamCast: RDMA-based atomic multicast. Proceedings of the 22nd International Middleware Conference (2021).
[23]
L. Rodrigues, R. Guerraoui, and A. Schiper. 1998. Scalable atomic multicast. In International Conference on Computer Communications and Networks. 840--847.
[24]
Nicolas Schiper and Fernando Pedone. 2008. On the inherent cost of atomic broadcast and multicast in wide area networks. In International conference on Distributed computing and networking (ICDCN). 147--157.
[25]
Nicolas Schiper and Fernando Pedone. 2008. Solving Atomic Multicast When Groups Crash. In International Conference On Principles Of Distributed Systems (OPODIS). Springer, 481--495.
[26]
F. B. Schneider. 1990. Implementing Fault-Tolerant Services Using the State Machine Approach: A Tutorial. Comput. Surveys 22, 4 (1990), 299--319.
[27]
Daniele Sciascia, Fernando Pedone, and Flavio Junqueira. 2012. Scalable deferred update replication. In Dependable Systems and Networks (DSN), 2012 42nd Annual IEEE/IFIP International Conference on. IEEE, 1--12.
[28]
Alexander Thomson, Thaddeus Diamond, Shu-Chun Weng, Kun Ren, Philip Shao, and Daniel J Abadi. 2012. Calvin: fast distributed transactions for partitioned database systems. In Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data. 1--12.

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
Middleware '23: Proceedings of the 24th International Middleware Conference
November 2023
334 pages
ISBN:9798400701771
DOI:10.1145/3590140
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

In-Cooperation

  • IFIP: International Federation for Information Processing

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 27 November 2023

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Atomic multicast
  2. Consensus
  3. Fault tolerance

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

Middleware '23
Sponsor:

Acceptance Rates

Overall Acceptance Rate 203 of 948 submissions, 21%

Upcoming Conference

MIDDLEWARE '25
26th International Middleware Conference
December 15 - 19, 2025
Nashville , TN , USA

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 67
    Total Downloads
  • Downloads (Last 12 months)34
  • Downloads (Last 6 weeks)4
Reflects downloads up to 07 Mar 2025

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media