Threshold protocols in survivor set systems

Junqueira, Flavio P.; Marzullo, Keith; Herlihy, Maurice; Penso, Lucia Draque

doi:10.1007/s00446-010-0107-3

Published: 09 June 2010

Distributed Computing, Volume 23, pages 135–149 (2010)

Abstract

Many replication protocols employ a threshold model to express the failures they are able to tolerate. In this model, one assumes that no more than t out of n components can fail, which is a good representation when failures are independent and identically distributed (IID). In many real systems, however, failures are not IID, and a straightforward application of threshold protocols yields suboptimal results. Here, we examine the problem of transforming threshold protocols into survivor-set protocols that tolerate dependent failures. Our main goal is to show the equivalence between the threshold model and the core/survivor-set model. Toward this goal, we develop techniques to transform threshold protocols into survivor-set ones. Our techniques do not require authentication, self-verification, or encryption. In one case, our results show that we can transform a threshold protocol to a subset by spreading a number of processes across processors. This technique treats a given threshold algorithm as a black box, and consequently can transform any threshold algorithm. However, it has the disadvantage that the transformation is not possible for all sets of survivor sets. The second technique instead focuses on transforming voters: functions that evaluate to a value out of a set of tallied values in a replication protocol. Voters are an essential part of many fault-tolerant protocols, and we show a universal way of transforming them. With such a transformation, we expect that a large number of protocols in the literature can be directly transformed with our technique. Whether the two models are equivalent remains an open problem, however, and our results constitute an important first step in this direction.
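To make the two failure models concrete, the following minimal Python sketch treats the threshold model as a special case of a survivor-set system and shows one plausible shape for a voter based on cores. It is an illustration under stated assumptions, not the construction from the article: the function names, the two-rack example, and the rule used by `core_voter` (accept a value reported by every member of some core) are all hypothetical. The sketch assumes the usual definitions, in which a survivor set is a minimal set of processes that may be exactly the correct ones in some execution, and a core is a minimal set that intersects every survivor set.

```python
from itertools import combinations


def threshold_survivor_sets(processes, t):
    """Threshold model as a special case: every subset of size n - t is a survivor set."""
    procs = sorted(processes)
    return {frozenset(c) for c in combinations(procs, len(procs) - t)}


def is_tolerated(faulty, survivor_sets):
    """A failure pattern is tolerated if some survivor set contains no faulty process."""
    return any(s.isdisjoint(faulty) for s in survivor_sets)


def cores(processes, survivor_sets):
    """Brute-force the minimal sets that intersect every survivor set.

    Candidates are visited in order of increasing size, so a candidate is
    minimal exactly when no previously found core is contained in it.
    """
    procs = sorted(processes)
    found = []
    for k in range(1, len(procs) + 1):
        for combo in combinations(procs, k):
            candidate = frozenset(combo)
            hits_all = all(candidate & s for s in survivor_sets)
            if hits_all and not any(m <= candidate for m in found):
                found.append(candidate)
    return set(found)


def core_voter(tally, core_sets):
    """Hypothetical voter: return a value reported by every member of some core.

    tally maps process -> reported value. Every core contains at least one
    correct process, so a returned value was reported by a correct process.
    Returns None if no value qualifies.
    """
    for value in sorted(set(tally.values())):
        senders = {p for p, v in tally.items() if v == value}
        if any(c <= senders for c in core_sets):
            return value
    return None


# Assumed example: four processes on two racks, where a whole rack may fail.
PROCESSES = {"a", "b", "c", "d"}
RACK_SURVIVOR_SETS = {frozenset({"a", "b"}), frozenset({"c", "d"})}
RACK_CORES = cores(PROCESSES, RACK_SURVIVOR_SETS)
```

In this example, the rack system tolerates the simultaneous failure of `a` and `b`, which a threshold system with n = 4 and t = 1 does not. With t = 1, the cores are exactly the subsets of size t + 1 = 2, which is why the familiar "t + 1 matching values" voter appears as a special case of the core-based rule sketched here.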

Author information

Authors and Affiliations

Flavio P. Junqueira: Yahoo! Research, Barcelona, Spain
Keith Marzullo: UC San Diego, 9500 Gilman Dr., La Jolla, CA, 92093, USA
Maurice Herlihy: Brown University, Providence, USA
Lucia Draque Penso: TU Ilmenau, Ilmenau, Germany

Corresponding author

Correspondence to Flavio P. Junqueira.

Additional information

Some elements of this paper appear in the paper entitled “Optimizing threshold protocols in adversarial structures” in the Proceedings of the 22nd International Symposium on Distributed Computing (DISC’08).

About this article

Cite this article

Junqueira, F.P., Marzullo, K., Herlihy, M. et al. Threshold protocols in survivor set systems. Distrib. Comput. 23, 135–149 (2010). https://doi.org/10.1007/s00446-010-0107-3

Received: 10 January 2009
Accepted: 06 April 2010
Published: 09 June 2010
Issue Date: October 2010
DOI: https://doi.org/10.1007/s00446-010-0107-3
