Years and Authors of Summarized Original Work
-
1996; Chandra, Toueg
Problem Definition
A distributed system is comprised of a collection of processes. The processes typically seek to achieve some common task by communicating through message passing or shared memory. Most interesting tasks require, at least at certain points of the computation, some form of agreement between the processes. An abstract form of such agreement is consensus where processes need to agree on a single value among a set of proposed values. Solving this seemingly elementary problem is at the heart of reliable distributed computing and, in particular, of distributed database commitment, total ordering of messages, and emulations of many shared object types.
Fischer, Lynch, and Paterson's seminal result in the theory of distributed computing [13] says that consensus cannot be deterministically solved in an asynchronousdistributed system that is prone to process failures. This impossibility holds consequently for...
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Recommended Reading
Aguilera MK, Delporte-Gallet C, Fauconnier H, Toueg S (2003) On implementing omega with weak reliability and synchrony assumptions. In: 22th ACM symposium on principles of distributed computing, pp 306–314
Bertier M, Marin O, Sens P (2003) Performance analysis of a hierarchical failure detector. In: Proceedings 2003 international conference on dependable systems and networks (DSN 2003), San Francisco, 22–25 June 2003, pp 635–644
Boroswsky E, Gafni E (n.d.) Generalized FLP impossibility result for t-resilient asynchronous computations. In: Proceedings of the 25th ACM symposium on theory of computing. ACM, pp 91–100
Chandra TD, Hadzilacos V, Toueg S (1996) The weakest failure detector for solving consensus. J ACM 43(4):685–722
Chandra TD, Toueg S (1996) Unreliable failure detectors for reliable distributed systems. J ACM 43(2):225–267
Chauduri S (1993) More choices allow more faults: set consensus problems in totally asynchronous systems. Inf Comput 105(1):132–158
Chen W, Toueg S, Aguilera MK (2002) On the quality of service of failure detectors. IEEE Trans Comput 51(1):13–32
Delporte-Gallet C, Fauconnier H, Guerraoui R (2002) Failure detection lower bounds on registers and consensus. In: Proceedings of the 16th international symposium on distributed computing, LNCS, vol 2508
Delporte-Gallet C, Fauconnier H, Guerraoui R (2005) Implementing atomic objects in a message passing system. Technical report, EPFL Lausanne
Dwork C, Lynch NA, Stockmeyer L (1988) Consensus in the presence of partial synchrony. J ACM 35(2):288–323
Felber P, Guerraoui R, Fayad M (1999) Putting oo distributed programming to work. Commun ACM 42(11):97–101
Fernández A, Jiménez E, Raynal M (2006) Eventual leader election with weak assumptions on initial knowledge, communication reliability and synchrony. In: Proceedings of the international symposium on dependable systems and networks (DSN), pp 166–178
Fischer MJ, Lynch NA, Paterson MS (1985) Impossibility of distributed consensus with one faulty process. J ACM 32(2):374–382
Guerraoui R (2000) Indulgent algorithms. In: Proceedings of the 19th annual ACM symposium on principles of distributed computing, ACM, Portland, pp 289–297
Herlihy M (1991) Wait-free synchronization. ACM Trans Program Lang Syst 13(1):123–149
Herlihy M, Shavit N (1993) The asynchronous computability theorem for t-resilient tasks. In: Proceedings of the 25th ACM symposium on theory of computing, pp 111–120
Keidar I, Rajsbaum S (2002) On the cost of fault-tolerant consensus when there are no faults-a tutorial. In: Tutorial 21st ACM symposium on principles of distributed computing
Lamport L (1998) The part-time parliament. ACM Trans Comput Syst 16(2):133–169
Lo W-K, Hadzilacos V (1994) Using failure detectors to solve consensus in asynchronous shared memory systems. In: Proceedings of the 8th international workshop on distributed algorithms. LNCS, vol 857, pp 280–295
Lynch N (1996) Distributed algorithms. Morgan Kauffman
Michel R, Corentin T (2006) In search of the holy grail: looking for the weakest failure detector for wait-free set agreement. Technical Report TR 06-1811, INRIA
Saks M, Zaharoglou F (1993) Wait-free k-set agreement is impossible: the topology of public knowledge. In: Proceedings of the 25th ACM symposium on theory of computing, ACM, pp 101–110
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer Science+Business Media New York
About this entry
Cite this entry
Guerraoui, R. (2016). Failure Detectors. In: Kao, MY. (eds) Encyclopedia of Algorithms. Springer, New York, NY. https://doi.org/10.1007/978-1-4939-2864-4_140
Download citation
DOI: https://doi.org/10.1007/978-1-4939-2864-4_140
Published:
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4939-2863-7
Online ISBN: 978-1-4939-2864-4
eBook Packages: Computer ScienceReference Module Computer Science and Engineering