Abstract
To maintain consistency on a distributed database system, a commit protocol is often employed to ensure that either all the sub-transactions of a transaction commit or all abort. Special care needs to be taken to ensure the correct functioning of a commit protocol in the face of node failures during the execution of the protocol. A protocol is said to be fault-tolerant if it properly commits or aborts, even if nodes fail.
In this paper, we present a fault tolerant decentralized commit protocol which is message optimal in the absence of failures. Furthermore, the protocol (FTDCP) ensures that all sites can achieve a consistent final decision by at most 4 extra messages in the single site failure situation.
Similar content being viewed by others
References
J. N. Gray,Notes on database operating systems, in Operating Systems: An Advanted Course, Berlin: Springer-Verlag, 1979.
M. Hammer and D. Shipman,Reliability mechanisms for SDD-1:A system for distributed databases, Tech. Rep., Computer Corporation of America, Cambridge, Mass., July 1979.
C. Mohan and B. Lindsay,Efficient commit protocols for the tree of processes model of distributed transactions, in Proc. of 2nd ACM SIGACT/SIGOPS Symposium on Principles of Distributed Computing, (Montreal, Canada), pp. 76–80, Aug. 1983.
T. V. Lakshman and A. K. Agrawala,Communication structure of decentralized commit protocols, in Proc. of 6th Int. Conf. on Distributed Computing Systems, (Boston), IEEE, May 1986.
S. Yuan and A. K. Agrawala,A class of optimal decentralized commit protocols, in Proc. of 8th Int. Conf. on Distributed Computing Systems, (San Jose, CA), pp. 234–241, IEEE, June 1988.
D. Skeen,Nonblocking commit protocols, in ACM SIGMOD Int. Conf. on Management of Data, pp. 133–142, ACM, 1981.
D. Skeen and M. Stonebraker,A formal model of crash recovery in a distributed system, IEEE Trans. Software Eng., vol. 9, pp. 219–228, May 1983.
S. Yuan and P. Jalote,Fault tolerant commit protocols, in Proc. of 5th int. Data Eng. Conf., (Los Angeles, CA), pp. 280–286, IEEE, Feb. 1989.
M. J. Fisher, N. A. Lynch, and M. S. Paterson,Impossibility of distributed consensus with one faulty process, J. ACM, vol. 32, pp. 374–382, April 1985.
R. D. Schlichting and F. B. Schneider,Fail-stop processor: An approach to designing fault-tolerant computing systems, ACM Trans. on Computer Systems, vol. 1, pp. 222–238, Aug. 1983.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Yuan, SM. An efficient fault-tolerant decentralized commit protocol for single site failure. BIT 31, 53–68 (1991). https://doi.org/10.1007/BF01952783
Received:
Revised:
Issue Date:
DOI: https://doi.org/10.1007/BF01952783