Communication-based prevention of useless checkpoints in distributed computations

Hélary, J.-M.; Mostefaoui, A.; Netzer, R.H.B.; Raynal, M.

doi:10.1007/s004460050003

Communication-based prevention of useless checkpoints in distributed computations

Original articles
Published: January 2000

Volume 13, pages 29–43, (2000)
Cite this article

Distributed Computing Aims and scope Submit manuscript

J.-M. Hélary¹,
A. Mostefaoui¹,
R.H.B. Netzer² &
…
M. Raynal¹

97 Accesses
69 Citations
3 Altmetric
Explore all metrics

Summary.

A useless checkpoint is a local checkpoint that cannot be part of a consistent global checkpoint. This paper addresses the following problem. Given a set of processes that take (basic) local checkpoints in an independent and unknown way, the problem is to design communication-induced checkpointing protocols that direct processes to take additional local (forced) checkpoints to ensure no local checkpoint is useless.

The paper first proves two properties related to integer timestamps which are associated with each local checkpoint. The first property is a necessary and sufficient condition that these timestamps must satisfy for no checkpoint to be useless. The second property provides an easy timestamp-based determination of consistent global checkpoints. Then, a general communication-induced checkpointing protocol is proposed. This protocol, derived from the two previous properties, actually defines a family of timestamp-based communication-induced checkpointing protocols. It is shown that several existing checkpointing protocols for the same problem are particular instances of the general protocol. The design of this general protocol is motivated by the use of communication-induced checkpointing protocols in “consistent global checkpoint”-based distributed applications such as the detection of stable or unstable properties and the determination of distributed breakpoints.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+

from $39.99 /Month

Starting from 10 chapters or articles per month
Access and download chapters and articles from more than 300k books and 2,500 journals
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Monitoring Distributed Component-Based Systems

Optimal Design of Checkpoint Systems with General Structures, Tasks and Schemes

Compact Privacy Protocols from Post-quantum and Timed Classical Assumptions

Author information

Authors and Affiliations

IRISA, Université de Rennes, Campus de Beaulieu, F-35042 Rennes Cedex, France (e-mail: {helary,mostefaoui,raynal}@irisa.fr), , , , , , FR
J.-M. Hélary, A. Mostefaoui & M. Raynal
Computer Science Department, Brown University, Box 1910, Providence, RI 02921, USA (e-mail: rn@cs.brown.edu), , , , , , US
R.H.B. Netzer

Authors

J.-M. Hélary
View author publications
Search author on:PubMed Google Scholar
A. Mostefaoui
View author publications
Search author on:PubMed Google Scholar
R.H.B. Netzer
View author publications
Search author on:PubMed Google Scholar
M. Raynal
View author publications
Search author on:PubMed Google Scholar

Additional information

Received: July 1997 / Accepted: August 1999

Rights and permissions

Reprints and permissions

About this article

Cite this article

Hélary, JM., Mostefaoui, A., Netzer, R. et al. Communication-based prevention of useless checkpoints in distributed computations. Distrib Comput 13, 29–43 (2000). https://doi.org/10.1007/s004460050003

Download citation

Issue Date: January 2000
DOI: https://doi.org/10.1007/s004460050003

Key words:Asynchronous distributed system – Checkpointing protocols – Fault-Tolerance

Access this article

Log in via an institution

Subscribe and save

Springer+

from $39.99 /Month

Starting from 10 chapters or articles per month
Access and download chapters and articles from more than 300k books and 2,500 journals
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Communication-based prevention of useless checkpoints in distributed computations

Summary.

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Monitoring Distributed Component-Based Systems

Optimal Design of Checkpoint Systems with General Structures, Tasks and Schemes

Compact Privacy Protocols from Post-quantum and Timed Classical Assumptions

Explore related subjects

Author information

Authors and Affiliations

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Subscribe and save

Buy Now