Cyclic Strategies for Balanced and Fault-Tolerant Distributed Storage

Marcelín-Jiménez, Ricardo; Rajsbaum, Sergio

doi:10.1007/978-3-540-45214-0_17

Ricardo Marcelín-Jiménez⁷ &
Sergio Rajsbaum⁸

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2847))

Included in the following conference series:

Latin-American Symposium on Dependable Computing

354 Accesses
2 Citations

Abstract

Given a set V of active components in charge of a distributed execution, a storage scheme is a sequence of subsets, B ₁,B ₂,...,B _b, of V where, succesive global states are recorded. The subsets, called blocks, have the same size and are scheduled according to some fixed and cyclic calendar of b steps. During i-th step, block B _i is selected. Next, a global snapshot is taken and each component sends its corresponding local state to one of the appointed places in B _i, in a way that each component stores (approx.) the same number of local states. Afterwards, if a component of B _i crashes, all of the data stored in the block is useless, because the global state can not be reconstructed. In this case, the information recorded in an earlier block can be used to recover a global state, provided there is at least one such block where no component has crashed. The goal is to design storage schema that tolerate as many crashes as possible, while trying to have each component participating in as few blocks as possible and, at the same time, working with large blocks (so that a component in a block stores a small number of local states). In this paper several such schema are described and compared in terms of these measures.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Azagury, A., Factor, M.E., Satran, J.: Point-in-time copy: Yesterday, today and tomorrow. In: 19th IEEE Symposium on Mass Storage Systems, pp. 259–270. IEEE, Los Alamitos (2002)
Google Scholar
Chandy, M., Lamport, L.: Distributed snapshots: Determining global states in distributed systems. ACM Trans. on Computer Science 3(1), 63–75 (1985)
Article Google Scholar
Babaoglu, O., Marzullo, K.: Consistent global states of distributed systems: Fundamental concepts and mechanisms. In: Mullender, S. (ed.) Distributed Systems, 2nd edn. ch. 4, pp. 55–96. ACM, New York (1993)
Google Scholar
Fraleigh, J.B.: Álgebra Abstracta. Addison-Wesley, Iberoamericana (1987)
Google Scholar
Rhea, S., et al.: Maintenance-free global data storage. IEEE Internet Computing, pp. 40–49 (September/October 2001)
Google Scholar
MacWilliams, F.J., Sloane, N.J.: The Theory of Error-correcting codes, 8th edn. North-Holland, Amsterdam (1993)
Google Scholar
Tonchev, V.: Combinatorial Configurations Designs, Codes, Graphs. Longman Scientific and Technical (1988)
Google Scholar
Stinson, D.R.: An introduction to combinatorial designs. Technical report, University of Waterloo, Dept. of Combinatorics and Optimization (December 1999)
Google Scholar
Colbourn, C.J., Dinitz, J.H., Stinson, D.R.: Applications of combinatorial designs to communications, cryptography, and networking. Technical report, University of Vermont (2000)
Google Scholar
Kotz, D.: Introduction to multiprocessor i/o architecture. In: Jain, R., Werth, J., Browne, J.C. (eds.) Input/Output in Parallel and Distributed Computer Systems. Kluwer Academic Publishers, Dordrecht (1996)
Google Scholar
Mourad, A.N., Fuchs, K.W., Saab, D.G.: Site partitioning for redundant arrays of distributed disks. Journal of Parallel and Distributed Computing (33), 1–11 (1996)
Google Scholar
Berenbrink, P., Brinkmann, A., Scheideler, C.: Design of the presto multimedia storage network. In: International Workshop on Communication and Data Management in Large Networks, pp. 2–12 (1999)
Google Scholar
Russell, A., Shvartsman, A.A.: Distributed computation meets design theory: Local schedulling for disconnected operations. In: Bulletin of the EATCS, June 2002, pp. 120–131 (2002)
Google Scholar
Bhagwan, R., Moore, D., Savage, S., Voelker, G.: Replication strategies for highly available peer-to-peer storage. In: International Workshop on Future Directions in Distributed Computing (2002)
Google Scholar
Rabin, M.O.: Efficient dispersal of information for security, load balancing and fault tolerance. Journal of the ACM 36(2), 335–348 (1989)
Article MATH MathSciNet Google Scholar
Peleg, D., Wool, A.: The availability of quorum systems. Information and Computation 123(2), 210–223 (1995)
Article MATH MathSciNet Google Scholar
Garg, V.K., Ghosh, J.: Repeated computation of global functions in a distributed environment. IEEE Trans. on Parallel and Distributed Systems 5(8), 823–834 (1994)
Article Google Scholar
Lynch, N.: Distributed Algorithms. Morgan Kaufman Pub., San Francisco (1996)
MATH Google Scholar
Deswarte, Y.: Tolérance aux fautes, sécurité et protection. In: Balter, R., et al. (eds.) Construction des Systèmes d’exploitation Répartis. INRIA, Paris, France (1991)
Google Scholar
Kreher, D.L., Stinson, D.R.: Combinatorial Algorithms. CRC Press, Boca Raton (1998)
Google Scholar
Batten, L.M.: Combinatorics of finite geometries. Cambridge University Press, Cambridge (1997)
Book MATH Google Scholar
Dembowski, P.: Finite Geometries. Springer, Heidelberg (1968)
MATH Google Scholar
Cederberg, J.N.: A Course in Modern Geometries. Springer, Heidelberg (1989)
MATH Google Scholar
Bruen, A.A., Silverman, R.: Acs and blocking sets ii. European Journal of Combinatorics 8(4), 351–356 (1987)
MATH MathSciNet Google Scholar
Brouwer, A.E., Schrijver, A.: The blocking number of an affine space. Journal of Combinatorial Theory ser. A 24(2), 251–253 (1978)
Article MATH MathSciNet Google Scholar

Download references

Author information

Authors and Affiliations

IIMAS, UNAM and EE Dept., UAM-Iztapalapa, Atlixco 186, 09340, México, D.F.
Ricardo Marcelín-Jiménez
Instituto de Matemáticas, UNAM, Ciudad Universitaria, 04510, México, D.F.
Sergio Rajsbaum

Authors

Ricardo Marcelín-Jiménez
View author publications
You can also search for this author in PubMed Google Scholar
Sergio Rajsbaum
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Computing Laboratory, University of Kent,
Rogério de Lemos
Instituto de Informática, Universidade Federal do Rio Grande do Sul, Caixa Postal 15064 – 90501-970, Porto Alegre, RS, Brazil
Taisy Silva Weber
Escola Politécnica Travessa 3, Número, Universidade de São Paulo,, 158, Edifício de Engenharia Eletricidade, 05508-900, São Paulo, SP, Brazil
João Batista Camargo Jr.

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Marcelín-Jiménez, R., Rajsbaum, S. (2003). Cyclic Strategies for Balanced and Fault-Tolerant Distributed Storage. In: de Lemos, R., Weber, T.S., Camargo, J.B. (eds) Dependable Computing. LADC 2003. Lecture Notes in Computer Science, vol 2847. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-45214-0_17

Download citation

DOI: https://doi.org/10.1007/978-3-540-45214-0_17
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-20224-0
Online ISBN: 978-3-540-45214-0
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics