SeWDReSS: on the design of an application independent, secure, wide-area disaster recovery storage system

Zeng, Lingfang; Veeravalli, Bharadwaj; Wei, Qingsong; Feng, Dan

doi:10.1007/s11042-011-0734-0

SeWDReSS: on the design of an application independent, secure, wide-area disaster recovery storage system

Published: 23 February 2011

Volume 58, pages 543–568, (2012)
Cite this article

Multimedia Tools and Applications Aims and scope Submit manuscript

Lingfang Zeng¹,
Bharadwaj Veeravalli¹,
Qingsong Wei² &
…
Dan Feng³

212 Accesses
2 Citations
Explore all metrics

Abstract

Distributed wide-area storage systems must tolerate both physical failure and logic errors. In particular, these functions are needed to enable the storage system to support remote disaster recovery. There are several solutions for distributed wide-area backup/archive systems implemented at application level, file system level or at storage subsystem level. However, they suffer from high deployment cost and security issues. Moreover, previous researches in literature only focus on any disk-related failures and ignore the fact that storage server linked predominantly to a Wide-Area-Network (WAN) which may be unavailable or owing to network failures. In this paper, we first model the efficiency and reliability of distributed wide area storage systems for all media, taking both network failures and disk failures into consideration. To provide higher performance, efficiency, reliability, and security to the wide-area disaster recovery storage systems, we present a configurable RAID-like data erasure-coding scheme referred to as Replication-based Snapshot Redundant Array of Independent Imagefiles (RSRAII). We argue that this scheme has benefits resulting from the consolidation of both erasure-coding and replication strategies. To this end, we propose a novel algorithm to improve the snapshot performance referred to as SMPDP (Snapshot based on Multi-Parallel Degree Pipeline). We also extend this study towards implementing a prototype system, called as SeWDReSS, which is shown to strike a tradeoff between reliability, storage space, security, and performance for distributed wide-area disaster recovery.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

CRL: Efficient Concurrent Regeneration Codes with Local Reconstruction in Geo-Distributed Storage Systems

Article 19 November 2018

H-V: An Improved Coding Layout Based on Erasure Coded Storage System

Novel Data Placement Algorithm for Distributed Storage System Based on Fault-Tolerant Domain

Article 26 November 2020

References

Anastasiadis MSS, Sevcik K (2002) Maximizing throughput in replicated disk striping of variable bit-rate streams. Proc. of the annual USENIX technical conference, pp 191–204
Apple Computer, Inc. (2009) Apple .mac service. http://www.apple.com/mobileme/. Last accessed September 2009
Amazon.com, Inc. (2009) Amazon simple storage service. http://aws.amazon.com/s3. Last accessed September 2009
Bell WH, Cameron DG, Carvajal-Schiaffino R, Millar AP, Stockinger K, Zini F (2003) Evaluation of an economy-based file replication strategy for a data grid. In: Proc. of the 3rd international symposium on cluster computing and the grid, pp 661–668
Bhagwat D, Pollack K, Long D, Schwarz T, Miller EL, Paris J-F (2006) Providing high reliability in a minimum redundancy archival storage system. In: Proc. of MASCOTS’06, pp 413–421
Blake C, Rodrigues R (2003) High availability, scalable storage, dynamic peer networks: pick two. In: Proc. of HotOS’03, pp 1–6
Bloemer J, Kalfane M, Karpinski M, Karp RM, Luby M, Zuckerman D (1995) An XOR-based erasure-resilient coding scheme. Technical Report TR-95-048. Berkeley, CA
Chen Y, Edler J, Goldberg A, Gottlieb A, Sobti S, Yianilos P (1996) Prototype implementation of archival intermemory. In: Proc. of ICDE’96, pp 485–495
Chou C, Golubchik L, Lui JCS (2000) Striping doesn’t scale: how to achieve scalability for continuous media servers with replication. In: Proc. of ICDCS’00, pp 64–71
Chun B, Dabek F, Haeberlen A, Sit E, Weatherspoon H, Kaashoek MF, Kubiatowicz J, Morris R (2006) Efficient replica maintenance for distributed storage systems. In: Proc. of NSDI’06
Douglis F, Iyengar A (2003) Application-specific delta encoding via resemblance detection. In: Proc. of the 2003 USENIX annual technical conference, pp 113–126
EMC Corporation (2002) EMC centera: content addressed storage system. Data Sheet
Gerasimov I, Zhuravlev A, Pershin M, Gerasimov DV (2003) Design and implementation of a block storage multi-protocol converter. In: Proc. of MSST’03
Haeberlen A, Mislove A, Druschel P (2005) Glacier: highly durable, decentralized storage despite massive correlated failures. In: Proc. of NSDI’05
Hartman J, Ousterhout J (1993) The Zebra striped network file system. In: Proc. of ACM SOSP’93, pp 29–43
Hennessy J, Patterson D (2006) Computer architecture: a quantitative approach, 4th edn. Morgan Kaufmann
Iometer benchmark (2006) http://sourceforge.net/projects/Iometer/. Accessed 27 July 2006.
Katcher J (1997) PostMark: a new file system benchmark. Network Appliance, Technical Report 3022
Kubiatowicz J, Bindel D, Chen Y et al (2000) Oceanstore: an architecture for global-scale persistent storage. In: Proc. of ASPLOS’00
Laure E (2003) The architecture of the European datagrid. The European DataGrid Project Team, Technical Report
Li J, Krohn M, Mazi‘eres D, Shasha D (2004) Secure untrusted data repository (sundr). In: Proc. of OSDI’04, pp 121–136
Mani Chandy K, Lamport L (1985) Distributed snapshots: determining global states of distributed systems. ACM Trans Comput Syst 3(1):63–75
Article Google Scholar
Maniatis P, Roussopoulos M, Giuli TJ, Rosenthal DSH, Baker M (2005) The LOCKSS Peer-to-Peer digital preservation system. ACM Trans Comput Syst 23(1):2–50
Article Google Scholar
Mauelshagen H (2004) Linux cluster logical volume manager. In: Proc. of the 11th Linux Kongress, Erlangen, Germany
Moore RW, Terekhov I, Chervenak A, Studham S, Watson C, Stockinger H (2002) Data grid implementations. Global Grid Forum. Technical report. http://www.ppdg.net/docs/WhitePapers/Capabilities-grids.v6.pdf
Muthitacharoen A, Chen B, Mazieres D (2001) A lowbandwidth network file system. In: Proc. of SOSP’01
National Security Agency (2006) Global information grid (GIG). http://www.nsa.gov/ia/industry/gig.cfm. Last accessed September 2006
Patterson D, Gibson G, Katz R (1988) The case for RAID: redundant arrays of inexpensive disks. In: Proc. of ACM SIGMOD conf., pp 106–113
Plank JS (2003) A tutorial on Reed–Solomon coding for fault tolerance in RAID-like systems. Technical Report UT-CS-03-504
Ranganathan K, Foster I (2001) Identifying dynamic replication strategies for a high performance data grid. In: Proc. of the international grid computing workshop
Ratnasamy S, Francis P, Handley M, Karp R, Shenker S (2001) A scalable content-addressable network. In: Proc. of ACM SIGCOMM’01. San Diego, CA
Rhea S, Eaton P, Geels D, Weatherspoon H, Zhao B, Kubiatowicz J (2003) Pond: the oceanstore prototype. In: Proc. of FAST’03
Rowstron A, Druschel P (2001) Pastry: scalable, distributed object location and routing for large-scale Peer-to-Peer systems. In: Proc. of IFIP/ACM international conference on distributed systems platforms (Middleware’01)
Schnor B, Petri S, Oleyniczak R, Langendorfer H (1996) Scheduling of parallel applications on heterogeneous workstation clusters. In: Proc. of PDCS’96, pp 330–337
Schwarz TSJ, Miller EL (2006) Store, forget, and check: using algebraic signatures to check remotely administered storage. In: Proc. of ICDCS’06
Shenoy P, Vin HM (1999) Efficient striping techniques for variable bit rate continuous media file servers. Perform Eval 38(3–4):175–199
Article Google Scholar
Storer MW, Greenan K, Miller EL (2006) Long-term threats to secure archives. In: Proc. of StorageSS’06
Storer MW, Greenan K, Miller EL, Voruganti K (2007) POTSHARDS: secure long-term storage without encryption. In: Proc. of the 2007 USENIX technical conference
Trivedi KS (2002) Probability & statistics with reliability, queuing and computer science applications, 2nd edn. John Wiley & Sons
UNH iSCSI reference implementation. http://www.iol.unh.edu/consortiums/iscsi. Last accessed September 2006
Varma A, Jacobson Q (1998) Destage algorithms for disk arrays with nonvolatile caches. IEEE Trans Comput 47(2):228–235
Article Google Scholar
Wang RY, Krishnamurthy A, Martin RP et al (1998) Modeling communication pipeline latency. In: Proc. of the ACM SIGMETRICS joint international conference on measurement and modeling of computer systems, pp 22–32, NY, USA
Weatherspoon H (2006) Design and evaluation of distributed wide-area on-line archival storage systems. Technical Report No. UCB/EECS-2006-130
Weatherspoon H, Kubiatowicz JD (2002) Erasure coding vs. replication: a quantitative comparison. In: Proc. of IPTPS’02
Weatherspoon H, Eaton P, Chun B, Kubiatowicz J (2007) Antiquity: exploiting a secure log for wide-area distributed storage. In: Proc. of Eurosys’07
Xiao W, Liu Y, Yang Q, Ren J, Xie C (2006) Implementation and performance evaluation of two snapshot methods on iSCSI target storages. In: Proc. of MSST’06)
Xin Q (2007) Understanding and coping with failures in large-scale storage systems. Technical Report UCSC-SSRC-07-06
You LL, Pollack KT, Long DDE (2005) Deep store: an archival storage system architecture. In: Proc. of ICDE’05
Zhao B, Kubiatowicz J, Joseph A (2001) Tapestry: an infrastructure for fault-tolerant widearea location and routing. Technical Report UCB/CSD-01-1141, UC Berkeley

Download references

Acknowledgements

The authors would like to thank the referees for their comments. This work is supported partly by A*STAR, Singapore, under grant R − 263 − 000 − 345 − 305, and the National Basic Research Program of China (973 Program) No. 2011CB302301.

Author information

Authors and Affiliations

Computer Networks and Distributed Systems Laboratory, Department of Electrical and Computer Engineering, The National University of Singapore, Singapore, 117576, Singapore
Lingfang Zeng & Bharadwaj Veeravalli
Data Center Technologies Division, Data Storage Institute, Agency for Science, Technology and Research (A*STAR), Singapore, 117608, Singapore
Qingsong Wei
Key Lab of Information Storage, Ministry of Education, Huazhong University of Science and Technology, Hubei, 430074, People’s Republic of China
Dan Feng

Authors

Lingfang Zeng
View author publications
You can also search for this author inPubMed Google Scholar
Bharadwaj Veeravalli
View author publications
You can also search for this author inPubMed Google Scholar
Qingsong Wei
View author publications
You can also search for this author inPubMed Google Scholar
Dan Feng
View author publications
You can also search for this author inPubMed Google Scholar

Corresponding author

Correspondence to Bharadwaj Veeravalli.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Zeng, L., Veeravalli, B., Wei, Q. et al. SeWDReSS: on the design of an application independent, secure, wide-area disaster recovery storage system. Multimed Tools Appl 58, 543–568 (2012). https://doi.org/10.1007/s11042-011-0734-0

Download citation

Published: 23 February 2011
Issue Date: June 2012
DOI: https://doi.org/10.1007/s11042-011-0734-0

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

SeWDReSS: on the design of an application independent, secure, wide-area disaster recovery storage system

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

CRL: Efficient Concurrent Regeneration Codes with Local Reconstruction in Geo-Distributed Storage Systems

H-V: An Improved Coding Layout Based on Erasure Coded Storage System

Novel Data Placement Algorithm for Distributed Storage System Based on Fault-Tolerant Domain

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now