Abstract
In this paper, we introduce InterCloud RAIDer, a multi-cloud private data backup system composed of three modules: (i) data deduplication, to reduce the overall storage overhead; (ii) erasure coding, to achieve redundancy at low overhead, with the encoded data dispersed across multiple cloud services to provide fault tolerance against individual service providers (specifically, we use non-systematic instances of erasure codes to provide a basic level of privacy from individual cloud stores); and (iii) a proof of data possession mechanism, to detect misbehaving services, whose implementation we optimize by exploiting the hash digests created in the prior deduplication phase. Apart from the uniqueness and non-triviality of putting these modules together, the system design also had to deal with artefacts and heterogeneity across the different cloud storage services we used, namely Dropbox, Google Drive and SkyDrive.
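The three modules named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: fixed-size chunking, SHA-256, the toy (3, 2) code over the prime field GF(257), and the function names dedup_chunks, encode, decode and possession_ok are all assumptions made for illustration, and the possession check is a simple digest comparison that stands in for a real proof of data possession protocol.

```python
import hashlib

P = 257  # assumed field size: smallest prime above 255, so every byte is a field element


def dedup_chunks(data: bytes, size: int = 4):
    """Split data into fixed-size chunks and keep each distinct chunk once, keyed by digest."""
    store, recipe = {}, []
    for i in range(0, len(data), size):
        chunk = data[i:i + size]
        digest = hashlib.sha256(chunk).hexdigest()
        store.setdefault(digest, chunk)  # a repeated chunk is stored only once
        recipe.append(digest)            # ordered digests rebuild the original data
    return store, recipe


def encode(chunk: bytes, n: int = 3):
    """Toy non-systematic (n, 2) code: fragment x holds a + x*b mod P per symbol pair (a, b)."""
    padded = chunk + b"\x00" * (len(chunk) % 2)  # pad to an even number of symbols
    pairs = list(zip(padded[0::2], padded[1::2]))
    return {x: [(a + x * b) % P for a, b in pairs] for x in range(1, n + 1)}


def decode(fragments: dict, length: int) -> bytes:
    """Recover the chunk from any two fragments by solving a 2x2 linear system mod P."""
    (x1, f1), (x2, f2) = list(fragments.items())[:2]
    inv = pow(x1 - x2, -1, P)  # modular inverse (Python 3.8+)
    out = bytearray()
    for y1, y2 in zip(f1, f2):
        b = ((y1 - y2) * inv) % P
        a = (y1 - x1 * b) % P
        out += bytes([a, b])  # raises ValueError if a symbol is not a valid byte
    return bytes(out[:length])


def possession_ok(digest: str, fragments: dict, length: int) -> bool:
    """Simplified check: decoded fragments must hash to the digest kept from deduplication."""
    try:
        chunk = decode(fragments, length)
    except ValueError:  # a tampered fragment decoded to a symbol outside the byte range
        return False
    return hashlib.sha256(chunk).hexdigest() == digest
```

In this sketch each of the three fragments would go to a different cloud service. Every fragment stores only linear combinations of the data symbols rather than a verbatim copy, any two fragments suffice to rebuild a chunk, and the digest already computed for deduplication is reused to check what the services return.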
© 2014 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ling, C.W., Datta, A. (2014). InterCloud RAIDer: A Do-It-Yourself Multi-cloud Private Data Backup System. In: Chatterjee, M., Cao, Jn., Kothapalli, K., Rajsbaum, S. (eds) Distributed Computing and Networking. ICDCN 2014. Lecture Notes in Computer Science, vol 8314. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-45249-9_30
DOI: https://doi.org/10.1007/978-3-642-45249-9_30
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-45248-2
Online ISBN: 978-3-642-45249-9
eBook Packages: Computer Science (R0)