Abstract
With the continuous development of cloud computing and big data, data storage as a service in the cloud is becoming increasingly popular. More and more individuals and organizations are storing their data in the cloud rather than building their own data centers. Cloud storage offers high reliability, simple management, and cost-effectiveness. However, ensuring the privacy and availability of data stored in the cloud remains a challenge. In this paper, we design and implement a High Privacy and Availability Cloud Storage (HPACS) platform, built on Apache Hadoop, to improve data privacy and availability. A matrix encryption and decryption module is integrated into HDFS, through which data is transparently encoded to, and reconstructed from, different storage servers. Experimental results show that HPACS achieves high privacy and availability with reasonable write/read performance and storage capacity overhead compared with the original HDFS.
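The abstract does not specify the matrix scheme, so the following is only a minimal sketch of the general idea of matrix-based encoding across storage servers, not the HPACS implementation. It assumes a Vandermonde matrix over the prime field GF(257), in the style of information dispersal; the names `make_matrix`, `invert`, `encode`, and `decode`, and the parameters `n` (number of servers) and `k` (shares needed to reconstruct), are hypothetical and chosen for illustration. The sketch makes no claim about the security level of such an encoding.

```python
P = 257  # assumed prime modulus, just above the byte range 0..255


def make_matrix(n, k):
    # n x k Vandermonde matrix: row i is [x^0, x^1, ..., x^(k-1)] with x = i + 1.
    # Any k distinct rows form an invertible k x k matrix modulo P (for n < P).
    return [[pow(i + 1, j, P) for j in range(k)] for i in range(n)]


def invert(m):
    # Gauss-Jordan inversion of a square matrix modulo P.
    k = len(m)
    aug = [row[:] + [int(i == j) for j in range(k)] for i, row in enumerate(m)]
    for col in range(k):
        pivot = next(r for r in range(col, k) if aug[r][col] % P)
        aug[col], aug[pivot] = aug[pivot], aug[col]
        inv = pow(aug[col][col], -1, P)  # modular inverse, Python 3.8+
        aug[col] = [v * inv % P for v in aug[col]]
        for r in range(k):
            if r != col and aug[r][col]:
                f = aug[r][col]
                aug[r] = [(a - f * b) % P for a, b in zip(aug[r], aug[col])]
    return [row[k:] for row in aug]


def encode(data, n, k):
    # Split data into groups of k bytes (zero-padded) and multiply each group
    # by the n x k matrix. Share i would be stored on hypothetical server i.
    padded = data + bytes(-len(data) % k)
    groups = [padded[i:i + k] for i in range(0, len(padded), k)]
    matrix = make_matrix(n, k)
    return [[sum(m * b for m, b in zip(row, g)) % P for g in groups]
            for row in matrix]


def decode(available, k, length):
    # available maps server index -> share; any k entries are enough.
    ids = sorted(available)[:k]
    sub = [[pow(i + 1, j, P) for j in range(k)] for i in ids]
    inv = invert(sub)
    out = bytearray()
    for g in range(len(available[ids[0]])):
        col = [available[i][g] for i in ids]
        for row in inv:
            out.append(sum(a * b for a, b in zip(row, col)) % P)
    return bytes(out[:length])  # drop the zero padding
```

In this sketch, calling `encode(data, 5, 3)` yields five shares, and `decode` recovers the original bytes from any three of them, which illustrates how an encoding of this kind can tolerate the loss of some storage servers. Each share stores one value per group of `k` bytes, so the total storage overhead is roughly `n / k` times the original size.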
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
He, Y., Jiang, X., Ye, K., Ma, R., Li, X. (2013). HPACS: A High Privacy and Availability Cloud Storage Platform with Matrix Encryption. In: Wu, C., Cohen, A. (eds) Advanced Parallel Processing Technologies. APPT 2013. Lecture Notes in Computer Science, vol 8299. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-45293-2_10
DOI: https://doi.org/10.1007/978-3-642-45293-2_10
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-45292-5
Online ISBN: 978-3-642-45293-2
eBook Packages: Computer Science (R0)