Abstract
Cloud Storage service providers such as Amazon Simple Storage Service (S3) and Google Storage for Developers offer low-cost and highly available scale storage resource with a simple pay-as-you-go charging model. The cost of running storage systems on such a Cloud Storage service mainly depends on occupied storage space, number of requests and amount of data transfer. Traditional design of storage stack based on disk driver or tape didn’t consider cost as a system metric, hence it brings considerable optimization space for the design of storage system based on Cloud Storage.
In this paper we propose Saga, a user mode file system based on Cloud Storage service, that is designed to support POSIX interface with the goal of minimizing cost. Saga is specially designed under the cost efficient principle that minimizes occupied storage space by store-one-copy and copy-on-write strategies and minimizes number of requests by distinguishing objects loaded by write or read requests. Saga is also efficient from a performance perspective and utilizes parallel characteristics of Cloud Storage to boost the performance. Experimental results show that Saga is cost efficient and works well with general-purpose I/O workloads.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Fuse: Filesystem in userspace, http://fuse.sourceforge.net/
Brantner, M., Florescu, D., Graf, D., Kossmann, D., Kraska, T.: Building a database on s3. In: Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data, SIGMOD 2008, pp. 251–264. ACM, New York (2008)
Chiu, D., Shetty, A., Agrawal, G.: Elastic cloud caches for accelerating service-oriented computations. In: Proceedings of the 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2010, pp. 1–11. IEEE Computer Society, Washington, DC, USA (2010)
Kulkarni, P., Douglis, F., LaVoie, J., Tracey, J.M.: Redundancy elimination within large collections of files. In: Proceedings of the Annual Conference on USENIX Annual Technical Conference, ATEC 2004. USENIX Association, Berkeley (2004)
Palankar, M.R., Iamnitchi, A., Ripeanu, M., Garfinkel, S.: Amazon s3 for science grids: a viable solution? In: Proceedings of the 2008 International Workshop on Data-Aware Distributed Computing, DADC 2008, pp. 55–64. ACM, New York (2008)
Park, S.Y., Jung, D., Kang, J.u., Kim, J.s., Lee, J.: Cflru: a replacement algorithm for flash memory. In: Proceedings of the 2006 International Conference on Compilers, Architecture and Synthesis for Embedded Systems, CASES 2006, pp. 234–241. ACM, New York (2006)
Qureshi, M.K., Jaleel, A., Patt, Y.N., Steely, S.C., Emer, J.: Adaptive insertion policies for high performance caching. In: Proceedings of the 34th Annual International Symposium on Computer Architecture, ISCA 2007, pp. 381–391. ACM, New York (2007)
Rosenblum, M., Ousterhout, J.K.: The design and implementation of a log-structured file system. ACM Trans. Comput. Syst. 10, 26–52 (1992)
Vrable, M., Savage, S., Voelker, G.M.: Cumulus: filesystem backup to the cloud. ACM Transactions on Storage (TOS)Â 5(4) (December 2009)
Zhang, X., Li, C., Wang, H., Wang, D.: A cache replacement policy using adaptive insertion and re-reference prediction. In: Proceedings of the 2010 22nd International Symposium on Computer Architecture and High Performance Computing, SBAC-PAD 2010, pp. 95–102. IEEE Computer Society, Washington, DC, USA (2010)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Shi, W., Ju, D., Wang, D. (2012). Saga: A Cost Efficient File System Based on Cloud Storage Service. In: Vanmechelen, K., Altmann, J., Rana, O.F. (eds) Economics of Grids, Clouds, Systems, and Services. GECON 2011. Lecture Notes in Computer Science, vol 7150. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-28675-9_13
Download citation
DOI: https://doi.org/10.1007/978-3-642-28675-9_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-28674-2
Online ISBN: 978-3-642-28675-9
eBook Packages: Computer ScienceComputer Science (R0)