Skip to main content
Log in

Optimizing storage performance in public cloud platforms

  • Published:
Journal of Zhejiang University SCIENCE C Aims and scope Submit manuscript

Abstract

Cloud computing is an elastic computing model where users can lease computing and storage resources on demand from a remote infrastructure. It is gaining popularity due to its low cost, high reliability, and wide availability. With the emergence of public cloud storage platforms like Amazon, Microsoft, and Google, individual applications and enterprise storage are being deployed on Clouds. However, a serious impediment to its wider deployment is the relative lack of effective data management services. Our experiments, as well as industry reports, have shown that the performance and service-level agreement (SLA) cannot be guaranteed when the data is served over public Clouds. The relatively slow access to persistent data and large variability in cloud storage I/O performance can significantly degrade the performance of data-intensive applications. This paper addresses the issue of I/O performance fluctuation over public cloud platforms and we propose a middleware called CloudMW between the Cloud storage and clients to provide the storage services with better performance and SLA satisfaction. Some technologies, including data virtualization, data chunking, caching, and replication, are integrated into CloudMW to achieve a more stable and predictable performance, and permit flexible sharing of storage among the virtual machines (VMs). Experimental results based on Amazon Web Services (AWS) show that CloudMW is able to improve the stability and help provide better SLAs and data sharing for cloud storage.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Similar content being viewed by others

References

  • Amazon AWS, 2010. Amazon Web Services. Available from http://aws.amazon.com [Accessed on Sept. 21, 2010].

  • Amazon EBS, 2010. Amazon Elastic Block Store. Available from http://aws.amazon.com/ebs/ [Accessed on Sept. 21, 2010].

  • Amazon EC2, 2010. Amazon Elastic Compute Cloud. Available from http://aws.amazon.com/ec2 [Accessed on Sept. 21, 2010].

  • Amazon S3, 2010. Amazon Simple Storage Service. Available from http://aws.amazon.com/s3/ [Accessed on Sept. 21, 2010].

  • Armbrust, M., Fox, A., Griffith, R., Joseph, A.D., Katz, R.H., Konwinski, A., Lee, G., Patterson, D.A., Rabkin, A., Stoica, I., et al., 2009. Above the Clouds: a Berkeley View of Cloud Computing. Technical Report, No. UCB/EECS-2009-28, University of California, Berkeley, CA.

    Google Scholar 

  • Brantner, M., Florescu, D., Graf, D., Kossmann, D., Kraska, T., 2008. Building a Database on S3. Proc. ACM SIGMOD Int. Conf. on Management of Data, p.251–264. [doi:10.1145/1376616.1376645]

  • DeCandia, G., Hastorun, D., Jampani, M., Kakulapati, G., Lakshman, A., Pilchin, A., Sivasubramanian, S., Vosshall, P., Vogels, W., 2007. Dynamo: Amazon’s highly available key-value store. ACM SIGOPS Oper. Syst. Rev., 41(6):205–220. [doi:10.1145/1323293.1294281]

    Article  Google Scholar 

  • FUSE, 2010. Filesystem in Userspace. Available from http://fuse.sourceforge.net [Accessed on Aug. 21, 2010].

  • Gulati, A., Merchant, A., Varman, P., 2007. pClock: an Arrival Curve Based Approach for QoS in Shared Storage Systems. Proc. ACM SIGMETRICS Int. Conf. on Measurement and Modeling of Computer Systems, p.13–24. [doi:10.1145/1254882.1254885]

  • Gulati, A., Merchant, A., Varman, P., 2010. mClock: Handling Throughput Variability for Hypervisor IO Scheduling. 9th USENIX Symp. on Operating Systems Design and Implementation, p.1–7.

  • International Data Corporation, 2010. Citing Statistics Information. Available from http://www.idc.com/ [Accessed on Sept. 23, 2010].

  • Kubiatowicz, J., Bindel, D., Chen, Y., Czerwinski, S., Eaton, P., Geels, D., Gummadi, R., Rhea, S., Weatherspoon, H., Weimer, W., et al., 2000. OceanStore: an Architecture for Global-Scale Persistent Storage Platforms. Proc. 9th Int. Conf. on Architectural Support for Programming Languages and Operating Systems, p.190–201. [doi:10.1145/378993.379239]

  • Lim, H.C., Babu, S., Chase, J.S., 2010. Automated Control for Elastic Storage. Proc. 7th Int. Conf. on Autonomic Computing, p.1–10. [doi:10.1145/1809049.1809051]

  • Mahajan, P., Setty, S., Lee, S., Clement, A., Alvisi, L., Dahlin, M., Walfish, M., 2010. Depot: Cloud Storage with Minimal Trust. 9th USENIX Symp. on Operating Systems Design and Implementation, p.1–12.

  • Palankar, M.R., Iamnitchi, A., Ripeanu, M., Garfinkel, S., 2008. Amazon S3 for Science Grids: a Viable Solution? Proc. Int. Workshop on Data-Aware Distributed Computing, p.1–9. [doi:10.1145/1383519.1383526]

  • Postmark, 1997. Postmark: a New File System Benchmark. Available from http://packages.debian.org/stable/utils/postmark [Accessed on Aug. 21, 2010].

  • VMware ESX, 2010. VMware ESX and ESXi, Bare-Metal Hypervisor for Virtual Machines. Available from http://www.vmware.com/products/esx/ [Accessed on Aug. 21, 2010].

  • Walker, E., 2008. Benchmarking Amazon EC2 for highperformance scientic computing. USENIX Log. Mag., 33(5):18–23.

    Google Scholar 

  • Wang, G.H., Ng, T.S.E., 2010. The Impact of Virtulalization on Network Performance of Amazon EC2 Data Center. INFOCOM, p.1–9. [doi:10.1109/INFCOM.2010.5461931]

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Chang-sheng Xie.

Additional information

Project supported by the National Basic Research Program (973) of China (No. 2011CB302303), the National High-Tech R&D Program (863) of China (No. 2009AA01A402), the National Natural Science Foundation of China (No. 60933002), the Chenguang Plan of Wuhan, China (No. 201050231073), the Innovation Plan of WNLO, and the National Science Foundation of USA (Nos. CNS-0917157, CNS-0615376, and CNS-0541369)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Wang, Jz., Varman, P. & Xie, Cs. Optimizing storage performance in public cloud platforms. J. Zhejiang Univ. - Sci. C 12, 951–964 (2011). https://doi.org/10.1631/jzus.C1100097

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1631/jzus.C1100097

Key words

CLC number

Navigation