ABSTRACT
In this paper we describe the architecture of a quality-of-service (QoS) infrastructure for achieving controlled application performance over NoSQL distributed storage systems. We present an implementation of our architecture as an extension to the Apache Cassandra storage system and provide results from a preliminary evaluation using the Yahoo Cloud Serving Benchmark (YCSB). Along the way we also present details of an ongoing alternative implementation of our QoS infrastructure in the context of the Apache HBase storage system. Our evaluation provides evidence that our QoS infrastructure can achieve the type of controlled performance required by data intensive performance-critical applications.
- Amazon Web Services. DynamoDB. http://aws.amazon.com/dynamodb/, August 2012.Google Scholar
- Apache Software Foundation. HBase. http://hbase.apache.org/, August 2012.Google Scholar
- D. Chambliss et al. Performance virtualization for large-scale storage systems. In Proceedings of the Symposium on Reliable Distributed Systems (SRDS), Florence, Italy, 2003.Google ScholarCross Ref
- F. Chang et al. Bigtable: A distributed storage system for structured data. ACM Transactions on Computer Systems (TOCS), 26(2):1--26, 2008. Google ScholarDigital Library
- B. F. Cooper et al. Benchmarking cloud serving systems with YCSB. In Proceedings of the 1st ACM Symposium on Cloud computing (SoCC '10), Indianapolis, IN, June 2010. Google ScholarDigital Library
- J. Dean and S. Ghemawat. Mapreduce: Simplified data processing on large clusters. Communications of the ACM, 51(1):107--113, 2008. Google ScholarDigital Library
- G. DeCandia et al. Dynamo: Amazon's highly available key-value store. In Proceedings of 21st ACM Symposium on Operating Systems Principles, Stevenson, WA, October 2007. Google ScholarDigital Library
- P. Goyal, D. Jadav, D. S. Modha, and R. Tewari. CacheCOW: QoS for Storage System Caches. In Proceedings of 11th International Workshop on Quality of Service (IWQoS 03), Monterey, CA, June 2003. Google ScholarDigital Library
- A. Lakshman and P. Malik. Cassandra: A decentralized structured storage system. In Proceedings of 3rd ACM SIGOPS International Workshop on Large Scale Distributed Systems and Middleware (LADIS), Big Sky, MT, October 2009.Google Scholar
- K. Magoutis, P. Sarkar, and G. Shah. OASIS: Self-Tuning Storage for Applications. In Proceedings of 23rd IEEE Conference on Mass Storage Systems and Technologies (MSST), College Park, MD, May 2006.Google Scholar
- P. O'Neil, E. Cheng, D. Gawlick, and E. O'Neil. The log-structured merge-tree (lsm-tree). Acta Informatica, 33(4):351--385, 1996. Google ScholarDigital Library
- J. Wilkes. Traveling to Rome: A retrospective on the journey. Operating Systems Review (OSR), 43(1):10--15, January 2009. Google ScholarDigital Library
Index Terms
- Managing service performance in NoSQL distributed storage systems
Recommendations
Cache Support in a High Performance Fault-Tolerant Distributed Storage System for Cloud and Big Data
IPDPSW '15: Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium WorkshopDue to the trends towards Big Data and Cloud Computing, one would like to provide large storage systems that are accessible by many servers. A shared storage can, however, become a performance bottleneck and a single-point of failure. Distributed ...
Comments