Abstract
Distributed NoSQL datastores have been developed to cater for the usage scenarios of Web 2.0 applications. These systems provide high availability through the replication of data across different machines and data centers. The performance characteristics of NoSQL datastores are determined by the degree of data replication and the consistency guarantees required by the application. This paper presents a novel performance study of the Cassandra NoSQL datastore deployed on the Amazon EC2 cloud platform. We show that a queueing Petri net model can scale to represent the characteristics of read workloads for different replication strategies and cluster sizes. We benchmark one Cassandra node and predict response times and throughput for these configurations. We study the relationship between cluster size and consistency guarantees on cluster performance and identify the effect that node capacity and configuration has on the overall performance of the cluster.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Amazon Web Services - Elastic Cloud Computing, http://aws.amazon.com
Bause, F.: Queueing Petri Nets–A Formalism for the Combined Qualitative and Quantitative Analysis of Systems. In: Fifth Intl Workshop Petri Nets and Performance Models (1993)
Cassandra-stress benchmarking tool, http://www.datastax.com/documentation/cassandra/1.2/cassandra/tools/toolsCStress_t.html
Cattell, R.: Scalable SQL and NoSQL Data Stores. SIGMOD Rec. 39(4), 12–27 (2011)
Cerotti, D., Gribaudo, M., Piazzolla, P., Serazzi, G.: End-to-End Performance of Multi-core Systems in Cloud Environments. In: Balsamo, M.S., Knottenbelt, W.J., Marin, A. (eds.) EPEW 2013. LNCS, vol. 8168, pp. 221–235. Springer, Heidelberg (2013)
Coulden, D., Osman, R., Knottenbelt, W.J.: Performance Modelling of Database Contention using Queueing Petri Nets. In: 4th ACM/SPEC International Conference on Performance Engineeering (2013)
Datastax Documentation Apache Cassandra 1.2 (2014), http://www.datastax.com/docs
Kounev, S., Spinner, S., Meier, P.: QPME 2.0 - A Tool for Stochastic Modeling and Analysis Using Queueing Petri Nets. In: Sachs, K., Petrov, I., Guerrero, P. (eds.) Buchmann Festschrift. LNCS, vol. 6462, pp. 293–311. Springer, Heidelberg (2010)
Kounev, S.: Performance Modeling and Evaluation of Distributed Component-based Systems using Queueing Petri Nets. IEEE Trans. Software Engineering 32(7), 486–502 (2006)
Kounev, S., Buchmann, A.: Performance Modelling of Distributed e-business Applications using Queuing Petri Nets. In: Proc. IEEE International Symposium on Performance Analysis of Systems and Software, pp. 143–155 (2003)
Kounev, S., et al.: Automated Simulation-Based Capacity Planning for Enterprise Data Fabrics. In: Proc. 4th SIMUTOOLS 2011, pp. 27–36 (2011)
Lakshman, A., Malik, P.: Cassandra: a Decentralized Structured Storage System. SIGOPS Operating Systems Review 44(2), 35–40 (2010)
Linden, G.: Make Data Useful (2006), http://www.gduchamp.com/media/StanfordDataMining.2006-11-28.pdf
Linden, G., Mayer, M.: Presented at Web 2.0 (2006), http://glinden.blogspot.com/2006/11/marissa-mayer-at-web-20.htm
Nou, R., Kounev, S., Julia, F., Torres, J.: Autonomic QoS Control in Enterprise Grid Environments using Online Simulation. Journal of Systems and Software 82(3), 486–502 (2009)
Omari, T., Derisavi, S., Franks, G., Woodside, M.: Performance Modeling of a Quorum Pattern in Layered Service Systems. In: QEST 2007 (2007)
Osman, R., Coulden, D., Knottenbelt, W.J.: Performance Modelling of Concurrency Control Schemes for Relational Databases. In: Dudin, A., De Turck, K. (eds.) ASMTA 2013. LNCS, vol. 7984, pp. 337–351. Springer, Heidelberg (2013)
Osman, R., Knottenbelt, W.J.: Database System Performance Evaluation Models: A Survey. Performance Evaluation 69(10), 471–493 (2012)
Schurman, E., Brutlag, J.: Performance Related Changes and their User Impact. Presented at Velocity Web Performance and Operations Conference (2009)
Stonebraker, M., Cattell, R.: 10 Rules for Scalable Performance in Simple Operation Datastores. Commun. ACM 54(6), 72–80 (2011)
Tadj, L., Rikli, N.E.: Matrix Analytic Solution to a Quorum Queueing System. Mathematical and Computer Modelling 32(3-4), 481–491 (2000)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Osman, R., Piazzolla, P. (2014). Modelling Replication in NoSQL Datastores. In: Norman, G., Sanders, W. (eds) Quantitative Evaluation of Systems. QEST 2014. Lecture Notes in Computer Science, vol 8657. Springer, Cham. https://doi.org/10.1007/978-3-319-10696-0_16
Download citation
DOI: https://doi.org/10.1007/978-3-319-10696-0_16
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-10695-3
Online ISBN: 978-3-319-10696-0
eBook Packages: Computer ScienceComputer Science (R0)