Skip to main content

Modelling Replication in NoSQL Datastores

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 8657))

Abstract

Distributed NoSQL datastores have been developed to cater for the usage scenarios of Web 2.0 applications. These systems provide high availability through the replication of data across different machines and data centers. The performance characteristics of NoSQL datastores are determined by the degree of data replication and the consistency guarantees required by the application. This paper presents a novel performance study of the Cassandra NoSQL datastore deployed on the Amazon EC2 cloud platform. We show that a queueing Petri net model can scale to represent the characteristics of read workloads for different replication strategies and cluster sizes. We benchmark one Cassandra node and predict response times and throughput for these configurations. We study the relationship between cluster size and consistency guarantees on cluster performance and identify the effect that node capacity and configuration has on the overall performance of the cluster.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Amazon Web Services - Elastic Cloud Computing, http://aws.amazon.com

  2. Bause, F.: Queueing Petri Nets–A Formalism for the Combined Qualitative and Quantitative Analysis of Systems. In: Fifth Intl Workshop Petri Nets and Performance Models (1993)

    Google Scholar 

  3. Cassandra-stress benchmarking tool, http://www.datastax.com/documentation/cassandra/1.2/cassandra/tools/toolsCStress_t.html

  4. Cattell, R.: Scalable SQL and NoSQL Data Stores. SIGMOD Rec. 39(4), 12–27 (2011)

    Article  Google Scholar 

  5. Cerotti, D., Gribaudo, M., Piazzolla, P., Serazzi, G.: End-to-End Performance of Multi-core Systems in Cloud Environments. In: Balsamo, M.S., Knottenbelt, W.J., Marin, A. (eds.) EPEW 2013. LNCS, vol. 8168, pp. 221–235. Springer, Heidelberg (2013)

    Chapter  Google Scholar 

  6. Coulden, D., Osman, R., Knottenbelt, W.J.: Performance Modelling of Database Contention using Queueing Petri Nets. In: 4th ACM/SPEC International Conference on Performance Engineeering (2013)

    Google Scholar 

  7. Datastax Documentation Apache Cassandra 1.2 (2014), http://www.datastax.com/docs

  8. Kounev, S., Spinner, S., Meier, P.: QPME 2.0 - A Tool for Stochastic Modeling and Analysis Using Queueing Petri Nets. In: Sachs, K., Petrov, I., Guerrero, P. (eds.) Buchmann Festschrift. LNCS, vol. 6462, pp. 293–311. Springer, Heidelberg (2010)

    Chapter  Google Scholar 

  9. Kounev, S.: Performance Modeling and Evaluation of Distributed Component-based Systems using Queueing Petri Nets. IEEE Trans. Software Engineering 32(7), 486–502 (2006)

    Article  Google Scholar 

  10. Kounev, S., Buchmann, A.: Performance Modelling of Distributed e-business Applications using Queuing Petri Nets. In: Proc. IEEE International Symposium on Performance Analysis of Systems and Software, pp. 143–155 (2003)

    Google Scholar 

  11. Kounev, S., et al.: Automated Simulation-Based Capacity Planning for Enterprise Data Fabrics. In: Proc. 4th SIMUTOOLS 2011, pp. 27–36 (2011)

    Google Scholar 

  12. Lakshman, A., Malik, P.: Cassandra: a Decentralized Structured Storage System. SIGOPS Operating Systems Review 44(2), 35–40 (2010)

    Article  Google Scholar 

  13. Linden, G.: Make Data Useful (2006), http://www.gduchamp.com/media/StanfordDataMining.2006-11-28.pdf

  14. Linden, G., Mayer, M.: Presented at Web 2.0 (2006), http://glinden.blogspot.com/2006/11/marissa-mayer-at-web-20.htm

  15. Nou, R., Kounev, S., Julia, F., Torres, J.: Autonomic QoS Control in Enterprise Grid Environments using Online Simulation. Journal of Systems and Software 82(3), 486–502 (2009)

    Article  Google Scholar 

  16. Omari, T., Derisavi, S., Franks, G., Woodside, M.: Performance Modeling of a Quorum Pattern in Layered Service Systems. In: QEST 2007 (2007)

    Google Scholar 

  17. Osman, R., Coulden, D., Knottenbelt, W.J.: Performance Modelling of Concurrency Control Schemes for Relational Databases. In: Dudin, A., De Turck, K. (eds.) ASMTA 2013. LNCS, vol. 7984, pp. 337–351. Springer, Heidelberg (2013)

    Chapter  Google Scholar 

  18. Osman, R., Knottenbelt, W.J.: Database System Performance Evaluation Models: A Survey. Performance Evaluation 69(10), 471–493 (2012)

    Article  Google Scholar 

  19. Schurman, E., Brutlag, J.: Performance Related Changes and their User Impact. Presented at Velocity Web Performance and Operations Conference (2009)

    Google Scholar 

  20. Stonebraker, M., Cattell, R.: 10 Rules for Scalable Performance in Simple Operation Datastores. Commun. ACM 54(6), 72–80 (2011)

    Article  Google Scholar 

  21. Tadj, L., Rikli, N.E.: Matrix Analytic Solution to a Quorum Queueing System. Mathematical and Computer Modelling 32(3-4), 481–491 (2000)

    Article  MATH  MathSciNet  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Osman, R., Piazzolla, P. (2014). Modelling Replication in NoSQL Datastores. In: Norman, G., Sanders, W. (eds) Quantitative Evaluation of Systems. QEST 2014. Lecture Notes in Computer Science, vol 8657. Springer, Cham. https://doi.org/10.1007/978-3-319-10696-0_16

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-10696-0_16

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-10695-3

  • Online ISBN: 978-3-319-10696-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics