Skip to main content

Performance Model of Apache Cassandra Under Heterogeneous Workload Using the Quantitative Verification Approach

  • Conference paper
  • First Online:
Computer Performance Engineering (EPEW 2018)

Part of the book series: Lecture Notes in Computer Science ((LNPSE,volume 11178))

Included in the following conference series:

  • 410 Accesses

Abstract

We report our experience using PRISM, a leading quantitative verification engine, to formulate a performance model of Apache Cassandra, a popular NoSQL database, under a simple form of hybrid operational/analytical workload, since such heterogeneous workloads have shown to have significant implications for the deployment and elastic strategies of these databases. Some current literature suggest that, compared to classical performance modelling tools, quantitative verification provides a more rigorous analysis framework. We aim to explore the effectiveness and applicability of this approach in practice which we identify as relevant to our use case. We present a partial model of a single Cassandra node that predicts its maximum throughput under various system and workload parameters and validate this model experimentally. Furthermore, we show the limitations of extending this model using PRISM to address other interesting properties, identifying the need for scalable analytical modelling approaches for realistic highly concurrent systems under heterogeneous workloads.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Cruz, F., et al.: MeT: workload aware elasticity for NoSQL. In: Proceedings of the 8th European Conference on Computer Systems - EuroSys, pp. 183–196 (2013)

    Google Scholar 

  2. Kassela, E., Boumpouka, C., Konstantinou, I., Koziris, N.: Automated workload-aware elasticity of NoSQL clusters in the cloud. In: 2014 IEEE International Conference on Big Data (Big Data) (2014). https://doi.org/10.1109/bigdata.2014.7004232

  3. Anwar, A., Cheng, Y., Gupta, A., Butt, A.R.: MOS: workload-aware elasticity for cloud object stores. In: Proceedings of the 25th ACM International Symposium on High-Performance Parallel and Distributed Computing, pp. 177–188 (2016)

    Google Scholar 

  4. Becker, M., Luckey, M., Becker, S.: Model-driven performance engineering of self-adaptive systems: a survey. In: Proceedings of the 8th international ACM SIGSOFT Conference on Quality of Software Architectures, QoSA 2012, pp. 117–122 (2012)

    Google Scholar 

  5. Kwiatkowska, M., Norman, G., Parker, D.: PRISM: probabilistic model checking for performance and reliability analysis. ACM SIGMETRICS Perform. Eval. Rev. 36, 40–45 (2009)

    Article  Google Scholar 

  6. Berczes, T., Guta, G., Kusper, G., Schreiner, W., Sztrik, J.: Comparing the performance modeling environment MOSEL and the probabilistic model checker PRISM for modeling and analyzing retrial queueing systems. Technical report no 07–17 in RISC Report Series, Research Institute for Symbolic Computation (RISC), Johannes Kepler University Linz, Austria (2007)

    Google Scholar 

  7. Calinescu, R., Grunske, L., Kwiatkowska, M., Mirandola, R., Tamburrelli, G.: Dynamic QoS management and optimization in service-based systems. IEEE Trans. Softw. Eng. 37, 387–409 (2011)

    Article  Google Scholar 

  8. Naskos, A., Gounaris, A., Katsaros, P.: Cost-aware horizontal scaling of NoSQL databases using probabilistic model checking. Cluster Comput. 20, 2687–2701 (2017)

    Article  Google Scholar 

  9. Bérczes T., Guta, G., Kusper, G., Schreiner, W., Sztrik, J.: Analyzing a proxy cache server performance model with the probabilistic model checker PRISM. In: Automated Specification and Verification of Web Systems (2009)

    Google Scholar 

  10. Rabl, T., Gómez-Villamor, S., Sadoghi, M., Muntés-Mulero, V., Jacobsen, H.-A., Mankovskii, S.: Solving big data challenges for enterprise application performance management. Proc. VLDB Endow. 5, 1724–1735 (2012)

    Article  Google Scholar 

  11. Cooper, B.F., Silberstein, A., Tam, E., Ramakrishnan, R., Sears, R.: Benchmarking cloud serving systems with YCSB. In: Proceedings of the 1st ACM Symposium on Cloud Computing - SoCC 2010 (2010). https://doi.org/10.1145/1807128.1807152

  12. Benchmarking Top NoSQL Databases: Apache Cassandra, Couchbase, HBase, and MongoDB [Internet]. End Point Corporation, 2015 April. https://www.datastax.com/wp-content/themes/datastax-2014-08/files/NoSQL_Benchmarks_EndPoint.pdf

  13. Dipietro, S., Casale, G., Serazzi, G.: A queueing network model for performance prediction of Apache Cassandra. In: Proceedings of the 10th EAI International Conference on Performance Evaluation Methodologies and Tools (2017). https://doi.org/10.4108/eai.25-10-2016.2266606

  14. Gandini, A., Gribaudo, M., Knottenbelt, W.J., Osman, R., Piazzolla, P.: Performance evaluation of NoSQL databases. In: Horváth, A., Wolter, K. (eds.) EPEW 2014. LNCS, vol. 8721, pp. 16–29. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-10885-8_2

    Chapter  Google Scholar 

  15. Huang, X., Wang, J., Qiao, J., Zheng, L., Zhang, J., Wong, R.K.: Performance and replica consistency simulation for quorum-based NoSQL system Cassandra. In: van der Aalst, W., Best, E. (eds.) PETRI NETS 2017. LNCS, vol. 10258, pp. 78–98. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-57861-3_6

    Chapter  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Al Amjad Tawfiq Isstaif .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Isstaif, A.A.T., Alhafez, N. (2018). Performance Model of Apache Cassandra Under Heterogeneous Workload Using the Quantitative Verification Approach. In: Bakhshi, R., Ballarini, P., Barbot, B., Castel-Taleb, H., Remke, A. (eds) Computer Performance Engineering. EPEW 2018. Lecture Notes in Computer Science(), vol 11178. Springer, Cham. https://doi.org/10.1007/978-3-030-02227-3_7

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-02227-3_7

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-02226-6

  • Online ISBN: 978-3-030-02227-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics