Abstract
We report our experience using PRISM, a leading quantitative verification engine, to formulate a performance model of Apache Cassandra, a popular NoSQL database, under a simple form of hybrid operational/analytical workload, since such heterogeneous workloads have shown to have significant implications for the deployment and elastic strategies of these databases. Some current literature suggest that, compared to classical performance modelling tools, quantitative verification provides a more rigorous analysis framework. We aim to explore the effectiveness and applicability of this approach in practice which we identify as relevant to our use case. We present a partial model of a single Cassandra node that predicts its maximum throughput under various system and workload parameters and validate this model experimentally. Furthermore, we show the limitations of extending this model using PRISM to address other interesting properties, identifying the need for scalable analytical modelling approaches for realistic highly concurrent systems under heterogeneous workloads.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Cruz, F., et al.: MeT: workload aware elasticity for NoSQL. In: Proceedings of the 8th European Conference on Computer Systems - EuroSys, pp. 183–196 (2013)
Kassela, E., Boumpouka, C., Konstantinou, I., Koziris, N.: Automated workload-aware elasticity of NoSQL clusters in the cloud. In: 2014 IEEE International Conference on Big Data (Big Data) (2014). https://doi.org/10.1109/bigdata.2014.7004232
Anwar, A., Cheng, Y., Gupta, A., Butt, A.R.: MOS: workload-aware elasticity for cloud object stores. In: Proceedings of the 25th ACM International Symposium on High-Performance Parallel and Distributed Computing, pp. 177–188 (2016)
Becker, M., Luckey, M., Becker, S.: Model-driven performance engineering of self-adaptive systems: a survey. In: Proceedings of the 8th international ACM SIGSOFT Conference on Quality of Software Architectures, QoSA 2012, pp. 117–122 (2012)
Kwiatkowska, M., Norman, G., Parker, D.: PRISM: probabilistic model checking for performance and reliability analysis. ACM SIGMETRICS Perform. Eval. Rev. 36, 40–45 (2009)
Berczes, T., Guta, G., Kusper, G., Schreiner, W., Sztrik, J.: Comparing the performance modeling environment MOSEL and the probabilistic model checker PRISM for modeling and analyzing retrial queueing systems. Technical report no 07–17 in RISC Report Series, Research Institute for Symbolic Computation (RISC), Johannes Kepler University Linz, Austria (2007)
Calinescu, R., Grunske, L., Kwiatkowska, M., Mirandola, R., Tamburrelli, G.: Dynamic QoS management and optimization in service-based systems. IEEE Trans. Softw. Eng. 37, 387–409 (2011)
Naskos, A., Gounaris, A., Katsaros, P.: Cost-aware horizontal scaling of NoSQL databases using probabilistic model checking. Cluster Comput. 20, 2687–2701 (2017)
Bérczes T., Guta, G., Kusper, G., Schreiner, W., Sztrik, J.: Analyzing a proxy cache server performance model with the probabilistic model checker PRISM. In: Automated Specification and Verification of Web Systems (2009)
Rabl, T., Gómez-Villamor, S., Sadoghi, M., Muntés-Mulero, V., Jacobsen, H.-A., Mankovskii, S.: Solving big data challenges for enterprise application performance management. Proc. VLDB Endow. 5, 1724–1735 (2012)
Cooper, B.F., Silberstein, A., Tam, E., Ramakrishnan, R., Sears, R.: Benchmarking cloud serving systems with YCSB. In: Proceedings of the 1st ACM Symposium on Cloud Computing - SoCC 2010 (2010). https://doi.org/10.1145/1807128.1807152
Benchmarking Top NoSQL Databases: Apache Cassandra, Couchbase, HBase, and MongoDB [Internet]. End Point Corporation, 2015 April. https://www.datastax.com/wp-content/themes/datastax-2014-08/files/NoSQL_Benchmarks_EndPoint.pdf
Dipietro, S., Casale, G., Serazzi, G.: A queueing network model for performance prediction of Apache Cassandra. In: Proceedings of the 10th EAI International Conference on Performance Evaluation Methodologies and Tools (2017). https://doi.org/10.4108/eai.25-10-2016.2266606
Gandini, A., Gribaudo, M., Knottenbelt, W.J., Osman, R., Piazzolla, P.: Performance evaluation of NoSQL databases. In: Horváth, A., Wolter, K. (eds.) EPEW 2014. LNCS, vol. 8721, pp. 16–29. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-10885-8_2
Huang, X., Wang, J., Qiao, J., Zheng, L., Zhang, J., Wong, R.K.: Performance and replica consistency simulation for quorum-based NoSQL system Cassandra. In: van der Aalst, W., Best, E. (eds.) PETRI NETS 2017. LNCS, vol. 10258, pp. 78–98. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-57861-3_6
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Switzerland AG
About this paper
Cite this paper
Isstaif, A.A.T., Alhafez, N. (2018). Performance Model of Apache Cassandra Under Heterogeneous Workload Using the Quantitative Verification Approach. In: Bakhshi, R., Ballarini, P., Barbot, B., Castel-Taleb, H., Remke, A. (eds) Computer Performance Engineering. EPEW 2018. Lecture Notes in Computer Science(), vol 11178. Springer, Cham. https://doi.org/10.1007/978-3-030-02227-3_7
Download citation
DOI: https://doi.org/10.1007/978-3-030-02227-3_7
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-02226-6
Online ISBN: 978-3-030-02227-3
eBook Packages: Computer ScienceComputer Science (R0)