Skip to main content
Log in

Architectural assessment of NoSQL and NewSQL systems

  • Published:
Distributed and Parallel Databases Aims and scope Submit manuscript

Abstract

With the recent trend towards big data, a number of scalable data management systems: NoSQL and NewSQL are developed to manage massive data effectively. The algorithms involved in the architectural design of a data management system defines the response time of an application. The behavior and performance of different NoSQL and NewSQL systems vary on the basis of these architectural aspects. Hence, the architectural assessment of a data management system is a vital task to perform in order to understand their weaknesses and strengths. Therefore, this paper assesses the architecture of some well-known NoSQL and NewSQL systems in detail. To enhance the clarity of discussion and analysis, we identified and grouped together the logically related architectural features, forming a feature vector (FV). Feature vectors related to transactional properties, fault tolerance, data storage, and data handling are designed and involved in architectural assessment. Various significant features are identified and assigned to a feature vector. Some well-known NoSQL and NewSQL systems are analyzed, compared, and discussed in depth with respect to these feature vectors. The discussion involves describing the algorithms used in implementation of a particular architectural feature by each of the systems and their suitability analysis in various scenarios. Important guidelines are presented that helps in filtering the potential data management systems on the basis of application requirements.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16
Fig. 17
Fig. 18
Fig. 19

Similar content being viewed by others

References

  1. Abramova, V., Bernardino, J.: Nosql databases: Mongodb vs cassandra. In: Proceedings of the International C* Conference on Computer Science and Software Engineering, pp. 14–22. ACM (2013)

  2. Aerospike high performance nosql database. [online] http://www.aerospike.com/

  3. Aggarwal, D., Roopam, S.: Emerging technologies for big data processing: Nosql and newsql data stores

  4. Al-Saeedi, B.: Factors influencing the database selection for b2c web applications

  5. Amirian, P., Basiri, A., Winstanley, A.: Efficient online sharing of geospatial big data using nosql xml databases. In: Computing for Geospatial Research and Application (COM. Geo), 2013 Fourth International Conference on, pp. 152–152. IEEE (2013)

  6. Amirian, P., Basiri, A., Winstanley, A.: Evaluation of data management systems for geospatial big data. In: International Conference on Computational Science and Its Applications, pp. 678–690. Springer (2014)

  7. Andersson, E., Berggren, Z.: A comparison between mongodb and mysql document store considering performance (2017)

  8. Angles, R.: A comparison of current graph database models. In: Data Engineering Workshops (ICDEW), 2012 IEEE 28th International Conference on, pp. 171–177. IEEE (2012)

  9. Apache accumulo. [online] https://accumulo.apache.org/

  10. Apache cassandra. [online] http://cassandra.apache.org/

  11. Apache couchdb. [online] http://couchdb.apache.org/

  12. Apache hbase – apache hbase home. [online] https://hbase.apache.org/

  13. Bailis, P., Ghodsi, A.: Eventual consistency today: limitations, extensions, and beyond. Commun. ACM 56(5), 55–63 (2013)

    Article  Google Scholar 

  14. Barber, R.J., Herbert, D.M., Mohan, C., Somani, A., Watts, S.J., Zaharioudakis, M.: Data recovery in a transactional database using write-ahead logging and file caching (2001). US Patent 6,173,292

  15. Bermbach, D., Kuhlenkamp, J.: Consistency in distributed storage systems. In: Networked Systems, pp. 175–189. Springer (2013)

  16. Best database for integrating data from silos | marklogic. [online] http://www.marklogic.com/

  17. Bhamra, K.: A comparative analysis of mongodb and cassandra. Master’s thesis, The University of Bergen (2018)

  18. Bhatt, P.: Mongodb vs redis: Critical analysis and comparison

  19. Bhatt, P.: Performance comparison between column store nosql databases

  20. Bigtable - scalable nosql database service | google cloud platform. [online] https://cloud.google.com/bigtable/

  21. Bommena, S.: Cyclic redundancy check (crc) (2008)

  22. Buerli, M., Obispo, C.: The current state of graph databases. Department of Computer Science, Cal Poly San Luis Obispo, mbuerli@ calpoly. edu pp. 1–7 (2012)

  23. Caldarola, E.G., Rinaldi, A.M.: Big data: A survey the new paradigms, methodologies and tools

  24. Castruccio, S., Genton, M.G.: Principles for statistical inference on big spatio-temporal data from climate models. Stat. Probab. Lett. 136, 92–98 (2018)

    Article  MathSciNet  MATH  Google Scholar 

  25. Cattell, R.: Scalable sql and nosql data stores. ACM Sigmod Record 39(4), 12–27 (2011)

    Article  Google Scholar 

  26. Cloud data warehouse | home | snowflake. [online] http://www.snowflake.net/

  27. Cloudant. [online] https://cloudant.com/

  28. Clustrix - scale-out rdbms. [online] http://www.clustrix.com,

  29. Collet, Y.: Lz4: Extremely fast compression algorithm. code. google. com (2013)

  30. Database administrators stack exchange. [online] https://dba.stackexchange.com/

  31. Davidson, S.B., Garcia-Molina, H., Skeen, D.: Consistency in a partitioned network: a survey. ACM Comput. Surv. (CSUR) 17(3), 341–370 (1985)

    Article  Google Scholar 

  32. Davoudian, A., Chen, L., Liu, M.: A survey on nosql stores. ACM Comput. Surv. (CSUR) 51(2), 1–43 (2018)

    Article  Google Scholar 

  33. Db-engines ranking - popularity ranking of database management systems. [online] http://db-engines.com/en/ranking/

  34. Debnath, B., Sengupta, S., Li, J.: Flashstore: high throughput persistent key-value store. Proc. VLDB Endow. 3(1–2), 1414–1425 (2010)

    Article  Google Scholar 

  35. Deri, L., Mainardi, S., Fusco, F.: tsdb: A compressed database for time series. In: International Workshop on Traffic Monitoring and Analysis, pp. 143–156. Springer (2012)

  36. Dgraph labs. [online] https://dgraph.io/

  37. Dziedzic, A., Duggan, J., Elmore, A.J., Gadepally, V., Stonebraker, M.: Bigdawg: a polystore for diverse interactive applications. In: Data Syst. Interactive Anal. Workshop 2015 (2015)

  38. Ediger, D., Jiang, K., Riedy, E.J., Bader, D.A.: Graphct: multithreaded algorithms for massive graph analysis. IEEE Trans. Parallel Distrib. Syst. 24(11), 2220–2229 (2013)

    Article  Google Scholar 

  39. Ehcache. [online] http://www.ehcache.org/

  40. Fox, A., Eichelberger, C., Hughes, J., Lyon, S.: Spatio-temporal indexing in non-relational distributed databases. In: Big Data, 2013 IEEE International Conference on, pp. 291–299. IEEE (2013)

  41. Gajendran, S.K.: A survey on nosql databases. University of Illinois (2012)

  42. Gessert, F., Wingerath, W., Friedrich, S., Ritter, N.: Nosql database systems: a survey and decision guidance. Computer Science-Research and Development pp. 1–13 (2016)

  43. Ghrab, A., Romero, O., Skhiri, S., Zimányi, E.: Analytics-aware graph database modeling. Tech. rep, Technical report (2014)

  44. Ghrab, A., Skhiri, S., Jouili, S., Zimányi, E.: An analytics-aware conceptual model for evolving graphs. In: International Conference on Data Warehousing and Knowledge Discovery, pp. 1–12. Springer (2013)

  45. Giraph - welcome to apache giraph! [online] http://giraph.apache.org/

  46. Graphbase - the world’s most powerful graph dbms. [online] http://graphbase.net/

  47. Grolinger, K., Higashino, W.A., Tiwari, A., Capretz, M.A.: Data management in cloud environments: Nosql and newsql data stores. Journal of Cloud Computing: Advances, Systems and Applications 2(1), 1 (2013)

    Google Scholar 

  48. Gurevich, Y.: Comparative survey of nosql/newsql db systems. Ph.D. thesis, The Open University (2015)

  49. Gustavsson, S., Andler, S.F.: Self-stabilization and eventual consistency in replicated real-time databases. In: Proceedings of the first workshop on Self-healing systems, pp. 105–107. ACM (2002)

  50. Hajoui, O., Dehbi, R., Talea, M., Batouta, Z.I.: An advanced comparative study of the most promising nosql and newsql databases with a multi-criteria analysis method. J. Theor. Appl. Inf. Technol. 81(3), 579 (2015)

    Google Scholar 

  51. Home | hypertable - big data. big performance. [online] http://hypertable.org/

  52. Home | scylladb. [online] http://www.scylladb.com/

  53. Hoque, I., Gupta, I.: Lfgraph: Simple and fast distributed graph analytics. In: Proceedings of the First ACM SIGOPS Conference on Timely Results in Operating Systems, p. 9. ACM (2013)

  54. [online] https://www.quora.com/What-is-a-transaction-and-ACID-properties-in-DBMS

  55. Infinitegraph. [online] www.objectivity.com/products/infinitegraph/

  56. Infogrid web graph database. [online] http://infogrid.org/

  57. Jomeiri, A., Shamsi, M., Kazemi, E.: Comparative study of column oriented nosql databases on characteristics. Int. J. Enhanc. Res. Sci. Technol. Eng. 4(4), 118–124 (2015)

    Google Scholar 

  58. Kalid, S., Syed, A., Mohammad, A., Halgamuge, M.N.: Big-data nosql databases: A comparison and analysis of “big-table”, “dynamodb”, and “cassandra”. In: Big Data Analysis (ICBDA), 2017 IEEE 2nd International Conference on, pp. 89–93. IEEE (2017)

  59. Kamal, S.H., Elazhary, H.H., Hassanein, E.E.: A qualitative comparison of nosql data stores. Int. J. Adv. Comput. Sci. Appl 10(2), 330–338 (2019)

    Google Scholar 

  60. Karger, D., Lehman, E., Leighton, T., Panigrahy, R., Levine, M., Lewin, D.: Consistent hashing and random trees: Distributed caching protocols for relieving hot spots on the world wide web. In: Proceedings of the twenty-ninth annual ACM symposium on Theory of computing, pp. 654–663. ACM (1997)

  61. Kaur, K., Sachdeva, M.: Performance evaluation of newsql databases. In: Inventive Systems and Control (ICISC), 2017 International Conference on, pp. 1–5. IEEE (2017)

  62. Kepner, J., Gadepally, V., Hutchison, D., Jananthan, H., Mattson, T., Samsi, S., Reuther, A.: Associative array model of sql, nosql, and newsql databases. In: High Performance Extreme Computing Conference (HPEC), 2016 IEEE, pp. 1–9. IEEE (2016)

  63. Khan, W., Shahzad, W.: Predictive performance comparison analysis of relational & nosql graph databases. Int. J. Adv. Comput. Sci. Appl. 8(5), 523–530 (2017)

    Google Scholar 

  64. Khazaei, H., Fokaefs, M., Zareian, S., Beigi-Mohammadi, N., Ramprasad, B., Shtern, M., Gaikwad, P., Litoiu, M.: How do i choose the right nosql solution? A comprehensive theoretical and experimental survey. Big Data and Information Analytics (BDIA) 2, 185 (2016)

  65. Kumar, R., Charu, S.: Newsql databases: Scalable rdbms for oltp needs to handle big data

  66. Kumar, K.S., Mohanavalli, S., et al.: A performance comparison of document oriented nosql databases. In: Computer, Communication and Signal Processing (ICCCSP), 2017 International Conference on, pp. 1–6. IEEE (2017)

  67. Kunda, D., Phiri, H.: A comparative study of nosql and relational database. Zambia ICT J. 1(1), 1–4 (2017)

    Article  Google Scholar 

  68. Lee, J.G., Kang, M.: Geospatial big data: challenges and opportunities. Big Data Res. 2(2), 74–81 (2015)

    Article  MathSciNet  Google Scholar 

  69. Leveldb.org. [online] http://leveldb.org/

  70. Lloyd, W., Freedman, M.J., Kaminsky, M., Andersen, D.G.: Don’t settle for eventual: scalable causal consistency for wide-area storage with cops. In: Proceedings of the Twenty-Third ACM Symposium on Operating Systems Principles, pp. 401–416. ACM (2011)

  71. Lourenço, J.R., Cabral, B., Carreiro, P., Vieira, M., Bernardino, J.: Choosing the right nosql database for the job: a quality attribute evaluation. J. Big Data 2(1), 1 (2015)

    Article  Google Scholar 

  72. Lucchese, F.: From p2p to nosql: a continuous metric for classifying large-scale storage systems. J. Parallel Distrib. Comput. 113, 227–249 (2018)

    Article  Google Scholar 

  73. Mahgoub, A., Ganesh, S., Meyer, F., Grama, A., Chaterji, S.: Suitability of nosql systems—cassandra and scylladb—for iot workloads. In: Communication Systems and Networks (COMSNETS), 2017 9th International Conference on, pp. 476–479. IEEE (2017)

  74. Maia, D.C.M., Camargos, B.D., Holanda, M.: Performance analysis on voluntary geographic information systems with document-based nosql database. In: Developments and Advances in Intelligent Systems and Applications, pp. 181–197. Springer (2018)

  75. Mariadb.org. [online] https://mariadb.org,

  76. memcached - a distributed memory object caching system. [online] http://www.memcached.org,

  77. Memsql: The fastest in-memory database. [online] http://memsql.com,

  78. Mongodb for giant ideas. [online] https://www.mongodb.com/

  79. Murugan, P., Sentraya, A.: A study of nosql and newsql databases for data aggregation on big data (2013)

  80. Nair, S.M., Roy, R., Varghese, S.M.: Performance evaluation of mongodb and couchdb databases (2017)

  81. Neo4j: The world’s leading graph database. [online] https://neo4j.com/

  82. Nguyen, T.T., Nguyen, M.H.: Zing database: high-performance key-value store for large-scale storage service. Vietnam J. Comput. Sci. 2(1), 13–23 (2015)

    Article  Google Scholar 

  83. Nosql database | couchbase. [online] https://www.couchbase.com/

  84. Nuodb.com. [online] https://www.nuodb.com,

  85. Orientdb - distributed multi-model and graph database. [online] http://orientdb.com/orientdb/

  86. Oussous, A., Benjelloun, F.Z., Lahcen, A.A., Belfkih, S.: Comparison and classification of nosql databases for big data. In: Proceedings of International Conference on Big Data, Cloud and Applications (2015)

  87. Patil, M.M., Hanni, A., Tejeshwar, C., Patil, P.: A qualitative analysis of the performance of mongodb vs mysql database based on insertion and retriewal operations using a web/android application to explore load balancing—sharding in mongodb and its advantages. In: I-SMAC (IoT in Social, Mobile, Analytics and Cloud)(I-SMAC), 2017 International Conference on, pp. 325–330. IEEE (2017)

  88. Pereira, D.A., Ourique de Morais, W., Pignaton de Freitas, E.: Nosql real-time database performance comparison. Int. J. Parallel Emerg. Distrib. Syst. 32, 144–156 (2017)

    Google Scholar 

  89. Pivotal gemfire | big data. [online] http://pivotal.io/big-data/pivotal-gemfire

  90. Pouchdb, the javascript database that syncs! [online] https://pouchdb.com/

  91. Pouyanfar, S., Yang, Y., Chen, S.C., Shyu, M.L., Iyengar, S.: Multimedia big data analytics: a survey. ACM Comput. Surv. (CSUR) 51(1), 10 (2018)

    Article  Google Scholar 

  92. Pritchett, D.: Base: an acid alternative. Queue 6(3), 48–55 (2008)

    Article  Google Scholar 

  93. Priyanka, A.: A review of nosql databases, types and comparison with relational database. Int. J. Eng. Sci. 4963, 5 (2016)

    Google Scholar 

  94. Raut, A.: Nosql database and its comparison with rdbms. Int. J. Comput. Intell. Res. 13(7), 1645–1651 (2017)

    Google Scholar 

  95. Ravendb - the open source nosql database for .net. [online] https://ravendb.net/

  96. Redis. [online] http://www.redis.io/

  97. Reilly, E.D.: Memory-mapped i/o (2003)

  98. Rethinkdb: the open-source database for the realtime web. [online] https://www.rethinkdb.com/

  99. Rocksdb | a persistent key-value store. [online] http://rocksdb.org/

  100. Sahatqija, K., Ajdari, J., Zenuni, X., Raufi, B., Ismaili, F.: Comparison between relational and nosql databases. In: 2018 41st International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO), pp. 0216–0221. IEEE (2018)

  101. Saino, L., Psaras, I., Pavlou, G.: Understanding sharded caching systems. In: Computer Communications, IEEE INFOCOM 2016-The 35th Annual IEEE International Conference on, pp. 1–9. IEEE (2016)

  102. Schreiner, G.A., Knob, R., Duarte, D., Vilain, P., Mello, R.d.S.: Newsql through the looking glass. In: Proceedings of the 21st International Conference on Information Integration and Web-based Applications & Services, pp. 361–369 (2019)

  103. Simmen, D., Schnaitter, K., Davis, J., He, Y., Lohariwala, S., Mysore, A., Shenoi, V., Tan, M., Xiao, Y.: Large-scale graph analytics in aster 6: bringing context to big data discovery. Proc. VLDB Endow. 7(13), 1405–1416 (2014)

    Article  Google Scholar 

  104. Sparksee (graph database) | wikiwand. [online] http://www.wikiwand.com

  105. Stack overflow - where developers learn, share, and build careers. [online] https://stackoverflow.com/

  106. Stonebraker, M.: Concurrency control and consistency of multiple copies of data in distributed ingres. IEEE Trans. Softw. Eng. 5(3), 188 (1979)

    Article  MATH  Google Scholar 

  107. Tan, H., Luo, W., Ni, L.M.: Clost: a hadoop-based storage system for big spatio-temporal data analytics. In: Proceedings of the 21st ACM international conference on Information and knowledge management, pp. 2139–2143. ACM (2012)

  108. Tanase, I., Xia, Y., Nai, L., Liu, Y., Tan, W., Crawford, J., Lin, C.Y.: A highly efficient runtime and graph library for large scale graph analytics. In: Proceedings of Workshop on GRAph Data management Experiences and Systems, pp. 1–6. ACM (2014)

  109. Tarantool - index. [online] https://tarantool.org/

  110. Tauro, C.J., Patil, B.R., Prashanth, K.: A comparative analysis of different nosql databases on data model, query model and replication model. In: Proceedings of the International Conference on ERCICA (2013)

  111. Terrastore system properties. [online] http://db-engines.com/en/system/Terrastore

  112. Tidke, B., Mehta, R.: A comprehensive review and open challenges of stream big data. In: Soft Computing: Theories and Applications, pp. 89–99. Springer (2018)

  113. Twitter. it’s what’s happening. [online] https://twitter.com/

  114. Velocitygraph - graph database. [online] https://velocitydb.com/VelocityGraph.aspx

  115. Vogels, W.: Eventually consistent. Commun. ACM 52(1), 40–44 (2009)

    Article  Google Scholar 

  116. Voltdb: World’s fastest, in-memory operational database. [online] http://voltdb.com,

  117. Wei, L.Y., Hsu, Y.T., Peng, W.C., Lee, W.C.: Indexing spatial data in cloud data managements. Pervasive Mob. Comput. 15, 48–61 (2014)

    Article  Google Scholar 

  118. Wlodarczyk, T.W.: Overview of time series storage and processing in a cloud environment. In: Cloud Computing Technology and Science (CloudCom), 2012 IEEE 4th International Conference on, pp. 625–628. IEEE (2012)

  119. Xia, Y., Tanase, I.G., Nai, L., Tan, W., Liu, Y., Crawford, J., Lin, C.Y.: Graph analytics and storage. In: Big Data (Big Data), 2014 IEEE International Conference on, pp. 942–951. IEEE (2014)

  120. Yuan, L.Y., Wu, L., You, J.H., Chi, Y.: A demonstration of rubato db: A highly scalable newsql database system for oltp and big data applications. In: Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, pp. 907–912. ACM (2015)

  121. Zhang, X., Song, W., Liu, L.: An implementation approach to store gis spatial data on nosql database. In: Geoinformatics (GeoInformatics), 2014 22nd International Conference on, pp. 1–5. IEEE (2014)

  122. Zhong, Y., Han, J., Zhang, T., Fang, J.: A distributed geospatial data storage and processing framework for large-scale webgis. In: Geoinformatics (GEOINFORMATICS), 2012 20th International Conference on, pp. 1–7. IEEE (2012)

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Natalia Chaudhry.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Chaudhry, N., Yousaf, M.M. Architectural assessment of NoSQL and NewSQL systems. Distrib Parallel Databases 38, 881–926 (2020). https://doi.org/10.1007/s10619-020-07310-1

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10619-020-07310-1

Keywords

Navigation