research-article

A Comparative Performance Evaluation of Multi-Model NoSQL Databases and Polyglot Persistence

Authors:

Dimitri Van Landuyt,

Julien Benaouda,

Vincent Reniers,

Wouter JoosenAuthors Info & Claims

SAC '23: Proceedings of the 38th ACM/SIGAPP Symposium on Applied Computing

Pages 286 - 293

https://doi.org/10.1145/3555776.3577645

Published: 07 June 2023 Publication History

Abstract

Multi-model databases support different NoSQL data models at once, typically a combination of relational, key-value, document, and graph models. Their expected benefits include increased versatility, reduced installation complexity, improved database performance, and smaller storage footprint. However, these benefits are insufficiently investigated, and current studies lack a comparison between multi-model databases and equivalent polyglot database architectures, i.e. setups that combine different technologies to accomplish the same functionality.

To fill this gap, we conduct a series of benchmarks for the purpose of investigating the efficiency of two multi-model NoSQL databases (ArangoDB and OrientDB) in comparison to an equivalent polyglot baseline. These experiments have been performed with UniBench, a framework specifically designed to evaluate the performance of multi-model databases in terms of query execution time, which was also further extended for the purposes of this study.

The study results indicate that the choice between a multi-model NoSQL database and a combination of NoSQL databases depends on the types of data and queries, the underlying data models and the involved databases. These outcomes emphasize the necessity for database architects to prototype and evaluate alternative storage architectures in function of specific application requirements before committing to a technology or paradigm.

References

[1]

Jemal Abawajy. 2015. Comprehensive analysis of big data variety landscape. International journal of parallel, emergent and distributed systems 30, 1 (2015), 5--14.

Digital Library

[2]

Veronika Abramova, Jorge Bernardino, and Pedro Furtado. 2014. Experimental Evaluation of NoSQL Databases. International journal of database management systems 6, 3 (2014), 1--16.

[3]

A. Buble, L. Bulej, and P. Tuma. 2003. CORBA benchmarking: a course with hidden obstacles. In Proceedings International Parallel and Distributed Processing Symposium. 6 pp.-.

[4]

Luca Cabibbo. 2013. ONDM: an Object-NoSQL Datastore Mapper. Faculty of Engineering, Roma Tre University. Retrieved June 15th (2013).

[5]

Cody Coleman, Deepak Narayanan, Daniel Kang, Tian Zhao, Jian Zhang, Luigi Nardi, Peter Bailis, Kunle Olukotun, Chris Ré, and Matei Zaharia. 2017. Dawn-bench: An end-to-end deep learning benchmark and competition. Training 100, 101 (2017), 102.

[6]

Brian F. Cooper, Adam Silberstein, Erwin Tam, Raghu Ramakrishnan, and Russell Sears. 2010. Benchmarking cloud serving systems with YCSB. In Proceedings of the 1st ACM symposium on Cloud computing. 143--154.

Digital Library

[7]

DB-Engines. 2021. Ranking. https://db-engines.com/en/ranking.

[8]

Claudio de Lima and Ronaldo dos Santos Mello. 2015. A workload-driven logical design approach for NoSQL document databases. In Proceedings of the 17th International Conference on Information Integration and Web-based Applications & Services. 1--10.

Digital Library

[9]

Priya Dialani. 2022. The Future of Data Revolution will be Unstructured Data. https://www.analyticsinsight.net/the-future-of-data-revolution-will-be-unstructured-data/.

[10]

Ahmad Ghazal, Todor Ivanov, Pekka Kostamaa, Alain Crolotte, Ryan Voong, Mohammed Al-Kateb, Waleed Ghazal, and Roberto V Zicari. 2017. Bigbench V2: the new and improved bigbench. In 2017 IEEE 33rd International Conference on Data Engineering (ICDE). IEEE, 1225--1236.

[11]

Zhiqiang Gong, Ping Zhong, and Weidong Hu. 2019. Diversity in machine learning. IEEE Access 7 (2019), 64323--64350.

[12]

Katarina Grolinger, Wilson A Higashino, Abhinav Tiwari, and Miriam AM Capretz. 2013. Data management in cloud environments: NoSQL and NewSQL data stores. Journal of Cloud Computing: advances, systems and applications 2, 1 (2013), 1--24.

Digital Library

[13]

Rohmat Gunawan, Alam Rahmatulloh, and Irfan Darmawan. 2019. Performance Evaluation of Query Response Time in The Document Stored NoSQL Database. In 2019 16th International Conference on Quality in Research (QIR): International Symposium on Electrical and Computer Engineering. IEEE, 1--6.

[14]

Cláudio Lima and Ronaldo Santos Mello. 2016. On proposing and evaluating a NoSQL document database logical approach. International Journal of Web Information Systems (2016).

[15]

David Lion, Adrian Chiu, Hailong Sun, Xin Zhuang, Nikola Grcevski, and Ding Yuan. 2016. Don't Get Caught in the Cold, Warm-up Your JVM: Understand and Eliminate JVM Warm-up Overhead in Data-Parallel Systems. In 12th USENIX Symposium on Operating Systems Design and Implementation (OSDI 16). 383--400.

Digital Library

[16]

Matteo Lissandrini, Martin Brugnara, and Yannis Velegrakis. 2018. Beyond macrobenchmarks: microbenchmark-based graph database evaluation. Proceedings of the VLDB Endowment 12, 4 (2018), 390--403.

Digital Library

[17]

Jiaheng Lu and Irena Holubová. 2019. Multi-model databases: a new journey to handle the variety of data. ACM Computing Surveys (CSUR) 52, 3 (2019), 1--38.

Digital Library

[18]

Martin Macak, Matus Stovcik, Barbora Buhnova, and Michal Merjavy. 2020. How well a multi-model database performs against its single-model variants: Benchmarking OrientDB with Neo4j and MongoDB. In 2020 15th Conference on Computer Science and Information Systems (FedCSIS). IEEE, 463--470.

[19]

Peter Mattson, Vijay Janapa Reddi, Christine Cheng, Cody Coleman, Greg Diamos, David Kanter, Paulius Micikevicius, David Patterson, Guenther Schmuelling, Hanlin Tang, et al. 2020. MLPerf: An industry standard benchmark suite for machine learning performance. IEEE Micro 40, 2 (2020), 8--16.

[20]

Jean Moschetta and Giuliano Casale. 2012. OFBench: An Enterprise Application Benchmark for Cloud Resource Management Studies. In 2012 14th International Symposium on Symbolic and Numeric Algorithms for Scientific Computing. 393--399.

Digital Library

[21]

Fábio Roberto Oliveira and Luis del Val Cura. 2016. Performance evaluation of NoSQL multi-model data stores in polyglot persistence applications. In Proceedings of the 20th International Database Engineering & Applications Symposium. 230--235.

Digital Library

[22]

Omoruyi Osemwegie, Kennedy Okokpujie, Nsikan Nkordeh, Charles Ndujiuba, John Samuel, and Uzairue Stanley. 2018. Performance Benchmarking of Key-Value Store NoSQL Databases. International Journal of Electrical and Computer Engineering 8, 6 (2018), 5333--5341.

[23]

Diogo Augusto Pereira, Wagner Ourique de Morais, and Edison Pignaton de Freitas. 2018. NoSQL real-time database performance comparison. International Journal of Parallel, Emergent and Distributed Systems 33, 2 (2018), 144--156.

[24]

Ewa Płuciennik and Kamil Zgorzałek. 2017. The multi-model databases-a review. In International Conference: Beyond Databases, Architectures and Structures. Springer, 141--152.

[25]

Tilmann Rabl, Christoph Brücke, Philipp Härtling, Stella Stars, Rodrigo Escobar Palacios, Hamesh Patel, Satyam Srivastava, Christoph Boden, Jens Meiners, and Sebastian Schelter. 2019. ADABench-Towards an industry standard benchmark for advanced analytics. In Technology Conference on Performance Evaluation and Benchmarking. Springer, 47--63.

[26]

Dharavath Ramesh, Ekaansh Khosla, and Shankar Nayak Bhukya. 2016. Inclusion of e-commerce workflow with NoSQL DBMS: MongoDB document store. In 2016 IEEE international conference on computational intelligence and computing research (ICCIC). IEEE, 1--5.

[27]

Vincent Reniers, Ansar Rafique, Dimitri Van Landuyt, and Wouter Joosen. 2017. Object-NoSQL Database Mappers: a benchmark study on the performance overhead. Journal of Internet Services and Applications 8, 1 (2017), 1--16.

[28]

Noa Roy-Hubara, Peretz Shoval, and Arnon Sturm. 2022. Selecting databases for Polyglot Persistence applications. Data & Knowledge Engineering 137 (2022), 101950.

Digital Library

[29]

Pramod J Sadalage and Martin Fowler. 2013. NoSQL distilled: a brief guide to the emerging world of polyglot persistence. Pearson Education.

[30]

Michael Stonebraker. 2010. Errors in database systems, eventual consistency, and the cap theorem. Communications of the ACM, BLOG@ ACM (2010).

[31]

Uta Störl, Thomas Hauf, Meike Klettke, and Stefanie Scherzinger. 2015. Schemaless NoSQL data stores-Object-NoSQL Mappers to the rescue? Datenbanksysteme für Business, Technologie und Web (BTW 2015) (2015).

[32]

Enqing Tang and Yushun Fan. 2016. Performance Comparison between Five NoSQL Databases. In 2016 7th International Conference on Cloud Computing and Big Data (CCBD). IEEE, 105--109.

[33]

Dimitri Van Landuyt, Julien Benaouda, Vincent Reniers, Ansar Rafique, and Wouter Joosen. 2023. A Comparative Performance Evaluation of Multi-Model NoSQL Databases and Polyglot Persistence: study results (SF1, SF10 and SF30). people.cs.kuleuven.be/~dimitri.vanlanduyt/dvanlanduyt_sac-dbdm_2023_results.zip.

[34]

Luís HN Villaça, Leonardo G Azevedo, and Fernanda Baião. 2018. Query strategies on polyglot persistence in microservices. In Proceedings of the 33rd Annual ACM Symposium on Applied Computing. 1725--1732.

Digital Library

[35]

Lei Wang, Jianfeng Zhan, Chunjie Luo, Yuqing Zhu, Qiang Yang, Yongqiang He, Wanling Gao, Zhen Jia, Yingjie Shi, Shujie Zhang, et al. 2014. Bigdatabench: A big data benchmark suite from internet services. In 2014 IEEE 20th international symposium on high performance computer architecture (HPCA). IEEE, 488--499.

[36]

Chao Zhang. 2022. UniBench - Towards Benchmarking the Multi-Model DBMS. https://github.com/HY-UDBMS/UniBench.

[37]

Chao Zhang and Jiaheng Lu. 2021. Holistic evaluation in multi-model databases benchmarking. Distributed and Parallel Databases 39, 1 (2021), 1--33.

Digital Library

[38]

Chao Zhang, Jiaheng Lu, Pengfei Xu, and Yuxing Chen. 2018. UniBench: A benchmark for multi-model database management systems. In Technology Conference on Performance Evaluation and Benchmarking. Springer, 7--23.

Cited By

Sachdeva SVasava J(2024)Plugging and Playing with Variety of Data using Multi-Model Database and Polyglot Persistence2024 Third International Conference on Electrical, Electronics, Information and Communication Technologies (ICEEICT)10.1109/ICEEICT61591.2024.10718481(1-8)Online publication date: 24-Jul-2024
https://doi.org/10.1109/ICEEICT61591.2024.10718481
Van Landuyt DLevrau MReniers VJoosen W(2024)An E-Commerce Benchmark for Evaluating Performance Trade-Offs in Document StoresBig Data Analytics and Knowledge Discovery10.1007/978-3-031-68323-7_23(284-290)Online publication date: 18-Aug-2024
https://doi.org/10.1007/978-3-031-68323-7_23

Index Terms

A Comparative Performance Evaluation of Multi-Model NoSQL Databases and Polyglot Persistence
1. Information systems
  1. Data management systems
    1. Database administration
      1. Database performance evaluation
  2. Information storage systems
    1. Storage architectures
      1. Cloud based storage

Recommendations

Performance Evaluation of NoSQL Multi-Model Data Stores in Polyglot Persistence Applications
IDEAS '16: Proceedings of the 20th International Database Engineering & Applications Symposium

NoSQL data store systems have recently been introduced as alternatives to traditional relational database management systems. These data stores systems implement simpler and scalable data models that increase the performance and efficiency of a new kind ...
A performance evaluation of NoSQL databases to manage proteomics data

NoSQL databases have recently been introduced as alternatives to traditional relational database management systems because of their capabilities in terms of storing data and query retrieval. Biological datasets can be modelled using various models, for ...
A performance evaluation of NoSQL databases to manage proteomics data

NoSQL databases have recently been introduced as alternatives to traditional relational database management systems because of their capabilities in terms of storing data and query retrieval. Biological datasets can be modelled using various models, for ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

SAC '23: Proceedings of the 38th ACM/SIGAPP Symposium on Applied Computing

March 2023

1932 pages

ISBN:9781450395175

DOI:10.1145/3555776

Conference Chairs:
Jiman Hong
Soongsil University, South Korea
,
Maart Lanperne
Tallinn University, Estonia
,
Program Chairs:
Juw Won Park
University of Louisville, USA
,
Tomas Cerny
Baylor University, USA
,
Publication Chair:
Hossain Shahriar
Kennesaw State University, USA

Copyright © 2023 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

SIGAPP: ACM Special Interest Group on Applied Computing

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 07 June 2023

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Conference

SAC '23

Sponsor:

SIGAPP

SAC '23: 38th ACM/SIGAPP Symposium on Applied Computing

March 27 - 31, 2023

Tallinn, Estonia

Acceptance Rates

Overall Acceptance Rate 1,650 of 6,669 submissions, 25%

Upcoming Conference

SAC '25

Sponsor:
sigapp

The 40th ACM/SIGAPP Symposium on Applied Computing

March 31 - April 4, 2025

Catania , Italy

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

2
Total Citations
View Citations
151
Total Downloads

Downloads (Last 12 months)75
Downloads (Last 6 weeks)9

Reflects downloads up to 16 Feb 2025

Other Metrics

View Author Metrics

Citations

Cited By

Sachdeva SVasava J(2024)Plugging and Playing with Variety of Data using Multi-Model Database and Polyglot Persistence2024 Third International Conference on Electrical, Electronics, Information and Communication Technologies (ICEEICT)10.1109/ICEEICT61591.2024.10718481(1-8)Online publication date: 24-Jul-2024
https://doi.org/10.1109/ICEEICT61591.2024.10718481
Van Landuyt DLevrau MReniers VJoosen W(2024)An E-Commerce Benchmark for Evaluating Performance Trade-Offs in Document StoresBig Data Analytics and Knowledge Discovery10.1007/978-3-031-68323-7_23(284-290)Online publication date: 18-Aug-2024
https://doi.org/10.1007/978-3-031-68323-7_23

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Figures

Tables

Media

View Table of Conten