skip to main content
10.1145/3555776.3577645acmconferencesArticle/Chapter ViewAbstractPublication PagessacConference Proceedingsconference-collections
research-article

A Comparative Performance Evaluation of Multi-Model NoSQL Databases and Polyglot Persistence

Published:07 June 2023Publication History

ABSTRACT

Multi-model databases support different NoSQL data models at once, typically a combination of relational, key-value, document, and graph models. Their expected benefits include increased versatility, reduced installation complexity, improved database performance, and smaller storage footprint. However, these benefits are insufficiently investigated, and current studies lack a comparison between multi-model databases and equivalent polyglot database architectures, i.e. setups that combine different technologies to accomplish the same functionality.

To fill this gap, we conduct a series of benchmarks for the purpose of investigating the efficiency of two multi-model NoSQL databases (ArangoDB and OrientDB) in comparison to an equivalent polyglot baseline. These experiments have been performed with UniBench, a framework specifically designed to evaluate the performance of multi-model databases in terms of query execution time, which was also further extended for the purposes of this study.

The study results indicate that the choice between a multi-model NoSQL database and a combination of NoSQL databases depends on the types of data and queries, the underlying data models and the involved databases. These outcomes emphasize the necessity for database architects to prototype and evaluate alternative storage architectures in function of specific application requirements before committing to a technology or paradigm.

References

  1. Jemal Abawajy. 2015. Comprehensive analysis of big data variety landscape. International journal of parallel, emergent and distributed systems 30, 1 (2015), 5--14.Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Veronika Abramova, Jorge Bernardino, and Pedro Furtado. 2014. Experimental Evaluation of NoSQL Databases. International journal of database management systems 6, 3 (2014), 1--16.Google ScholarGoogle ScholarCross RefCross Ref
  3. A. Buble, L. Bulej, and P. Tuma. 2003. CORBA benchmarking: a course with hidden obstacles. In Proceedings International Parallel and Distributed Processing Symposium. 6 pp.-. Google ScholarGoogle ScholarCross RefCross Ref
  4. Luca Cabibbo. 2013. ONDM: an Object-NoSQL Datastore Mapper. Faculty of Engineering, Roma Tre University. Retrieved June 15th (2013).Google ScholarGoogle Scholar
  5. Cody Coleman, Deepak Narayanan, Daniel Kang, Tian Zhao, Jian Zhang, Luigi Nardi, Peter Bailis, Kunle Olukotun, Chris Ré, and Matei Zaharia. 2017. Dawn-bench: An end-to-end deep learning benchmark and competition. Training 100, 101 (2017), 102.Google ScholarGoogle Scholar
  6. Brian F. Cooper, Adam Silberstein, Erwin Tam, Raghu Ramakrishnan, and Russell Sears. 2010. Benchmarking cloud serving systems with YCSB. In Proceedings of the 1st ACM symposium on Cloud computing. 143--154.Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. DB-Engines. 2021. Ranking. https://db-engines.com/en/ranking.Google ScholarGoogle Scholar
  8. Claudio de Lima and Ronaldo dos Santos Mello. 2015. A workload-driven logical design approach for NoSQL document databases. In Proceedings of the 17th International Conference on Information Integration and Web-based Applications & Services. 1--10.Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Priya Dialani. 2022. The Future of Data Revolution will be Unstructured Data. https://www.analyticsinsight.net/the-future-of-data-revolution-will-be-unstructured-data/.Google ScholarGoogle Scholar
  10. Ahmad Ghazal, Todor Ivanov, Pekka Kostamaa, Alain Crolotte, Ryan Voong, Mohammed Al-Kateb, Waleed Ghazal, and Roberto V Zicari. 2017. Bigbench V2: the new and improved bigbench. In 2017 IEEE 33rd International Conference on Data Engineering (ICDE). IEEE, 1225--1236.Google ScholarGoogle ScholarCross RefCross Ref
  11. Zhiqiang Gong, Ping Zhong, and Weidong Hu. 2019. Diversity in machine learning. IEEE Access 7 (2019), 64323--64350.Google ScholarGoogle ScholarCross RefCross Ref
  12. Katarina Grolinger, Wilson A Higashino, Abhinav Tiwari, and Miriam AM Capretz. 2013. Data management in cloud environments: NoSQL and NewSQL data stores. Journal of Cloud Computing: advances, systems and applications 2, 1 (2013), 1--24.Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Rohmat Gunawan, Alam Rahmatulloh, and Irfan Darmawan. 2019. Performance Evaluation of Query Response Time in The Document Stored NoSQL Database. In 2019 16th International Conference on Quality in Research (QIR): International Symposium on Electrical and Computer Engineering. IEEE, 1--6.Google ScholarGoogle Scholar
  14. Cláudio Lima and Ronaldo Santos Mello. 2016. On proposing and evaluating a NoSQL document database logical approach. International Journal of Web Information Systems (2016).Google ScholarGoogle ScholarCross RefCross Ref
  15. David Lion, Adrian Chiu, Hailong Sun, Xin Zhuang, Nikola Grcevski, and Ding Yuan. 2016. Don't Get Caught in the Cold, Warm-up Your JVM: Understand and Eliminate JVM Warm-up Overhead in Data-Parallel Systems. In 12th USENIX Symposium on Operating Systems Design and Implementation (OSDI 16). 383--400.Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Matteo Lissandrini, Martin Brugnara, and Yannis Velegrakis. 2018. Beyond macrobenchmarks: microbenchmark-based graph database evaluation. Proceedings of the VLDB Endowment 12, 4 (2018), 390--403.Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Jiaheng Lu and Irena Holubová. 2019. Multi-model databases: a new journey to handle the variety of data. ACM Computing Surveys (CSUR) 52, 3 (2019), 1--38.Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Martin Macak, Matus Stovcik, Barbora Buhnova, and Michal Merjavy. 2020. How well a multi-model database performs against its single-model variants: Benchmarking OrientDB with Neo4j and MongoDB. In 2020 15th Conference on Computer Science and Information Systems (FedCSIS). IEEE, 463--470.Google ScholarGoogle ScholarCross RefCross Ref
  19. Peter Mattson, Vijay Janapa Reddi, Christine Cheng, Cody Coleman, Greg Diamos, David Kanter, Paulius Micikevicius, David Patterson, Guenther Schmuelling, Hanlin Tang, et al. 2020. MLPerf: An industry standard benchmark suite for machine learning performance. IEEE Micro 40, 2 (2020), 8--16.Google ScholarGoogle ScholarCross RefCross Ref
  20. Jean Moschetta and Giuliano Casale. 2012. OFBench: An Enterprise Application Benchmark for Cloud Resource Management Studies. In 2012 14th International Symposium on Symbolic and Numeric Algorithms for Scientific Computing. 393--399. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Fábio Roberto Oliveira and Luis del Val Cura. 2016. Performance evaluation of NoSQL multi-model data stores in polyglot persistence applications. In Proceedings of the 20th International Database Engineering & Applications Symposium. 230--235.Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Omoruyi Osemwegie, Kennedy Okokpujie, Nsikan Nkordeh, Charles Ndujiuba, John Samuel, and Uzairue Stanley. 2018. Performance Benchmarking of Key-Value Store NoSQL Databases. International Journal of Electrical and Computer Engineering 8, 6 (2018), 5333--5341.Google ScholarGoogle Scholar
  23. Diogo Augusto Pereira, Wagner Ourique de Morais, and Edison Pignaton de Freitas. 2018. NoSQL real-time database performance comparison. International Journal of Parallel, Emergent and Distributed Systems 33, 2 (2018), 144--156.Google ScholarGoogle ScholarCross RefCross Ref
  24. Ewa Płuciennik and Kamil Zgorzałek. 2017. The multi-model databases-a review. In International Conference: Beyond Databases, Architectures and Structures. Springer, 141--152.Google ScholarGoogle ScholarCross RefCross Ref
  25. Tilmann Rabl, Christoph Brücke, Philipp Härtling, Stella Stars, Rodrigo Escobar Palacios, Hamesh Patel, Satyam Srivastava, Christoph Boden, Jens Meiners, and Sebastian Schelter. 2019. ADABench-Towards an industry standard benchmark for advanced analytics. In Technology Conference on Performance Evaluation and Benchmarking. Springer, 47--63.Google ScholarGoogle Scholar
  26. Dharavath Ramesh, Ekaansh Khosla, and Shankar Nayak Bhukya. 2016. Inclusion of e-commerce workflow with NoSQL DBMS: MongoDB document store. In 2016 IEEE international conference on computational intelligence and computing research (ICCIC). IEEE, 1--5.Google ScholarGoogle ScholarCross RefCross Ref
  27. Vincent Reniers, Ansar Rafique, Dimitri Van Landuyt, and Wouter Joosen. 2017. Object-NoSQL Database Mappers: a benchmark study on the performance overhead. Journal of Internet Services and Applications 8, 1 (2017), 1--16.Google ScholarGoogle ScholarCross RefCross Ref
  28. Noa Roy-Hubara, Peretz Shoval, and Arnon Sturm. 2022. Selecting databases for Polyglot Persistence applications. Data & Knowledge Engineering 137 (2022), 101950.Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Pramod J Sadalage and Martin Fowler. 2013. NoSQL distilled: a brief guide to the emerging world of polyglot persistence. Pearson Education.Google ScholarGoogle Scholar
  30. Michael Stonebraker. 2010. Errors in database systems, eventual consistency, and the cap theorem. Communications of the ACM, BLOG@ ACM (2010).Google ScholarGoogle Scholar
  31. Uta Störl, Thomas Hauf, Meike Klettke, and Stefanie Scherzinger. 2015. Schemaless NoSQL data stores-Object-NoSQL Mappers to the rescue? Datenbanksysteme für Business, Technologie und Web (BTW 2015) (2015).Google ScholarGoogle Scholar
  32. Enqing Tang and Yushun Fan. 2016. Performance Comparison between Five NoSQL Databases. In 2016 7th International Conference on Cloud Computing and Big Data (CCBD). IEEE, 105--109.Google ScholarGoogle ScholarCross RefCross Ref
  33. Dimitri Van Landuyt, Julien Benaouda, Vincent Reniers, Ansar Rafique, and Wouter Joosen. 2023. A Comparative Performance Evaluation of Multi-Model NoSQL Databases and Polyglot Persistence: study results (SF1, SF10 and SF30). people.cs.kuleuven.be/~dimitri.vanlanduyt/dvanlanduyt_sac-dbdm_2023_results.zip.Google ScholarGoogle Scholar
  34. Luís HN Villaça, Leonardo G Azevedo, and Fernanda Baião. 2018. Query strategies on polyglot persistence in microservices. In Proceedings of the 33rd Annual ACM Symposium on Applied Computing. 1725--1732.Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. Lei Wang, Jianfeng Zhan, Chunjie Luo, Yuqing Zhu, Qiang Yang, Yongqiang He, Wanling Gao, Zhen Jia, Yingjie Shi, Shujie Zhang, et al. 2014. Bigdatabench: A big data benchmark suite from internet services. In 2014 IEEE 20th international symposium on high performance computer architecture (HPCA). IEEE, 488--499.Google ScholarGoogle ScholarCross RefCross Ref
  36. Chao Zhang. 2022. UniBench - Towards Benchmarking the Multi-Model DBMS. https://github.com/HY-UDBMS/UniBench.Google ScholarGoogle Scholar
  37. Chao Zhang and Jiaheng Lu. 2021. Holistic evaluation in multi-model databases benchmarking. Distributed and Parallel Databases 39, 1 (2021), 1--33.Google ScholarGoogle ScholarDigital LibraryDigital Library
  38. Chao Zhang, Jiaheng Lu, Pengfei Xu, and Yuxing Chen. 2018. UniBench: A benchmark for multi-model database management systems. In Technology Conference on Performance Evaluation and Benchmarking. Springer, 7--23.Google ScholarGoogle Scholar

Index Terms

  1. A Comparative Performance Evaluation of Multi-Model NoSQL Databases and Polyglot Persistence

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        SAC '23: Proceedings of the 38th ACM/SIGAPP Symposium on Applied Computing
        March 2023
        1932 pages
        ISBN:9781450395175
        DOI:10.1145/3555776

        Copyright © 2023 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 7 June 2023

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article

        Acceptance Rates

        Overall Acceptance Rate1,650of6,669submissions,25%
      • Article Metrics

        • Downloads (Last 12 months)96
        • Downloads (Last 6 weeks)13

        Other Metrics

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader