ABSTRACT
Support for multi-model data has become standard in most existing DBMSs, and database design tools are general enough to cover multiple models as well. However, the step from a conceptual schema (e.g., ER or UML) to a logical multi-model schema of a single DBMS, or of a combination of DBMSs, is not straightforward.
In this paper, we show how category theory can be used to represent multi-model data and schemas, and how the mutual mapping between the categorical representation and the logical models of particular DBMSs can be designed. For this purpose, we define the notion of an access path, which makes it possible to specify the mapping for all currently popular models. To demonstrate the advantages of the proposal, we introduce the MM-cat framework, which applies the proposed approaches to MongoDB and PostgreSQL.