Abstract
In recent times, data science has seen a rapid increase in the need for individuals and teams to analyze and manipulate data at scale for various scientific and commercial purposes. Groups often collaboratively analyze datasets, thereby leading to a proliferation of dataset versions at each stage of iterative exploration and analysis. Thus, an efficient collaborative system compatible with handling various versions is needed rather than the current most often used ad-hoc versioning mechanism. In a collaborative database, all the collaborators working together on a project need to interact together to perform extensive curation activities. In a typical scenario, when an update is made by one of the collaborators, it should become visible to the whole team for possible comments and modifications, which in turn aid the data custodian in making a better decision. Relational databases provide efficient data management and querying. However, it lacks various features to support efficient collaboration. In these databases, the approval and authorization of updates are based completely on the identity of the user, e.g., via SQL GRANT and REVOKE commands. In this paper, we present a framework well suited for collaboration and implemented on top of relational databases that will enable the team to manage as well as query the dataset versions efficiently.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Mershad, K., Malluhi, Q.M., Ouzzani, M., Tang, M., Gribskov, M., Aref, W.G.: AUDIT: approving and tracking updates with dependencies in collaborative databases. Distrib. Parallel Databases 36(1), 81–119 (2017). https://doi.org/10.1007/s10619-017-7208-y
Mershad, K., et al.: COACT: a query interface language for collaborative databases. Distrib. Parallel Databases 36(1), 121–151 (2017). https://doi.org/10.1007/s10619-017-7213-1
Huang, S., Xu, L., Liu, J., Elmore, A.J., Parameswaran, A.: Orpheus DB: bolt-on versioning for relational databases. Proc. VLDB Endow. 10(10), 1130–1141 (2017)
Xu, L., Huang, S., Hui, S., Elmore, A.J., Parameswaran, A.: ORPHEUSDB: a lightweight approach to relational dataset versioning. In: Proceedings of the 2017 ACM International Conference on Management of Data, pp. 1655–1658. ACM, May 2017
Navathe, S.B., Patil, U., Guan, W.: Genomic and proteomic databases: foundations, current status and future applications. J. Comput. Sci. Eng. 1(1), 1–30 (2007)
Goldberg, D., Nichols, D., Oki, B.M., Terry, D.: Using collaborative filtering to weave an information tapestry. Commun. ACM 35(12), 61–70 (1992)
Howe, B., Halperin, D., Ribalet, F., Chitnis, S., Armbrust, E.V.: Collaborative science workflows in SQL. Comput. Sci. Eng. 15(3), 22–31 (2013)
Halperin, D., Ribalet, F., Weitz, K., Saito, M.A., Howe, B., Armbrust, E.: Real-time collaborative analysis with (almost) pure SQL: a case study in biogeochemical oceanography. In: Proceedings of the 25th International Conference on Scientific and Statistical Database Management, p. 28. ACM, July 2013
Eirinaki, M., Abraham, S., Polyzotis, N., Shaikh, N.: QueRIE: collaborative database exploration. IEEE Trans. Knowl. Data Eng. 26(7), 1778–1790 (2014)
Harrington, J.L.: Relational Database Design and Implementation. Morgan Kaufmann (2016)
Coronel, C., Morris, S.: Database Systems: Design, Implementation, & Management. Cengage Learning (2016)
Nascimento, M.A., Sellis, T., Cheng, R.: Special issue on spatial and temporal database management. GeoInformatica 19(2), 297–298 (2015). https://doi.org/10.1007/s10707-015-0224-z
Radhakrishna, V., Kumar, P.V., Janaki, V.: An efficient approach to find similar temporal association patterns performing only single database scan. Revista Tecnica De La Facultad De Ingenieria Universidad Del Zulia 39(1), 241–255 (2016)
Collaborate your way to better SQL queries and data visualizations: https://blog.modeanalytics.com/collaborate-your-way-to-better-sql-queries/
SqlDBM’s Latest And Greatest: Team Project Collaboration: http://blog.sqldbm.com/team-collaboration/
Painless Data Versioning for Collaborative Data Science: https://medium.com/data-people/painless-data-versioning-for-collaborative-data-science-90cf3a2e279d
Bioinformatics Databases: https://www.ebi.ac.uk/training/online/course/bioinformatics-terrified-2018/what-bioinformatics
MySQL Database - very good thesis: http://www.engpaper.com/mysql-database-very-good-thesis.html
Relational Database Management System: https://searchdatamanagement.techtarget.com/definition/RDBMS-relational-database-management-system
Nambiar, U.B., Deshpande, P.M., Halasipuram, R.S., Iyer, B.R.: U.S. Patent No. 9,262,491. U.S. Patent and Trademark Office, Washington, DC (2016)
Alromema, N.A., Rahim, M.S.M., Albidewi, I.: Temporal database models validation and verification using mapping methodology. VFAST Trans. Softw. Eng. 11(2), 15–26 (2016)
Alromema, N., Rahim, M.S.M., Albidewi, I.: An Efficient approach for modeling temporal database with interval-based timestamping in conventional database management systems. J. Comput. Theor. Nanosci. 14(9), 4569–4575 (2017)
Kalanat, N., Kangavari, M.R.: Data mining methods for rule designing and rule triggering in active database systems. Int. J. Datab. Theory Appl. 8(1), 39–44 (2015)
Berndtsson, M., Mellin, J.: Active database knowledge model. In: Liu, L., Özsu, M.T. (eds.) Encyclopedia of Database Systems. Springer, Boston (2009). https://doi.org/10.1007/978-0-387-39940-9_508
Veldhuizen, T.L.: U.S. Patent No. 9,424,304. Patent and Trademark Office, Washington, DC (2016)
Meagher, M.: U.S. Patent No. 8,822,848. U.S. Patent and Trademark Office, Washington, DC (2014)
Vance, J. R., et al.: U.S. Patent No. 9,277,833. Patent and Trademark Office. Washington, DC (2016)
Collins Jr., D.A., Amada, J.: U.S. Patent No. 8,925,811. U.S. Patent and Trademark Office. Washington, DC (2015)
Active Databases: http://web.cs.ucla.edu/classes/winter04/cs240A/notes/node1.html
Apache HBase Reference Guide: https://hbase.apache.org/book.html
The Architecture of Apache HBase: https://intellipaat.com/blog/what-is-apache-hbase/
Gómez, A., Benelallam, A., Tisi, M.: Decentralized model persistence for distributed computing. In: 3rd BigMDE Workshop, July 2015
Ramesh, D., Kumar, C.: An incremental protocol approach for secure collaboration between Byzantine processes in heterogeneous distributed processing systems. Glob. J. Technol. 3 (2013)
Ramesh, D., Khosla, E., Bhukya, S.N.: Inclusion of e-commerce workflow with NoSQL DBMS: MongoDB document store. In: 2016 IEEE International Conference on Computational Intelligence and Computing Research (ICCIC), pp. 1–5. IEEE, December 2016
Acknowledgments
This research work is supported by the Indian Institute of Technology (Indian School of Mines), Dhanbad, Govt. of India. The authors wish to express their gratitude and heartiest thanks to the Department of Computer Science & Engineering, Indian Institute of Technology (ISM), Dhanbad, India, for their research support.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Ramesh, D., Trivedi, M.C. (2022). MySQL Collaboration by Approving and Tracking Updates with Dependencies: A Versioning Approach. In: Gervasi, O., Murgante, B., Misra, S., Rocha, A.M.A.C., Garau, C. (eds) Computational Science and Its Applications – ICCSA 2022 Workshops. ICCSA 2022. Lecture Notes in Computer Science, vol 13381. Springer, Cham. https://doi.org/10.1007/978-3-031-10548-7_28
Download citation
DOI: https://doi.org/10.1007/978-3-031-10548-7_28
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-10547-0
Online ISBN: 978-3-031-10548-7
eBook Packages: Computer ScienceComputer Science (R0)