Abstract
The division operator of the relational algebra allows a simple and intuitive representation of queries involving the concept of “for all”, and it is therefore required in many real applications. However, relational division cannot support the needs of modern applications that manipulate complex data, such as images, audio, long texts, and genetic sequences. These data are better compared by similarity, whereas relational algebra always compares data by equality or inequality. Recent works focus on extending relational operators to support similarity comparisons and on including them in relational database management systems (RDBMS). This work incorporates several similarity-aware division algorithms into a commercial RDBMS and studies their behavior. We compared the two state-of-the-art algorithms against several SQL statements and identified when to use each of them to improve query execution time. We then propose an extension of the SQL syntax and the query analyzer to support this new operator.
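As an illustration only, the contrast between equality-based division and a similarity-aware variant can be sketched in a few lines of Python. This is a simplified reading of the "for all" semantics, not the algorithms evaluated in the paper: the function names, the absolute-difference distance, the threshold value, and the toy data are all assumptions made for the example.

```python
from typing import Callable, Hashable, Iterable, Set, Tuple


def divide(dividend: Iterable[Tuple[Hashable, Hashable]],
           divisor: Iterable[Hashable]) -> Set[Hashable]:
    """Classic division: keys paired with EVERY divisor value (by equality)."""
    required = set(divisor)
    groups = {}
    for key, value in dividend:
        groups.setdefault(key, set()).add(value)
    # A key qualifies when its value set is a superset of the divisor.
    return {key for key, values in groups.items() if required <= values}


def similarity_divide(dividend: Iterable[Tuple[Hashable, float]],
                      divisor: Iterable[float],
                      distance: Callable[[float, float], float],
                      threshold: float) -> Set[Hashable]:
    """Hypothetical similarity-aware variant: a key qualifies when every
    divisor value has at least one of the key's values within `threshold`."""
    required = list(divisor)
    groups = {}
    for key, value in dividend:
        groups.setdefault(key, []).append(value)
    return {
        key
        for key, values in groups.items()
        if all(any(distance(v, r) <= threshold for v in values)
               for r in required)
    }


# Toy data (assumed): each key is paired with numeric feature values.
dividend = [("s1", 1.0), ("s1", 5.0), ("s2", 1.1), ("s2", 5.2), ("s3", 1.0)]
divisor = [1.0, 5.0]

exact = divide(dividend, divisor)
similar = similarity_divide(dividend, divisor,
                            distance=lambda a, b: abs(a - b),
                            threshold=0.25)
print(exact)            # only s1 matches both divisor values exactly
print(sorted(similar))  # s1 and s2 match within the threshold
```

Under equality, only the key whose values match the divisor exactly qualifies; once values are compared by distance, a key with near matches qualifies as well. For complex data such as images, the distance function would operate on extracted feature vectors rather than on plain numbers.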
Acknowledgements
We would like to thank CNPq, CAPES project 10357907/M and FAPESP project 2016/170780 for financial support.
Copyright information
© 2018 Springer Nature Switzerland AG
About this paper
Cite this paper
Vasconcelos, G.Q., Kaster, D.S., Cordeiro, R.L.F. (2018). On the Support of the Similarity-Aware Division Operator in a Commercial RDBMS. In: Benczúr, A., Thalheim, B., Horváth, T. (eds) Advances in Databases and Information Systems. ADBIS 2018. Lecture Notes in Computer Science, vol 11019. Springer, Cham. https://doi.org/10.1007/978-3-319-98398-1_10
DOI: https://doi.org/10.1007/978-3-319-98398-1_10
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-98397-4
Online ISBN: 978-3-319-98398-1
eBook Packages: Computer Science, Computer Science (R0)