Abstract
Query optimization is a crucial process across data mining and big data analytics. As the size of the data in the modern applications is increasing due to various sources, types and multi-modal records across databases, there is an urge to optimize lookup and search operations. Therefore, indexes can be utilized to solve the matter of rapid data growth as they enhance the performance of the database and subsequently the cloud server where it is stored. In this paper an index on spatial data, i.e. coordinates on the plane or on the map is presented. This index is be based on the R-Tree which is suitable for spatial data and is distributed so that it can scale and adapt to massive amounts of data without losing its performance. The results of the proposed method are encouraging across all experiments and future directions of this work include experiments on skewed data.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
which stores the coordinates in ascending order, beginning with the lower left corner of the rectangle and ending with the top left corner.
- 2.
Even when many users execute the same query at the same time.
- 3.
This indicates that there are no thick or sparse regions within the space covered within the datasets and ensures that the burden is dispersed evenly across the reducers.
- 4.
The distributed index contains Terabytes of records.
- 5.
Because sorting is being done by HBase, PUT functions on a lower table size are substantially quicker than on a much bigger table.
References
Babcock, B., Chaudhuri, S.: Towards a robust query optimizer: a principled and practical approach. In: Proceedings of the 2005 ACM SIGMOD International Conference on Management of Data, pp. 119–130 (2005)
Dayan, N., Athanassoulis, M., Idreos, S.: Optimal bloom filters and adaptive merging for LSM-trees. ACM Trans. Database Syst (TODS) 43(4), 1–48 (2018)
Gu, T., Feng, K., Cong, G., Long, C., Wang, Z., Wang, S.: A Reinforcement Learning Based R-Tree for Spatial Data Indexing in Dynamic Environments. arXiv preprint arXiv:2103.04541 (2021)
Haider, C., Wang, J., Aref, W.G.: The AI+ R-tree: An Instance-optimized R-tree. arXiv preprint arXiv:2207.00550 (2022)
He, J., Chen, H.: An LSM-tree index for spatial data. Algorithms 15(4), 113 (2022)
Izenov, Y., Datta, A., Rusu, F., Shin, J.H.: COMPASS: Online sketch-based query optimization for in-memory databases. In: Proceedings of the 2021 International Conference on Management of Data, pp. 804–816 (2021)
Langendoen, K., Glasbergen, B., Daudjee, K.: NIR-Tree: A Non-Intersecting R-Tree. In: 33rd International Conference on Scientific and Statistical Database Management, pp. 157–168 (2021)
Liu, Y., Hao, T., Gong, X., Kong, D., Wang, J.: Research on hybrid index based on 3D multi-level adaptive grid and R+Tree. IEEE Access 9, 146010–146022 (2021)
Marcus, R., Negi, P., Mao, H., Tatbul, N., Alizadeh, M., Kraska, T.: Bao: Making learned query optimization practical. ACM SIGMOD Rec. 51(1), 6–13 (2022)
Marcus, R., et al.: Neo: A learned query optimizer. In: Proceeding of the VLDB Endow. 12(11), 1705–1718 (2019). https://doi.org/10.14778/3342263.3342644
Markl, V., Lohman, G.M., Raman, V.: LEO: An autonomic query optimizer for DB2. IBM Syst. J. 42(1), 98–106 (2003)
O’Neil, P., Cheng, E., Gawlick, D., O’Neil, E.: The log-structured merge-tree (LSM-tree). Acta Informatica 33(4), 351–385 (1996)
Sellami, R., Defude, B.: Complex queries optimization and evaluation over relational and NoSQL data stores in cloud environments. IEEE Trans. Big Data 4(2), 217–230 (2017)
Sprenger, S., Schäfer, P., Leser, U.: BB-Tree: A main-memory index structure for multidimensional range queries. In: 2019 IEEE 35th International Conference on Data Engineering (ICDE), pp. 1566–1569 (2019)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Karras, A., Karras, C., Samoladas, D., Giotopoulos, K.C., Sioutas, S. (2022). Query Optimization in NoSQL Databases Using an Enhanced Localized R-tree Index. In: Pardede, E., Delir Haghighi, P., Khalil, I., Kotsis, G. (eds) Information Integration and Web Intelligence. iiWAS 2022. Lecture Notes in Computer Science, vol 13635. Springer, Cham. https://doi.org/10.1007/978-3-031-21047-1_33
Download citation
DOI: https://doi.org/10.1007/978-3-031-21047-1_33
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-21046-4
Online ISBN: 978-3-031-21047-1
eBook Packages: Computer ScienceComputer Science (R0)