Skip to main content

Query Optimization in NoSQL Databases Using an Enhanced Localized R-tree Index

  • Conference paper
  • First Online:
Information Integration and Web Intelligence (iiWAS 2022)

Abstract

Query optimization is a crucial process across data mining and big data analytics. As the size of the data in the modern applications is increasing due to various sources, types and multi-modal records across databases, there is an urge to optimize lookup and search operations. Therefore, indexes can be utilized to solve the matter of rapid data growth as they enhance the performance of the database and subsequently the cloud server where it is stored. In this paper an index on spatial data, i.e. coordinates on the plane or on the map is presented. This index is be based on the R-Tree which is suitable for spatial data and is distributed so that it can scale and adapt to massive amounts of data without losing its performance. The results of the proposed method are encouraging across all experiments and future directions of this work include experiments on skewed data.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 79.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 99.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    which stores the coordinates in ascending order, beginning with the lower left corner of the rectangle and ending with the top left corner.

  2. 2.

    Even when many users execute the same query at the same time.

  3. 3.

    This indicates that there are no thick or sparse regions within the space covered within the datasets and ensures that the burden is dispersed evenly across the reducers.

  4. 4.

    The distributed index contains Terabytes of records.

  5. 5.

    Because sorting is being done by HBase, PUT functions on a lower table size are substantially quicker than on a much bigger table.

References

  1. Babcock, B., Chaudhuri, S.: Towards a robust query optimizer: a principled and practical approach. In: Proceedings of the 2005 ACM SIGMOD International Conference on Management of Data, pp. 119–130 (2005)

    Google Scholar 

  2. Dayan, N., Athanassoulis, M., Idreos, S.: Optimal bloom filters and adaptive merging for LSM-trees. ACM Trans. Database Syst (TODS) 43(4), 1–48 (2018)

    Article  MathSciNet  MATH  Google Scholar 

  3. Gu, T., Feng, K., Cong, G., Long, C., Wang, Z., Wang, S.: A Reinforcement Learning Based R-Tree for Spatial Data Indexing in Dynamic Environments. arXiv preprint arXiv:2103.04541 (2021)

  4. Haider, C., Wang, J., Aref, W.G.: The AI+ R-tree: An Instance-optimized R-tree. arXiv preprint arXiv:2207.00550 (2022)

  5. He, J., Chen, H.: An LSM-tree index for spatial data. Algorithms 15(4), 113 (2022)

    Article  Google Scholar 

  6. Izenov, Y., Datta, A., Rusu, F., Shin, J.H.: COMPASS: Online sketch-based query optimization for in-memory databases. In: Proceedings of the 2021 International Conference on Management of Data, pp. 804–816 (2021)

    Google Scholar 

  7. Langendoen, K., Glasbergen, B., Daudjee, K.: NIR-Tree: A Non-Intersecting R-Tree. In: 33rd International Conference on Scientific and Statistical Database Management, pp. 157–168 (2021)

    Google Scholar 

  8. Liu, Y., Hao, T., Gong, X., Kong, D., Wang, J.: Research on hybrid index based on 3D multi-level adaptive grid and R+Tree. IEEE Access 9, 146010–146022 (2021)

    Article  Google Scholar 

  9. Marcus, R., Negi, P., Mao, H., Tatbul, N., Alizadeh, M., Kraska, T.: Bao: Making learned query optimization practical. ACM SIGMOD Rec. 51(1), 6–13 (2022)

    Article  Google Scholar 

  10. Marcus, R., et al.: Neo: A learned query optimizer. In: Proceeding of the VLDB Endow. 12(11), 1705–1718 (2019). https://doi.org/10.14778/3342263.3342644

  11. Markl, V., Lohman, G.M., Raman, V.: LEO: An autonomic query optimizer for DB2. IBM Syst. J. 42(1), 98–106 (2003)

    Article  Google Scholar 

  12. O’Neil, P., Cheng, E., Gawlick, D., O’Neil, E.: The log-structured merge-tree (LSM-tree). Acta Informatica 33(4), 351–385 (1996)

    Article  MATH  Google Scholar 

  13. Sellami, R., Defude, B.: Complex queries optimization and evaluation over relational and NoSQL data stores in cloud environments. IEEE Trans. Big Data 4(2), 217–230 (2017)

    Article  Google Scholar 

  14. Sprenger, S., Schäfer, P., Leser, U.: BB-Tree: A main-memory index structure for multidimensional range queries. In: 2019 IEEE 35th International Conference on Data Engineering (ICDE), pp. 1566–1569 (2019)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Aristeidis Karras .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Karras, A., Karras, C., Samoladas, D., Giotopoulos, K.C., Sioutas, S. (2022). Query Optimization in NoSQL Databases Using an Enhanced Localized R-tree Index. In: Pardede, E., Delir Haghighi, P., Khalil, I., Kotsis, G. (eds) Information Integration and Web Intelligence. iiWAS 2022. Lecture Notes in Computer Science, vol 13635. Springer, Cham. https://doi.org/10.1007/978-3-031-21047-1_33

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-21047-1_33

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-21046-4

  • Online ISBN: 978-3-031-21047-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics