ABSTRACT
A corpus of recent work has revealed that the learned index can improve query performance while reducing the storage overhead. It potentially offers an opportunity to address the spatial query processing challenges caused by the surge in location-based services. Although several learned indexes have been proposed to process spatial data, the main idea behind these approaches is to utilize the existing one-dimensional learned models, which requires either converting the spatial data into one-dimensional data or applying the learned model on individual dimensions separately. As a result, these approaches cannot fully utilize or take advantage of the information regarding the spatial distribution of the original spatial data. To this end, in this paper, we exploit it by using the spatial (multi-dimensional) interpolation function as the learned model, which can be directly employed on the spatial data. Specifically, we design an efficient SPatial inteRpolation functIon based Grid index (SPRIG) to process the range and kNN queries. Detailed experiments are conducted on real-world datasets. The results indicate that, compared to the traditional spatial indexes, our proposed learned index can significantly improve the index building and query processing performance with less storage overhead. Moreover, in the best case, our index achieves up to an order of magnitude better performance than ZM-index in range queries and is about 2.7 × , 3 × , and 9 × faster than the multi-dimensional learned index Flood in terms of index building, range queries, and kNN queries, respectively.
- Abdullah Al-Mamun, Hao Wu, and Walid G Aref. 2020. A Tutorial on Learned Multi-dimensional Indexes. In SIGSPATIAL. 1–4.Google Scholar
- Francesc Aràndiga. 2016. A nonlinear algorithm for monotone piecewise bicubic interpolation. Appl. Math. Comput. 272(2016), 100–113.Google ScholarDigital Library
- Paolo Ciaccia, Marco Patella, and Pavel Zezula. 1997. M-tree: An Efficient Access Method for Similarity Search in Metric Spaces. In VLDB. Citeseer, 426–435.Google Scholar
- Angjela Davitkova, Evica Milchevski, and Sebastian Michel. 2020. The ML-Index: A Multidimensional, Learned Index for Point, Range, and Nearest-Neighbor Queries.. In EDBT. 407–410.Google Scholar
- Francesco Dell’Accio and Filomena Di Tommaso. 2016. Scattered data interpolation by Shepard’s like methods: classical results and recent advances. Dolomites Research Notes on Approximation 9, Special_Issue (2016).Google Scholar
- Jialin Ding, Vikram Nathan, Mohammad Alizadeh, and Tim Kraska. 2020. Tsunami: a learned multi-dimensional index for correlated data and skewed workloads. Proceedings of the VLDB Endowment 14, 2 (2020), 74–86.Google ScholarDigital Library
- Paolo Ferragina and Giorgio Vinciguerra. 2020. The PGM-index: a fully-dynamic compressed learned index with provable worst-case bounds. VLDB 13, 8 (2020), 1162–1175.Google ScholarDigital Library
- Alex Galakatos, Michael Markovitch, Carsten Binnig, Rodrigo Fonseca, and Tim Kraska. 2019. Fiting-tree: A data-aware index structure. In SIGMOD. 1189–1206.Google ScholarDigital Library
- Tim Kraska, Alex Beutel, Ed H Chi, Jeffrey Dean, and Neoklis Polyzotis. 2018. The case for learned index structures. In SIGMOD. 489–504.Google Scholar
- Jin Li and Andrew D Heap. 2008. A review of spatial interpolation methods for environmental scientists. (2008).Google Scholar
- Pengfei Li, Hua Lu, Qian Zheng, Long Yang, and Gang Pan. 2020. LISA: A Learned Index Structure for Spatial Data. In SIGMOD. 2119–2133.Google Scholar
- Lubos Mitas and Helena Mitasova. 1999. Spatial interpolation. Geographical information systems: principles, techniques, management and applications (1999).Google Scholar
- Donald E Myers. 1994. Spatial interpolation: an overview. Geoderma (1994).Google Scholar
- Vikram Nathan, Jialin Ding, Mohammad Alizadeh, and Tim Kraska. 2020. Learning Multi-dimensional Indexes. In SIGMOD. 985–1000.Google Scholar
- Jianzhong Qi, Guanli Liu, Christian S Jensen, and Lars Kulik. 2020. Effectively learning spatial indices. VLDB 13, 12 (2020), 2341–2354.Google ScholarDigital Library
- Robert Schaback. 1995. Error estimates and condition numbers for radial basis function interpolation. Advances in Computational Mathematics(1995), 251–264.Google Scholar
- Chuzhe Tang, Youyun Wang, Zhiyuan Dong, Gansen Hu, Zhaoguo Wang, Minjie Wang, and Haibo Chen. 2020. XIndex: a scalable learned index for multicore data storage. In SIGPLAN. 308–320.Google Scholar
- Saul A Teukolsky, Brian P Flannery, WH Press, and WT Vetterling. 1992. Numerical recipes in C. SMR 693, 1 (1992), 59–70.Google Scholar
- [19] Twitter Developer 2018. https://developer.twitter.com/en.Google Scholar
- Haixin Wang, Xiaoyi Fu, Jianliang Xu, and Hua Lu. 2019. Learned index for spatial queries. In 2019 20th IEEE MDM. IEEE, 569–574.Google Scholar
Index Terms
- SPRIG: A Learned Spatial Index for Range and kNN Queries
Recommendations
Spatial Queries Based on Learned Index
Spatial Data and IntelligenceAbstractWith the popularity of location-based services, the scale of spatial data is increasing. Spatial indexes play an important role in spatial databases, and their performance determines the efficiency of data access and query processing. Most of the ...
A learned spatial textual index for efficient keyword queries
AbstractSpatial textual indexing techniques allow one to efficiently access and process large volume of geospatial data, and recent research efforts have demonstrated that learned indexes can lead to better performance in comparison to conventional ...
The RLR-Tree: A Reinforcement Learning Based R-Tree for Spatial Data
PACMMODLearned indexes have been proposed to replace classic index structures like B-Tree with machine learning (ML) models. They require to replace both the indexes and query processing algorithms currently deployed by the databases, and such a radical ...
Comments