skip to main content
10.1145/3469830.3470897acmotherconferencesArticle/Chapter ViewAbstractPublication PagessstdConference Proceedingsconference-collections
research-article

A Novel Indexing Method for Spatial-Keyword Range Queries

Published:23 August 2021Publication History

ABSTRACT

Spatial-keyword queries are important for a wide range of applications that retrieve data based on a combination of keyword search and spatial constraints. However, efficient processing of spatial-keyword queries is not a trivial task because the combination of textual and spatial data results in a high-dimensional representation that is challenging to index effectively. To address this problem, in this paper, we propose a novel indexing scheme for efficient support of spatial-keyword range queries. At the heart of our approach lies a carefully-designed mapping of spatio-textual data to a two-dimensional (2D) space that produces compact partitions of spatio-textual data. In turn, the mapped 2D data can be indexed effectively by traditional spatial data structures, such as an R-tree. We propose bounds, theoretically proven for correctness, that lead to the design of a filter-and-refine algorithm that prunes the search space effectively. In this way, our approach for spatial-keyword range queries is readily applicable to any database system that provides spatial support. In our experimental evaluation, we demonstrate how our algorithm can be implemented over PostgreSQL and exploit its underlying spatial index provided by PostGIS, in order to process spatial-keyword range queries efficiently. Moreover, we show that our solution outperforms different competitor approaches.

References

  1. Sattam Alsubaiee, Alexander Behm, and Chen Li. 2010. Supporting location-based approximate-keyword queries. In 18th ACM SIGSPATIAL International Symposium on Advances in Geographic Information Systems, ACM-GIS 2010, November 3-5, 2010, San Jose, CA, USA, Proceedings. ACM, 61–70.Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Walid G. Aref and Hanan Samet. 1990. Efficient Processing of Window Queries in The Pyramid Data Structure. In Proceedings of the Ninth ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems, April 2-4, 1990, Nashville, Tennessee, USA. ACM Press, 265–272.Google ScholarGoogle Scholar
  3. Lisi Chen, Gao Cong, Christian S. Jensen, and Dingming Wu. 2013. Spatial Keyword Query Processing: An Experimental Evaluation. PVLDB 6, 3 (2013), 217–228.Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Lisi Chen, Shuo Shang, Chengcheng Yang, and Jing Li. 2020. Spatial keyword search: a survey. GeoInformatica 24, 1 (2020), 85–106.Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Yen-Yu Chen, Torsten Suel, and Alexander Markowetz. 2006. Efficient query processing in geographic web search engines. In SIGMOD Conference. 277–288.Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Zhida Chen, Lisi Chen, Gao Cong, and Christian S. Jensen. 2021. Location- and keyword-based querying of geo-textual data: A survey. VLDB Journal (2021).Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Maria Christoforaki, Jinru He, Constantinos Dimopoulos, Alexander Markowetz, and Torsten Suel. 2011. Text vs. space: efficient geo-search query processing. In CIKM. 423–432.Google ScholarGoogle Scholar
  8. Gao Cong, Christian S. Jensen, and Dingming Wu. 2009. Efficient Retrieval of the Top-k Most Relevant Spatial Web Objects. PVLDB 2, 1 (2009), 337–348.Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Ian De Felipe, Vagelis Hristidis, and Naphtali Rishe. 2008. Keyword Search on Spatial Databases. In ICDE. 656–665.Google ScholarGoogle Scholar
  10. Ramaswamy Hariharan, Bijit Hore, Chen Li, and Sharad Mehrotra. 2007. Processing Spatial-Keyword (SK) Queries in Geographic Information Retrieval (GIR) Systems. In SSDBM. 16.Google ScholarGoogle Scholar
  11. H. V. Jagadish, Beng Chin Ooi, Kian-Lee Tan, Cui Yu, and Rui Zhang. 2005. iDistance: An adaptive B+-tree based indexing method for nearest neighbor search. ACM Transactions on Database Systems 30, 2 (June 2005), 364–397.Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. George Karypis and Vipin Kumar. 1997. METIS—A Software Package for Partitioning Unstructured Graphs, Partitioning Meshes and Computing Fill-Reducing Ordering of Sparse Matrices. (01 1997).Google ScholarGoogle Scholar
  13. Taesung Lee, Jin-Woo Park, Sanghoon Lee, Seung-won Hwang, Sameh Elnikety, and Yuxiong He. 2015. Processing and Optimizing Main Memory Spatial-Keyword Queries. Proc. VLDB Endow. 9, 3 (2015), 132–143.Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Zhisheng Li, Ken C. K. Lee, Baihua Zheng, Wang-Chien Lee, Dik Lun Lee, and Xufa Wang. 2011. IR-Tree: An Efficient Index for Geographic Document Search. IEEE Trans. Knowl. Data Eng. 23, 4 (2011), 585–599.Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Youzhong Ma, Yu Zhang, and Xiaofeng Meng. 2013. ST-HBase: A Scalable Data Management System for Massive Geo-tagged Objects. In Web-Age Information Management - 14th International Conference, WAIM 2013, Beidaihe, China, June 14-16, 2013. Proceedings(Lecture Notes in Computer Science, Vol. 7923). Springer, 155–166.Google ScholarGoogle Scholar
  16. Ahmed R. Mahmood, Ahmed M. Aly, and Walid G. Aref. 2018. FAST: Frequency-Aware Indexing for Spatio-Textual Data Streams. In 34th IEEE International Conference on Data Engineering, ICDE 2018, Paris, France, April 16-19, 2018. IEEE Computer Society, 305–316.Google ScholarGoogle Scholar
  17. Ahmed R. Mahmood and Walid G. Aref. 2019. Scalable Processing of Spatial-Keyword Queries. Morgan & Claypool Publishers.Google ScholarGoogle Scholar
  18. João B. Rocha-Junior, Orestis Gkorgkas, Simon Jonassen, and Kjetil Nørvåg. 2011. Efficient Processing of Top-k Spatial Keyword Queries. In SSTD. 205–222.Google ScholarGoogle Scholar
  19. Akrivi Vlachou, Christos Doulkeridis, Nikolaos Koutroumanis, Dimitrios Poulopoulos, and Kjetil Nørvåg. 2020. The SPADES Framework for Scalable Management of Spatio-textual Data. In Proceedings of 24th Pan-Hellenic Conference on Informatics (PCI’20). ACM.Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Xiang Wang, Ying Zhang, Wenjie Zhang, Xuemin Lin, and Wei Wang. 2015. AP-Tree: Efficiently support continuous spatial-keyword queries over stream. In 31st IEEE International Conference on Data Engineering, ICDE 2015, Seoul, South Korea, April 13-17, 2015. IEEE Computer Society, 1107–1118.Google ScholarGoogle ScholarCross RefCross Ref
  21. Cui Yu, Beng Chin Ooi, Kian-Lee Tan, and H. V. Jagadish. 2001. Indexing the Distance: An Efficient Method to KNN Processing. In Proceedings of VLDB’01.Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Dongxiang Zhang, Yeow Meng Chee, Anirban Mondal, Anthony K. H. Tung, and Masaru Kitsuregawa. 2009. Keyword Search in Spatial Databases: Towards Searching by Document. In ICDE. 688–699.Google ScholarGoogle Scholar

Index Terms

  1. A Novel Indexing Method for Spatial-Keyword Range Queries
            Index terms have been assigned to the content through auto-classification.

            Recommendations

            Comments

            Login options

            Check if you have access through your login credentials or your institution to get full access on this article.

            Sign in
            • Published in

              cover image ACM Other conferences
              SSTD '21: Proceedings of the 17th International Symposium on Spatial and Temporal Databases
              August 2021
              173 pages
              ISBN:9781450384254
              DOI:10.1145/3469830

              Copyright © 2021 ACM

              Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

              Publisher

              Association for Computing Machinery

              New York, NY, United States

              Publication History

              • Published: 23 August 2021

              Permissions

              Request permissions about this article.

              Request Permissions

              Check for updates

              Qualifiers

              • research-article
              • Research
              • Refereed limited

            PDF Format

            View or Download as a PDF file.

            PDF

            eReader

            View online with eReader.

            eReader

            HTML Format

            View this article in HTML Format .

            View HTML Format