skip to main content
10.1145/3539618.3591983acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
short-paper
Open access

Exploiting Cluster-Skipping Inverted Index for Semantic Place Retrieval

Published: 18 July 2023 Publication History

Abstract

Semantic place retrieval aims to find the top-k place entities, which are both textually relevant and spatially close to a given query, from a knowledge graph. In this work, our contribution toward improving the efficiency of semantic place retrieval is two-fold. First, we show that by applying an ad hoc yet intuitive restriction on the depth of search on the knowledge graph, it is possible to adopt IR-tree indexing scheme [7], which has been introduced for processing spatial keyword queries, for the semantic place retrieval scenario. Secondly, as a novel solution to this problem, we adapt the idea of cluster-skipping inverted index (CS-IIS) [1, 4], which has been originally proposed for retrieval over topically clustered document collections. Our experiments show that CS-IIS is comparable to IR-tree in terms of CPU time, while it yields substantial efficiency gains in terms of I/O time during query processing.

Supplemental Material

MP4 File
Semantic place retrieval aims to find the top-k place entities, which are both textually relevant and spatially close to a given query, from a knowledge graph. In this work, our contribution toward improving the efficiency of semantic place retrieval is two-fold. First, we show that by applying an ad hoc yet intuitive restriction on the depth of search on the knowledge graph, it is possible to adopt IR- tree indexing scheme, which has been introduced for processing spatial keyword queries, for the semantic place retrieval scenario. Secondly, as a novel solution to this problem, we adapt the idea of cluster-skipping inverted index (CS-IIS), which has been originally proposed for retrieval over topically clustered document collections. Our experiments show that CS-IIS is comparable to IR-tree in terms of CPU time, while it yields substantial efficiency gains in terms of I/O time during query processing.

References

[1]
Ismail Sengor Altingovde, Engin Demir, Fazli Can, and Özgür Ulusoy. 2008a. Incremental cluster-based retrieval using compressed cluster-skipping inverted files. ACM Transactions on Information Systems (TOIS), Vol. 26, 3 (2008), 15:1--15:36.
[2]
Ismail Sengor Altingovde, Engin Demir, Fazli Can, and Özgür Ulusoy. 2008b. Site-based dynamic pruning for query processing in search engines. In Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. 861--862.
[3]
Zhi Cai, Georgios Kalamatianos, Georgios John Fakas, Nikos Mamoulis, and Dimitris Papadias. 2020. Diversified spatial keyword search on RDF data. VLDB J., Vol. 29, 5 (2020), 1171--1189.
[4]
Fazli Can, Ismail Sengor Altingovde, and Engin Demir. 2004. Efficiency and effectiveness of query processing in cluster-based retrieval. Information Systems, Vol. 29, 8 (2004), 697--717.
[5]
Lisi Chen, Gao Cong, Christian S. Jensen, and Dingming Wu. 2013. Spatial keyword query processing: An experimental evaluation. Proceedings of the VLDB Endowment, Vol. 6, 3 (2013), 217--228.
[6]
Maria Christoforaki, Jinru He, Constantinos Dimopoulos, Alexander Markowetz, and Torsten Suel. 2011. Text vs. space: efficient geo-search query processing. In Proceedings of the 20th ACM International Conference on Information and Knowledge Management (CIKM). 423--432.
[7]
Gao Cong, Christian S. Jensen, and Dingming Wu. 2009. Efficient retrieval of the top-k most relevant spatial web objects. Proceedings of the VLDB Endowment, Vol. 2, 1 (2009), 337--348.
[8]
Antonin Guttman. 1984. R-trees: A dynamic index structure for spatial searching. In Proceedings of the 1984 ACM SIGMOD International Conference on Management of Data. 47--57.
[9]
Shuo Han, Lei Zou, Jeffrey Xu Yu, and Dongyan Zhao. 2017. Keyword Search on RDF Graphs - A Query Graph Assembly Approach. In Proceedings of the 2017 ACM Conference on Information and Knowledge Management. 227--236.
[10]
Zhisheng Li, Ken C. K. Lee, Baihua Zheng, Wang-Chien Lee, Dik Lun Lee, and Xufa Wang. 2011. IR-Tree: An Efficient Index for Geographic Document Search. IEEE Trans. Knowl. Data Eng., Vol. 23, 4 (2011), 585--599.
[11]
Joel Mackenzie, Farhana M. Choudhury, and J. Shane Culpepper. 2015. Efficient location-aware web search. In Proceedings of the 20th Australasian Document Computing Symposium. 1--8.
[12]
Joel Mackenzie, Matthias Petri, and Alistair Moffat. 2021. Anytime ranking on document-ordered indexes. ACM Transactions on Information Systems (TOIS), Vol. 40, 1 (2021), 1--32.
[13]
Farzaneh Mahdisoltani, Joanna Biega, and Fabian Suchanek. 2015. Yago3: A knowledge base from multilingual wikipedias. In Proceedings of the 7th Biennial Conference on Innovative Data Systems Research (CIDR).
[14]
Jo a o B. Rocha-Junior, Orestis Gkorgkas, Simon Jonassen, and Kjetil Nørvåg. 2011. Efficient Processing of Top-k Spatial Keyword Queries. In Proceedings of the 12th International Symposium on Advances in Spatial and Temporal Databases (SSTD). 205--222.
[15]
Jieming Shi, Dingming Wu, and Nikos Mamoulis. 2016. Top-k relevant semantic place retrieval on spatial RDF data. In Proceedings of the 2016 International Conference on Management of Data (SIGMOD). 1977--1990.
[16]
Yuxuan Shi, Gong Cheng, and Evgeny Kharlamov. 2020. Keyword Search over Knowledge Graphs via Static and Dynamic Hub Labelings. In Proceedings of The Web Conference 2020. 235--245.
[17]
Thomas Pellissier Tanon, Gerhard Weikum, and Fabian M. Suchanek. 2020. YAGO 4: A Reason-able Knowledge Base. In Proceedings of the 17th International Conference on The Semantic Web. 583--596.
[18]
Yufei Tao and Cheng Sheng. 2014. Fast Nearest Neighbor Search with Keywords. IEEE Trans. Knowl. Data Eng., Vol. 26, 4 (2014), 878--888.
[19]
Subodh Vaid, Christopher B. Jones, Hideo Joho, and Mark Sanderson. 2005. Spatio-textual Indexing for Geographical Search on the Web. In Proceedings of 9th International Symposium on Advances in Spatial and Temporal Databases (SSTD). 218--235.
[20]
Sheng Wang, Zhifeng Bao, Shixun Huang, and Rui Zhang. 2018. A Unified Processing Paradigm for Interactive Location-based Web Search. In Proceedings of the 11th ACM International Conference on Web Search and Data Mining (WSDM). 601--609.
[21]
Dingming Wu, Gao Cong, and Christian S. Jensen. 2012a. A framework for efficient spatial web object retrieval. VLDB J., Vol. 21, 6 (2012), 797--822.
[22]
Dingming Wu, Can Hou, Erjia Xiao, and Christian S. Jensen. 2020a. Semantic Region Retrieval from Spatial RDF Data. In Proceedings of the 25th International Conference on Database Systems for Advanced Applications (DASFAA). 415--431.
[23]
Dingming Wu, Man Lung Yiu, Gao Cong, and Christian S. Jensen. 2012b. Joint Top-K Spatial Keyword Query Processing. IEEE Trans. Knowl. Data Eng., Vol. 24, 10 (2012), 1889--1903.
[24]
Dingming Wu, Hao Zhou, Jieming Shi, and Nikos Mamoulis. 2020b. Top-k relevant semantic place retrieval on spatiotemporal RDF data. The VLDB Journal, Vol. 29, 4 (2020), 893--917.
[25]
YAGO 3. 2022. https://yago-knowledge.org/downloads/yago-3. Accessed: 2022-

Index Terms

  1. Exploiting Cluster-Skipping Inverted Index for Semantic Place Retrieval

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    SIGIR '23: Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval
    July 2023
    3567 pages
    ISBN:9781450394086
    DOI:10.1145/3539618
    This work is licensed under a Creative Commons Attribution International 4.0 License.

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 18 July 2023

    Check for updates

    Author Tags

    1. location-based search
    2. spatial keyword query processing

    Qualifiers

    • Short-paper

    Conference

    SIGIR '23
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 792 of 3,983 submissions, 20%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 218
      Total Downloads
    • Downloads (Last 12 months)123
    • Downloads (Last 6 weeks)11
    Reflects downloads up to 05 Mar 2025

    Other Metrics

    Citations

    View Options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Login options

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media