skip to main content
10.1145/3397536.3422225acmconferencesArticle/Chapter ViewAbstractPublication PagesgisConference Proceedingsconference-collections
poster

Optimizing Continuous kNN Queries over Large-Scale Spatial-Textual Data Streams

Published: 13 November 2020 Publication History

Abstract

The continuous k-Nearest Neighbor queries over spatial-textual data streams (abbr. CkQST) retrieve and continuously monitor at most k nearest neighbor (abbr. kNN) objects to the user-specified location containing all the user-specified keywords, which is the core operation of numerous location-based publish/subscribe systems. Such a system is usually subscribed with a massive number of CkQST and evaluated simultaneously whenever new objects are incoming and old objects are expiring. The approach to evaluating CkQST is to construct a spatial-textual hybrid index for subscribed queries and matching the incoming objects utilizing the filtering capabilities of the index. For CkQST, the minimal spatial search range covering kNN objects changes frequently with the arrival and expiration of qualified objects, and the cost of updating the index is prohibitively high. To efficiently evaluate CkQST, we extend Quad-tree with an inverted index, and exploit it with three techniques, i.e. a memory-based cost model, a block-based ordered inverted index and an adaptive insertion strategy. The experiments on comprehensive datasets demonstrate the effectiveness and efficiency of our proposed techniques.

References

[1]
Lisi Chen, Gao Cong and Xin Cao. 2013. An efficient query indexing mechanism for filtering geo-textual data. In Proceedings of the 40th. ACM SIGMOD International Conference on Management of Data (SIGMOD '13). ACM Press, New York, NY, 749--760.
[2]
Guoliang Li, Yang Wang, Ting Wang, Jianhua Feng. 2013. Location-aware publish/subscribe. In Proceedings of the 19th. ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (SIGKDD '13). Chicago, USA, 802--810. https://doi.org/10.1145/2487575.2487617.
[3]
Xiang Wang, Ying Zhang, Wenjie Zhang, Xuemin Lin, and Wei Wang. 2015. AP-Tree: Efficiently support location-aware publish/subscribe. The VLDB Journal, 24(6): 823--848.
[4]
Ze Deng, Meng Wang, Lizhe Wang, Xiaohui Huang, Wei Han, Junde Chu and Albert Y. Zomaya. An efficient indexing approach for continuous spatial approximate keyword queries over geo-textual streaming data. International Journal of Geo-Information, 2019, 8(2):57--76.
[5]
Long Guo, Dongxiang Zhang, Guoliang Li, Kian-Lee Tan and Zhifeng Bao. 2015. Location-aware pub/sub system: when continuous moving queries meet dynamic event streams. In Proceedings of the 42nd. ACM SIGMOD International Conference on Management of Data (SIGMOD '15). ACM Press, Melbourne, Australia, 843--857.
[6]
Ahmed R. Mahmood, Ahmed M. Aly, Walid G. Aref. 2018. FAST: Frequency-Aware Indexing for Spatio-Textual Data Streams. In Proceedings of the 34th. IEEE International Conference on Data Engineering (ICDE '18). IEEE Press, Paris, France, 305--316.
[7]
Huiqi Hu, Yiqun Liu, Guoliang Li, Jianhua Feng. 2015. A location-aware publish/subscribe framework for parameterized spatio-textual subscriptions. In Proceedings of the 31st. IEEE International Conference on Data Engineering (ICDE '15). IEEE Press, Seoul, South Korea, 711--722.
[8]
Lisi Chen, Gao Cong, Xin Cao, Kian-Lee Tan. 2015. Temporal spatial-keyword top-k publish/subscribe. In Proceedings of the 31st. IEEE International Conference on Data Engineering (ICDE '15). IEEE Press, Seoul, South Korea, 255--266.
[9]
Lisi Chen, Shuo Shang. Approximate spatio-temporal top-k publish/subscribe. 2019. World Wide Web, 22(5): 2153--2175. https://doi.org/10.1007/s11280-018-0564-3.
[10]
Wang X, Zhang WJ, Zhang Y, Lin XM, Huang ZF. Top-k spatial-keyword publish/subscribe over sliding window. The VLDB Journal, 2017, 26(3): 301--326.
[11]
Zhida Chen, Gao Cong, Zhenjie Zhang, Tom Z. J. Fu and Lisi Chen. 2017. Distributed Publish/Subscribe Query Processing on the Spatio-Textual Data Stream. In Proceedings of the 33rd. IEEE International Conference on Data Engineering (ICDE '17). IEEE Press, San Diego, USA, 1095--1106.
[12]
Ahmed Mahmood, Anas Daghistani, Ahmed M. Aly, Mingjie Tang. 2018. Adaptive processing of spatial-keyword data over a distributed streaming cluster. In Proceedings of the 21st. ACM International Conference on Advances in Geographic Information Systems (SIGSPATIAL '18). ACM Press, Seattle, USA, 219--228. 10.1145/3274895.3274932.
[13]
Christian BÖhm, Beng Chin Ooi, Claudia Plant, and Ying Yan. 2007. Efficiently processing continuous k-NN queries on data streams. In Proceedings of the 23rd. IEEE International Conference on Data Engineering (ICDE '07). IEEE Press, Istanbul, Turkey, 156--165.
[14]
Xiaopeng Xiong, Mohamed F. Mokbel, Walid G. Aref. 2005. SEA-CNN: Scalable processing of continuous k-nn Queries in spatio-temporal databases. In Proceedings of the 21st. IEEE International Conference on Data Engineering (ICDE '05). IEEE Press, Tokyo, Japan, 643--654.
[15]
Xiaohui Yu, K.Q. Pu, N. Koudas. 2005. Monitoring k-nearest neighbor queries over moving objects. In Proceedings of the 21st. IEEE International Conference on Data Engineering (ICDE '05). IEEE Press, Tokyo, Japan, 631--642.

Cited By

View all

Index Terms

  1. Optimizing Continuous kNN Queries over Large-Scale Spatial-Textual Data Streams

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    SIGSPATIAL '20: Proceedings of the 28th International Conference on Advances in Geographic Information Systems
    November 2020
    687 pages
    ISBN:9781450380195
    DOI:10.1145/3397536
    Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 13 November 2020

    Check for updates

    Author Tags

    1. Continuous queries
    2. Data Streams
    3. Nearest neighbor query
    4. Spatial-textual queries

    Qualifiers

    • Poster
    • Research
    • Refereed limited

    Funding Sources

    • National Natural Science Foundation of China

    Conference

    SIGSPATIAL '20
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 257 of 1,238 submissions, 21%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)5
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 03 Mar 2025

    Other Metrics

    Citations

    Cited By

    View all

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media