Abstract
Real-time processing of spatial keyword queries has been playing an indispensable role in location-based services. In this light, we propose and study a novel problem of processing continuous spatial keyword queries over geo-textual data streams. We define a new location-based continuously query that enable users to define personalized spatial requirement and textual requirement. Each query continuously feeds users with geo-textual objects that satisfy both spatial and textual requirements set by the query. To process massive-scale continuous spatial keyword queries efficiently, we develop a Continuous Spatial Keyword Query Matching (CSKQM) framework that takes a stream of queries as input and applies hierarchical dynamic grid cells to index each batch of queries. We also propose effective index update algorithm and efficient geo-textual object matching algorithm to process massive-scale continuous spatial keyword queries simultaneously over a stream of geo-textual objects. We conduct comprehensive experimental study on two real datasets to verify the performance of the CSKQM framework.






Similar content being viewed by others
Data Availability
Not applicable.
References
Shraer, A., Gurevich, M., Fontoura, M., Josifovski, V.: Top-k publish-subscribe for social annotation of news. PVLDB 6(6), 385–396 (2013)
Chen, L.,Cong, G., Cao, X., Tan, K.-L.: Temporal spatial-keyword top-k publish/subscribe. In ICDE, pp 255–266. (2015)
Felipe, I.D., Hristidis, V., Rishe, N.: Keyword search on spatial databases. In ICDE, pp. 656–665. (2008)
Cong, G., Jensen, C.S., Wu, D.: Efficient retrieval of the top-k most relevant spatial web objects. In PVLDB, pp. 337–348. (2009)
Rocha-Junior, J.B., Gkorgkas, O., Jonassen S., Nørv˚ag, K.: Efficient processing of top-k spatial keyword queries. In SSTD, pp. 205–222. (2011)
Zhang,D., Tan, K.-L., Tung, A.K.H.: Scalable top-k spatial keyword search. In EDBT, pp. 359–370. (2013)
Zhang,C., Zhang, Y., Zhang, W., Lin, X.: Inverted linear quadtree: Efficient top k spatial keyword search. In ICDE, pp. 901–912. (2013)
Wu, D., Yiu, M.L., Jensen, C.S., Cong, G.: Efficient continuously moving top-k spatial keyword query processing. In ICDE, pp. 541–552. (2011)
Yang, C., Chen, L., Shang, S., Zhu, F., Liu, L., Shao, L.: Toward efficient navigation of massive-scale geo-textual streams. In IJCAI, pp. 4838–4845. (2019)
Li, M., Chen, L., Cong, G., Gu, Y., Yu, G.: Efficient processing of location-aware group preference queries. In CIKM, pp. 559–568. ACM (2016)
Kalamatianos, G., Fakas, G.J., Mamoulis, N.: Proportionality in spatial keyword search. In SIGMOD ’21: International Conference on Management of Data, Virtual Event, China, June 20–25, 2021, pp. 885–897. ACM (2021)
Jiajie, Xu., Sun, J., Zhou, R., Liu, C., Yin, L.: CISK: an interactive framework for conceptual inference based spatial keyword query. Neurocomputing 428, 368–375 (2021)
Chen, X., Jiajie, Xu., Zhou, R., Zhao, P., Liu, C., Fang, J., Zhao, L.: S2r-tree: a pivot-based indexing structure for semantic-aware spatial keyword search. GeoInformatica 24(1), 3–25 (2020)
Chen, L., Cong, G., Jensen, C.S., Wu, D.: Spatial keyword query processing: an experimental evaluation. In PVLDB, pp. 217–228. (2013)
Chen, L., Shang, S., Yang, C., Li, J.: Spatial keyword search: a survey. GeoInformatica 24(1), 85–106 (2020)
Chen, Z., Chen, L., Cong, G., Jensen, C.S.: Location- and keyword- based querying of geo-textual data: a survey. VLDB J. 30(4), 603–640 (2021)
Zhang, C., Han, J., Shou, L., Jiajun, Lu., La Porta, T.: Splitter: Mining fine-grained sequential patterns in semantic trajectories. Proc VLDB Endow 7(9), 769–780 (2014)
Renhe, J., Jing, Z., Tingting, D., Yoshiharu, I., Chuan, X., Yuya, S.: A density-based approach for mining movement patterns from semantic trajectories. In TENCON 2015–2015 IEEE Region 10 Conference, pp. 1–6. IEEE (2015)
Shan, Z., Sun, W., Zheng, B.: Extract human mobility patterns powered by city semantic diagram. IEEE Transactions on Knowledge and Data Engineering, (2020)
Pripuˇzi´c, K., Zarko, I.P., Aberer, K.: Top-k/w publish/subscribe: Finding k most relevant publications in sliding time window w. DEBS, pp. 127–138. (2008)
Haghani, P., Michel, S., Aberer, K.: Evaluating top-k queries over incomplete data streams. In CIKM, pp. 877–886. (2009)
Haghani, P., Michel, S., Aberer, K.: The gist of everything new: Personalized top-k processing over web 2.0 streams. In CIKM, pp. 489–498. (2010)
Chen, L., Cong, G.: Diversity-aware top-k publish/subscribe for text stream. In SIGMOD, p. 347–362. (2015)
Machanavajjhala, A., Vee, E., Garofalakis, M., Shanmugasundaram, J.: Scalable ranked publish/subscribe. PVLDB 1(1), 451–462 (2008)
Li,G., Wang, Y., Wang, T., Feng, J.: Location-aware publish/subscribe. In KDD, pp. 802–810. (2013)
Chen, L., Cong, G., Cao, X.: An efficient query indexing mechanism for filtering geo-textual data. In SIGMOD, pp. 749–760. (2013)
Chen, L., Cui, Y., Cong, G., Cao, X.: SOPS: A system for efficient processing of spatial-keyword publish/subscribe. PVLDB 7(13), 1601–1604 (2014)
Chen, L., Shang, S., Jensen, C.S., Jianliang, Xu., Kalnis, P., Yao, B., Shao, L.: Top-k term publish/subscribe for geo-textual data streams. VLDB J 29(5), 1101–1128 (2020)
Chen, L., Shang, S.: Approximate spatio-temporal top-k publish/subscribe. World Wide Web 22(5), 2153–2175 (2019)
Chen, L., Shang, S., Zhang, Z., Cao, X., Jensen, C.S., Kalnis, P.: Location-aware top-k term publish/subscribe. In ICDE, pp. 749–760. IEEE Computer Society (2018)
Chen, Z., Cong, G., Zhang, Z., Fu, T.Z.J., Chen, L.: Distributed publish/subscribe query processing on the spatio-textual data stream. In ICDE, pp. 1095–1106. IEEE Computer Society (2017)
Wang, X., Zhang, W., Zhang, Y., Lin, X., Huang, Z.: Top-k spatial-keyword publish/subscribe over sliding window. VLDB J. 26(3), 301–326 (2017)
Wang, X., Zhang, Y., Zhang, W., Lin, X., Wang, W.: Ap-tree: efficiently support location-aware publish/subscribe. VLDB J. 24(6), 823–848 (2015)
Acknowledgements
The work is supported by the National Natural Science Foundation of China (Grant No. 61972077), LiaoNing Revitalization Talents Program (Grant No. XLYC2007079), the Science and Technology Plan Project of Shen Fu Reform and Innovation demonstration Zone in 2021 (Big Data Deep Analysis Platform for New Energy Vehicles).
Funding
The work is supported by the National Natural Science Foundation of China (Grant No. 61972077), LiaoNing Revitalization Talents Program (Grant No. XLYC2007079), the Science and Technology Plan Project of Shen Fu Reform and Innovation demonstration Zone in 2021 (Big Data Deep Analysis Platform for New Energy Vehicles).
Author information
Authors and Affiliations
Contributions
Hongwei Liu: Algorithm design and development, and paper writing
Yongjiao Sun: Experimental study
Guoren Wang: Algorithm design, and paper proofreading
All authors reviewed the manuscript.
Corresponding authors
Ethics declarations
Ethical approval and consent to participate
Not applicable.
Human and animal ethics
Not applicable.
Consent for publication
Not applicable.
Competing interests
The authors declare that they have no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Liu, H., Sun, Y. & Wang, G. Continuous spatial keyword query processing over geo-textual data streams. World Wide Web 26, 889–903 (2023). https://doi.org/10.1007/s11280-022-01062-x
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11280-022-01062-x