Abstract
Multi-objective spatial keyword query aims to find a set of objects that are reasonably distributed in spatial, with all query objectives to be satisfied. However, existing approaches mainly take the coverage of query keywords into account, while leaving the semantics of the textual data to be largely ignored. This limits us to return those rational results that are synonyms but morphologically different. To address this problem, this paper studies the problem of multi-objective spatial keyword query with semantics, and targets to return the object set that is optimum regarding to both spatial proximity and semantic relevance. Specifically, we take advantage of the probabilistic topic model and locality sensitive hashing (LSH), so that all query objectives can be satisfied in terms of their semantics. Afterwards, a novel indexing structure called LIR-tree is designed to integrate the spatial and semantic information of all objects in a balanced way. On top of the LIR-tree, we further propose a distance-owner based query processing algorithm, which provides tight bounds to achieve superb pruning effect in the searching phase. To speed up the processing, a distance owners based replacement strategy can be used to conduct approximate querying more efficiently. Empirical study based on a real dataset demonstrates the good effectiveness and efficiency of our proposed algorithms.
Similar content being viewed by others
References
Barbieri, D.F., Braga, D., Ceri, S., Della Valle, E., Grossniklaus, M.: C-sparql: Sparql for continuous querying. In: Proceedings of the 18th International Conference on World Wide Web, pp. 1061–1062. ACM (2009)
Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent Dirichlet allocation. J. Mach. Learn. Res. 3(Jan), 993–1022 (2003)
Buhler, J.: Efficient large-scale sequence comparison by locality-sensitive hashing. Bioinformatics 17(5), 419–428 (2001)
Cao, X., Cong, G., Jensen, C.S.: Retrieving top-k prestige-based relevant spatial web objects. Proc. VLDB Endow. 3(1–2), 373–384 (2010)
Cao, X., Cong, G., Jensen, C.S., Ooi, B.C.: Collective spatial keyword querying. In: Proceedings of the 2011 ACM SIGMOD International Conference on Management of Data, pp. 373–384. ACM (2011)
Chen, G., Zhao, J., Gao, Y., Chen, L., Chen, R.: Time-aware Boolean spatial keyword queries. IEEE Trans. Knowl. Data Eng. 29(11), 2601–2614 (2017)
Chen, J., Xu, J., Liu, C., Li, Z., Liu, A., Ding, Z.: Multi-objective spatial keyword query with semantics. In: International Conference on Database Systems for Advanced Applications, pp. 34–48. Springer (2017)
Chen, L., Cui, Y., Cong, G., Cao, X.: SOPS: a system for efficient processing of spatial-keyword publish/subscribe. PVLDB 7(13), 1601–1604 (2014)
Chen, L., Shang, S., Yang, C., Li, J.: Spatial keyword search: a survey. Geoinformatica (2019). https://doi.org/10.1007/s10707-019-00373-y
Chen, L., Shang, S., Zhang, Z., Cao, X., Jensen, C.S., Kalnis, P.: Location-aware top-k term publish/subscribe. In: 34th IEEE International Conference on Data Engineering, pp. 749–760 (2018)
Chen, X., Xu, J., Zhou, R., Zhao, P., Liu, C., Fang, J., Zhao, L.: S2R-tree: a pivot-based indexing structure for semantic-aware spatial keyword search. GeoInformatica (2019). https://doi.org/10.1007/s10707-019-00372-z
Chen, Y.Y., Suel, T., Markowetz, A.: Efficient query processing in geographic web search engines. In: Proceedings of the 2006 ACM SIGMOD International Conference on Management of Data, pp. 277–288. ACM (2006)
Cong, G., Jensen, C.S., Wu, D.: Efficient retrieval of the top-k most relevant spatial web objects. Proc. VLDB Endow. 2(1), 337–348 (2009)
Dai, J., Liu, C., Xu, J., Ding, Z.: On personalized and sequenced route planning. World Wide Web J. 19(4), 679–705 (2016)
Datar, M., Immorlica, N., Indyk, P., Mirrokni, V.S.: Locality-sensitive hashing scheme based on p-stable distributions. In: Proceedings of the 20th ACM Symposium on Computational Geometry, pp. 253–262. ACM (2004)
De Felipe, I., Hristidis, V., Rishe, N.: Keyword search on spatial databases. In: Proceedings of the 24th International Conference on Data Engineering, pp. 656–665. IEEE (2008)
Ding, Z., Xu, J., Yang, Q.: Seaclouddm: a database cluster framework for managing and querying massive heterogeneous sensor sampling data. J. Supercomput. 66(3), 1260–1284 (2013)
Fang, Y., Cheng, R., Cong, G., Mamoulis, N., Li, Y.: On spatial pattern matching. In: 34th IEEE International Conference on Data Engineering, pp. 293–304. IEEE (2018)
Gao, Y., Zhao, J., Zheng, B., Chen, G.: Efficient collective spatial keyword query processing on road networks. IEEE Trans. Intell. Transp. Syst. 17(2), 469–480 (2016)
Hu, H., Liu, Y., Li, G., Feng, J., Tan, K.L.: A location-aware publish/subscribe framework for parameterized spatio-textual subscriptions. In: 31st IEEE International Conference on Data Engineering, pp. 711–722. IEEE (2015)
Indyk, P., Motwani, R.: Approximate nearest neighbors: towards removing the curse of dimensionality. In: Proceedings of the Thirtieth Annual ACM Symposium on the Theory of Computing, pp. 604–613. ACM (1998)
Jiang, H., Zhao, P., Sheng, V.S., Xu, J., Liu, A., Wu, J., Cui, Z.: An efficient location-aware top-k subscription matching for publish/subscribe with boolean expressions. In: International Conference on Database Systems for Advanced Applications, pp. 335–350. Springer (2016)
Jin, J., Szekely, P.: Interactive querying of temporal data using a comic strip metaphor. In: 2010 IEEE Symposium on Visual Analytics Science and Technology, pp. 163–170. IEEE (2010)
Li, F., Yao, B., Tang, M., Hadjieleftheriou, M.: Spatial approximate string search. IEEE Trans. Knowl. Data Eng. 25(6), 1394–1409 (2013)
Li, G., Wang, Y., Wang, T., Feng, J.: Location-aware publish/subscribe. In: Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 802–810. ACM (2013)
Liu, H., Xu, J., Zheng, K., Liu, C., Du, L., Wu, X.: Semantic-aware query processing for activity trajectories. In: Proceedings of the 2017 ACM International Conference on Web Search and Data Mining, pp. 283–292 (2017)
Long, C., Wong, R.C.W., Wang, K., Fu, A.W.C.: Collective spatial keyword queries: a distance owner-driven approach. In: Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data, pp. 689–700. ACM (2013)
Mahmood, A., Aref, W.G.: Query processing techniques for big spatial-keyword data. In: Proceedings of the 2017 ACM International Conference on Management of Data, pp. 1777–1782. ACM (2017)
Qian, Z., Xu, J., Zheng, K., Sun, W., Li, Z., Guo, H.: On efficient spatial keyword querying with semantics. In: International Conference on Database Systems for Advanced Applications, pp. 149–164. Springer (2016)
Qian, Z., Xu, J., Zheng, K., Zhao, P., Zhou, X.: Semantic-aware top-k spatial keyword queries. World Wide Web J. 21(3), 573–594 (2018)
Rocha-Junior, J.B., Gkorgkas, O., Jonassen, S., Nørvåg, K.: Efficient processing of top-k spatial keyword queries. In: International Symposium on Spatial and Temporal Databases, pp. 205–222. Springer (2011)
Slaney, M., Casey, M.: Locality-sensitive hashing for finding nearest neighbors [lecture notes]. IEEE Signal Process. Mag. 25(2), 128–131 (2008)
Song, X., Xu, J., Zhou, R., Liu, C., Zheng, K., Zhao, P., Falkner, N.: Collective spatial keyword search on activity trajectories. GeoInformatica (2019). https://doi.org/10.1007/s10707-019-00358-x
Sun, J., Xu, J., Zheng, K., Liu, C.: Interactive spatial keyword querying with semantics. In: Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, pp. 1727–1736 (2017)
Tran, Q.T., Chan, C.Y.: How to conquer why-not questions. In: Proceedings of the 2010 ACM SIGMOD International Conference on Management of Data, pp. 15–26. ACM (2010)
Wang, K., Cao, X., Lin, X., Zhang, W., Qin, L.: Efficient computing of radius-bounded k-cores. In: 34th IEEE International Conference on Data Engineering, pp. 233–244. IEEE (2018)
Xu, J., Chen, J., Zhou, R., Fang, J., Liu, C.: On workflow aware location-based service composition for personal trip planning. Future Gener. Comput. Syst. 98, 274–285 (2019)
Yao, B., Li, F., Hadjieleftheriou, M., Hou, K.: Approximate string search in spatial databases. In: Proceedings of the 26th International Conference on Data Engineering, pp. 545–556. IEEE (2010)
Zhang, C., Zhang, Y., Zhang, W., Lin, X.: Inverted linear quadtree: efficient top k spatial keyword search. IEEE Trans. Knowl. Data Eng. 28(7), 1706–1721 (2016)
Zheng, B., Yuan, N.J., Zheng, K., Xie, X., Sadiq, S., Zhou, X.: Approximate keyword search in semantic trajectory database. In: 31st IEEE International Conference on Data Engineering, pp. 975–986. IEEE (2015)
Zheng, K., Huang, Z., Zhou, A., Zhou, X.: Discovering the most influential sites over uncertain data: a rank-based approach. IEEE Trans. Knowl. Data Eng. 24(12), 2156–2169 (2012)
Zheng, K., Su, H., Zheng, B., Shang, S., Xu, J., Liu, J., Zhou, X.: Interactive top-k spatial keyword queries. In: 31st IEEE International Conference on Data Engineering, pp. 423–434. IEEE (2015)
Acknowledgements
This work was partially supported by the National Key Research and Development Program of China (No. 2018YFB2100400), and the National Science Foundation of China (Nos. 61872258, 61572335, 61872100).
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Xu, J., Chen, J. & Yin, L. Multi-objective spatial keyword query with semantics: a distance-owner based approach. Distrib Parallel Databases 38, 625–647 (2020). https://doi.org/10.1007/s10619-020-07283-1
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10619-020-07283-1