Skip to main content
Log in

On spatial keyword covering

  • Regular Paper
  • Published:
Knowledge and Information Systems Aims and scope Submit manuscript

Abstract

This article introduces and solves a spatial keyword cover problem (SK-Cover for short), which aims to identify the group of spatio-textual objects covering all the keywords in a query and minimizing a distance cost function that leads to fewer objects in the answer set. In a broad sense, SK-Cover has been actively studied in the literature of spatial keyword search, such as the m-closest keywords query and the collective spatial keyword query. However, these existing works focus on minimizing only the largest pairwise distance even though the actual spatial cost is highly influenced by the number of objects in the answer group. Motivated by this, the present article further generalizes the problem definition in such a way that the total cost takes the cardinality of the group as well as the spatial distance. We prove that SK-Cover is not only NP-hard but also does not allow an approximation better than \(O(\log {|T|})\) in polynomial time, where T is the set of query keywords. We first establish an \(O(\log {|T|})\)-approximation algorithm, which is asymptotically optimal in terms of the approximability of SK-Cover, together with effective accessing strategies and pruning rules to improve the overall efficiency and scalability. Despite the NP-hardness of SK-Cover, this article also develops exact solutions that find the optimal group of objects in a reasonably fast manner in practice, especially when it is required to cover a relatively small number of query keywords. In addition to our algorithmic results, we empirically show that our approximation algorithm always achieves the best accuracy and the efficiency comparable to that of a state-of-the-art algorithm intended for \(m\hbox {CK}\), a problem similar to yet theoretically easier than SK-Cover, and also demonstrate that our exact algorithm using the proposed approximation scheme runs much faster than the baseline algorithm adapted from the existing solution for \(m\hbox {CK}\).

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16
Fig. 17
Fig. 18
Fig. 19
Fig. 20
Fig. 21
Fig. 22

Similar content being viewed by others

Notes

  1. Although a more efficient version of the \(m\hbox {CK}\) query algorithm is recently proposed by Guo et al. [17], their exact algorithm is not trivially adapted to our problem because the key procedure, called circleScan, attempts to find the optimal (i.e., minimum) diameter without considering the cardinality, which is the unique property of \(m\hbox {CK}\) but not relevant to SK-Cover. In \(m\hbox {CK}\), a group with a smaller diameter always has a lower cost, but there will be cases where a group with a smaller diameter has a higher cost in SK-Cover. This makes circleScan less useful for our SK-Cover problem as we anyhow have to examine all the combinations of objects regardless of the minimum diameter.

  2. http://www.pocketgpsworld.com.

  3. https://foursquare.com.

References

  1. Abellanas M, Hurtado F, Icking C, Klein R, Langetepe E, Ma L, Palop B, Sacristán V (2001) The farthest color voronoi diagram and related problems. In: Abstracts 17th European workshop comput geom, pp 113–116

  2. Arge L, de Berg M, Haverkort HJ, Yi K (2008) The priority r-tree: a practically efficient and worst-case optimal r-tree. ACM Trans Algorithms 4(1):9:1–9:30

    Article  MathSciNet  Google Scholar 

  3. Cao X, Cong G, Guo T, Jensen CS, Ooi BC (2015) Efficient processing of spatial group keyword queries. ACM Trans Database Syst 40(2):13:1–13:48

    Article  MathSciNet  Google Scholar 

  4. Cao X, Cong G, Jensen CS, Ooi BC (2011) Collective spatial keyword querying. In: Proceedings of the ACM SIGMOD international conference on management of data, pp 373–384

  5. Cary A, Wolfson O, Rishe N (2010) Efficient and scalable method for processing top-k spatial boolean queries. In: Scientific and statistical database management, 22nd international conference, SSDBM, pp 87–95

  6. Chan HK, Long C, Wong RC (2017) Inherent-cost aware collective spatial keyword queries. In: Advances in spatial and temporal databases—15th international symposium, SSTD, pp 357–375

  7. Chan HK, Long C, Wong RC (2018) On generalizing collective spatial keyword queries. IEEE Trans Knowl Data Eng 30(9):1712–1726

    Article  Google Scholar 

  8. Chen H, Ku W, Sun M, Zimmermann R (2008) The multi-rule partial sequenced route query. In: 16th ACM SIGSPATIAL international symposium on advances in geographic information systems, ACM-GIS, p 10

  9. Choi D, Pei J, Lin X (2016) Finding the minimum spatial keyword cover. In: 32nd IEEE international conference on data engineering, ICDE, pp 685–696

  10. Chvátal V (1979) A greedy heuristic for the set-covering problem. Math Oper Res 4(3):233–235

    Article  MathSciNet  Google Scholar 

  11. Cong G, Jensen CS, Wu D (2009) Efficient retrieval of the top-k most relevant spatial web objects. PVLDB 2(1):337–348

    Google Scholar 

  12. Cormode G, Karloff HJ, Wirth A (2010) Set cover algorithms for very large datasets. In: Proceedings of the 19th ACM conference on information and knowledge management, CIKM, pp 479–488

  13. de Berg M, Cheong O, van Kreveld M, Overmars M (2008) Computational geometry, 3 revised edn. Springer, Berlin

    Book  Google Scholar 

  14. Deng K, Li X, Lu J, Zhou X (2015) Best keyword cover search. IEEE Trans Knowl Data Eng 27(1):61–73

    Article  Google Scholar 

  15. Felipe ID, Hristidis V, Rishe N (2008) Keyword search on spatial databases. In: Proceedings of the 24th international conference on data engineering, ICDE, pp 656–665

  16. Fleischer R, Xu X (2010) Computing minimum diameter color-spanning sets. In: Frontiers in algorithmics, 4th international workshop, FAW, pp 285–292

  17. Guo T, Cao X, Cong G (2015) Efficient algorithms for answering the m-closest keywords query. In: Proceedings of the 2015 ACM SIGMOD international conference on management of data, pp 405–418

  18. Li F, Cheng D, Hadjieleftheriou M, Kollios G, Teng S (2005) On trip planning queries in spatial databases. In: Advances in spatial and temporal databases, 9th international symposium, SSTD, pp 273–290

  19. Li J, Yang YD, Mamoulis N (2013) Optimal route queries with arbitrary order constraints. IEEE Trans Knowl Data Eng 25(5):1097–1110

    Article  Google Scholar 

  20. Li Z, Lee KCK, Zheng B, Lee W, Lee DL, Wang X (2011) Ir-tree: an efficient index for geographic document search. IEEE Trans Knowl Data Eng 23(4):585–599

    Article  Google Scholar 

  21. Long C, Wong RC, Wang K, Fu AW (2013) Collective spatial keyword queries: a distance owner-driven approach. In: Proceedings of the ACM SIGMOD international conference on management of data, pp 689–700

  22. Lund C, Yannakakis M (1994) On the hardness of approximating minimization problems. J ACM 41(5):960–981

    Article  MathSciNet  Google Scholar 

  23. Ma X, Shekhar S, Xiong H, Zhang P (2006) Exploiting a page-level upper bound for multi-type nearest neighbor queries. In: 14th ACM international symposium on geographic information systems, ACM-GIS, pp 179–186

  24. Rocha-Junior JB, Gkorgkas O, Jonassen S, Nørvåg K (2011) Efficient processing of top-k spatial keyword queries. In: Advances in spatial and temporal databases—12th international symposium, SSTD, pp 205–222

  25. Sharifzadeh M, Kolahdouzan MR, Shahabi C (2008) The optimal sequenced route query. VLDB J 17(4):765–787

    Article  Google Scholar 

  26. Tao Y, Sheng C (2014) Fast nearest neighbor search with keywords. IEEE Trans Knowl Data Eng 26(4):878–888

    Article  Google Scholar 

  27. Wu D, Yiu ML, Cong G, Jensen CS (2012) Joint top-k spatial keyword query processing. IEEE Trans Knowl Data Eng 24(10):1889–1903

    Article  Google Scholar 

  28. Zhang D, Chee YM, Mondal A, Tung AKH, Kitsuregawa M (2009) Keyword search in spatial databases: towards searching by document. In: Proceedings of the 25th international conference on data engineering, ICDE, pp 688–699

  29. Zhang D, Ooi BC, Tung AKH (2010) Locating mapped resources in web 2.0. In: Proceedings of the 26th international conference on data engineering, ICDE, pp 521–532

  30. Zhang L, Sun X, Zhuge H (2012) Density based collective spatial keyword query. In: Eighth international conference on semantics. Knowledge and grids, SKG, pp 213–216

Download references

Acknowledgements

We thank the anonymous reviewers for their valuable feedbacks and comments. This work was supported in part by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT) (No. 2018R1D1A1B07049934), in part by Institute for Information & communications Technology Promotion (IITP) Grants funded by the Korea government (MSIT) (No. 2019-0-00240, Deep Partition-and-Merge: Merging and Splitting Deep Neural Networks on Smart Embedded Devices for Real Time Inference, No. 2019-0-00064, Intelligent Mobile Edge Cloud Solution for Connected Car, and No. 2017-0-00396, Autonomic BigData Cloud Computing (\(\hbox {ABC}^2\)): Enhancing Efficiency of BigData Processing using Various Cloud Computing Resources), and in part by INHA UNIVERSITY Research Grant.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Dong-Wan Choi.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Choi, DW., Pei, J. & Lin, X. On spatial keyword covering. Knowl Inf Syst 62, 2577–2612 (2020). https://doi.org/10.1007/s10115-020-01446-3

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10115-020-01446-3

Keywords

Navigation