Skip to main content
Log in

Key-core: cohesive keyword subgraph exploration in large graphs

  • Published:
World Wide Web Aims and scope Submit manuscript

Abstract

Keyword search in graphs has been extensively studied in the literature. Given a keyword query, existing solutions mainly focus on finding all/top-k individual answers. Each individual answer is a subgraph/subtree that contains some structural information regarding a certain subset of nodes containing the keywords. Nevertheless, from the individually answers, it is difficult for a user to see the big picture and identify how the answers are correlated to each other. In this paper, we define a new structure, named key-core, to find cohesive subgraphs for a keyword query. Briefly speaking, a key-core is a cohesive subgraph that contains many highly correlated keyword search answers. A key-core is not only cohesive structurally, but also closely related to the user given keywords. In order to make the keyword search more flexible, we also define four key-operators, namely key-intersection, key-union, key-difference, and key-association, to manipulate the key-cores. The key-operators enable users to form complex queries and refine the queries on demand. We propose algorithms to compute the key-cores and key-operators efficiently. We conduct extensive performance studies on large real datasets to demonstrate the effectiveness and efficiency of our approach.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8
Figure 9
Figure 10
Figure 11
Figure 12

Similar content being viewed by others

Notes

  1. http://wiki.dbpedia.org/about

References

  1. Agrawal, S., Chaudhuri, S., Das, G.: Dbxplorer: A system for keyword-based search over relational databases. In: Proceedings of ICDE’02 (2002)

  2. Bhalotia, G., Hulgeri, A., Nakhe, C., Chakrabarti, S., Sudarshan, S.: Keyword searching and browsing in databases using banks. In: Proceedings of ICDE’02 (2002)

  3. Bicer, V., Tran, T., Nedkov, R.: Ranking support for keyword search on structured data using relevance models. In: Proceedings of CIKM’11 (2011)

  4. Chen, Y., Wang, W., Liu, Z., Lin, X.: Keyword search on structured and semi-structured data. In: Proceedings of SIGMOD’09 (2009)

  5. Ding, B., Yu, J.X., Wang, S., Qin, L., Zhang, X., Lin, X.: Finding top-k min-cost connected trees in databases. In: Proceedings of ICDE’07 (2007)

  6. Fang, L., Sarma, A.D., Yu, C., Bohannon, P.: Rex: Explaining relationships between entity pairs. Proc. VLDB Endow. 5(3) (2011)

  7. Fang, Y., Cheng, R., Luo, S., Hu, J.: Effective community search for large attributed graphs. Proc. VLDB Endow. 9(12), 1233–1244 (2016). 10.14778/2994509.2994538

    Article  Google Scholar 

  8. He, H., Wang, H., Yang, J., Yu, P.S.: Blinks: Ranked keyword searches on graphs. In: Proceedings of SIGMOD’07 (2007)

  9. Hristidis, V., Gravano, L., Papakonstantinou, Y.: Efficient ir-style keyword search over relational databases. In: Proceedings of VLDB’03 (2003)

  10. Hristidis, V., Papakonstantinou, Y.: Discover: Keyword search in relational databases. In: Proceedings of VLDB’02 (2002)

  11. Huang, Y., Liu, Z., Chen, Y.: Query biased snippet generation in xml search. In: Proceedings of SIGMOD’08 (2008)

  12. Kacholia, V., Pandit, S., Chakrabarti, S., Sudarshan, S., Desai, R., Karambelkar, H.: Bidirectional expansion for keyword search on graph databases. In: Proceedings of VLDB’05 (2005)

  13. Kargar, M., An, A.: Keyword search in graphs: Finding r-clique. PVLDB 4(10) (2011)

  14. Kimelfeld, B., Sagiv, Y.: Finding and approximating top-k answers in keyword proximity search. In: Proceedingss of the Twenty-fifth ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems, PODS ’06, pp 173–182. ACM, New York (2006). https://doi.org/10.1145/1142351.1142377

  15. Koutrika, G., Zadeh, Z.M., Garcia-Molina, H.: Data clouds: Summarizing keyword search results over structured data. In: Proceedings of EDBT’09 (2009)

  16. Li, G., Ooi, B.C., Feng, J., Wang, J., Zhou, L.: Ease: An effective 3-in-1 keyword search method for unstructured, semi-structured and structured data. In: Proceedings of SIGMOD’08 (2008)

  17. Li, Y., Yu, C., Jagadish, H.V.: Schema-free xquery. In: Proceedings of VLDB’04 (2004)

  18. Liu, F., Yu, C.T., Meng, W., Chowdhury, A.: Effective keyword search in relational databases. In: Proceedings of SIGMOD’06 (2006)

  19. Liu, Z., Chen, Y.: Return specification inference and result clustering for keyword search on xml. ACM Trans. Database Syst. 35(2) (2010)

  20. Liu, Z., Chen, Y.: Processing keyword search on xml: A survey. World Wide Web 14(5-6) (2011)

  21. Liu, Z., Cher, Y.: Reasoning and identifying relevant matches for xml keyword search. PVLDB 1(1) (2008)

  22. Luo, Y., Lin, X., Wang, W., Zhou, X.: Spark: Top-k keyword query in relational databases. In: Proceedings of SIGMOD’07 (2007)

  23. Qiao, M., Qin, L., Cheng, H., Yu, J.X., Tian, W.: Top-k nearest keyword search on large graphs. PVLDB 6(10) (2013)

  24. Qin, L., Yu, J.X., Chang, L.: Keyword search in databases: The power of rdbms. In: Proceedings of SIGMOD’09 (2009)

  25. Qin, L., Yu, J.X., Chang, L., Tao, Y.: Querying communities in relational databases. In: Proceedings of ICDE’09 (2009)

  26. Tao, Y., Papadopoulos, S., Sheng, C., Stefanidis, K.: Nearest keyword search in xml documents. In: Proceedingss of SIGMOD’11 (2011)

  27. Tao, Y., Yu, J.X.: Finding frequent co-occurring terms in relational keyword search. In: Proceedingss of EDBT’09, pp. 839–850 (2009)

  28. Tran, T., Wang, H., Rudolph, S., Cimiano, P.: Top-K exploration of query candidates for efficient keyword search on graph-shaped (Rdf) data. In: Proceedings of ICDE’09 (2009)

  29. Wang, H., Aggarwal, C.C.: A survey of algorithms for keyword search on graph data. In: Managing and Mining Graph Data. https://doi.org/10.1007/978-1-4419-6045-0_8, pp 249–273 (2010)

  30. Wu, Y., Yang, S., Srivatsa, M., Iyengar, A., Yan, X.: Summarizing answer graphs induced by keyword queries. PVLDB 6(14) (2013)

  31. Yang, M., Ding, B., Chaudhuri, S., Chakrabarti, K.: Finding patterns in a knowledge base using keywords to compose table answers. PVLDB 7(14) (2014)

  32. Yu, J.X., Qin, L., Chang, L., Keyword Search in Databases (2009)

  33. Zhu, Y., He, J., Ye, J., Qin, L., Huang, X., Yu, J.X.: When Structure Meets Keywords: Cohesive Attributed Community Search, pp 1913–1922. Association for Computing Machinery, New York (2020). https://doi.org/10.1145/3340531.3412006

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Zhiwei Zhang.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This article belongs to the Topical Collection: Special Issue on Large Scale Graph Data Analytics

Guest Editors: Xuemin Lin, Lu Qin, Wenjie Zhang, and Ying Zhang

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zhang, Z., Yu, J.X., Wang, G. et al. Key-core: cohesive keyword subgraph exploration in large graphs. World Wide Web 25, 831–856 (2022). https://doi.org/10.1007/s11280-021-00926-y

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11280-021-00926-y

Keywords

Navigation