Abstract
Keyword search in graphs has been extensively studied in the literature. Given a keyword query, existing solutions mainly focus on finding all/top-k individual answers. Each individual answer is a subgraph/subtree that contains some structural information regarding a certain subset of nodes containing the keywords. Nevertheless, from the individually answers, it is difficult for a user to see the big picture and identify how the answers are correlated to each other. In this paper, we define a new structure, named key-core, to find cohesive subgraphs for a keyword query. Briefly speaking, a key-core is a cohesive subgraph that contains many highly correlated keyword search answers. A key-core is not only cohesive structurally, but also closely related to the user given keywords. In order to make the keyword search more flexible, we also define four key-operators, namely key-intersection, key-union, key-difference, and key-association, to manipulate the key-cores. The key-operators enable users to form complex queries and refine the queries on demand. We propose algorithms to compute the key-cores and key-operators efficiently. We conduct extensive performance studies on large real datasets to demonstrate the effectiveness and efficiency of our approach.
Similar content being viewed by others
References
Agrawal, S., Chaudhuri, S., Das, G.: Dbxplorer: A system for keyword-based search over relational databases. In: Proceedings of ICDE’02 (2002)
Bhalotia, G., Hulgeri, A., Nakhe, C., Chakrabarti, S., Sudarshan, S.: Keyword searching and browsing in databases using banks. In: Proceedings of ICDE’02 (2002)
Bicer, V., Tran, T., Nedkov, R.: Ranking support for keyword search on structured data using relevance models. In: Proceedings of CIKM’11 (2011)
Chen, Y., Wang, W., Liu, Z., Lin, X.: Keyword search on structured and semi-structured data. In: Proceedings of SIGMOD’09 (2009)
Ding, B., Yu, J.X., Wang, S., Qin, L., Zhang, X., Lin, X.: Finding top-k min-cost connected trees in databases. In: Proceedings of ICDE’07 (2007)
Fang, L., Sarma, A.D., Yu, C., Bohannon, P.: Rex: Explaining relationships between entity pairs. Proc. VLDB Endow. 5(3) (2011)
Fang, Y., Cheng, R., Luo, S., Hu, J.: Effective community search for large attributed graphs. Proc. VLDB Endow. 9(12), 1233–1244 (2016). 10.14778/2994509.2994538
He, H., Wang, H., Yang, J., Yu, P.S.: Blinks: Ranked keyword searches on graphs. In: Proceedings of SIGMOD’07 (2007)
Hristidis, V., Gravano, L., Papakonstantinou, Y.: Efficient ir-style keyword search over relational databases. In: Proceedings of VLDB’03 (2003)
Hristidis, V., Papakonstantinou, Y.: Discover: Keyword search in relational databases. In: Proceedings of VLDB’02 (2002)
Huang, Y., Liu, Z., Chen, Y.: Query biased snippet generation in xml search. In: Proceedings of SIGMOD’08 (2008)
Kacholia, V., Pandit, S., Chakrabarti, S., Sudarshan, S., Desai, R., Karambelkar, H.: Bidirectional expansion for keyword search on graph databases. In: Proceedings of VLDB’05 (2005)
Kargar, M., An, A.: Keyword search in graphs: Finding r-clique. PVLDB 4(10) (2011)
Kimelfeld, B., Sagiv, Y.: Finding and approximating top-k answers in keyword proximity search. In: Proceedingss of the Twenty-fifth ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems, PODS ’06, pp 173–182. ACM, New York (2006). https://doi.org/10.1145/1142351.1142377
Koutrika, G., Zadeh, Z.M., Garcia-Molina, H.: Data clouds: Summarizing keyword search results over structured data. In: Proceedings of EDBT’09 (2009)
Li, G., Ooi, B.C., Feng, J., Wang, J., Zhou, L.: Ease: An effective 3-in-1 keyword search method for unstructured, semi-structured and structured data. In: Proceedings of SIGMOD’08 (2008)
Li, Y., Yu, C., Jagadish, H.V.: Schema-free xquery. In: Proceedings of VLDB’04 (2004)
Liu, F., Yu, C.T., Meng, W., Chowdhury, A.: Effective keyword search in relational databases. In: Proceedings of SIGMOD’06 (2006)
Liu, Z., Chen, Y.: Return specification inference and result clustering for keyword search on xml. ACM Trans. Database Syst. 35(2) (2010)
Liu, Z., Chen, Y.: Processing keyword search on xml: A survey. World Wide Web 14(5-6) (2011)
Liu, Z., Cher, Y.: Reasoning and identifying relevant matches for xml keyword search. PVLDB 1(1) (2008)
Luo, Y., Lin, X., Wang, W., Zhou, X.: Spark: Top-k keyword query in relational databases. In: Proceedings of SIGMOD’07 (2007)
Qiao, M., Qin, L., Cheng, H., Yu, J.X., Tian, W.: Top-k nearest keyword search on large graphs. PVLDB 6(10) (2013)
Qin, L., Yu, J.X., Chang, L.: Keyword search in databases: The power of rdbms. In: Proceedings of SIGMOD’09 (2009)
Qin, L., Yu, J.X., Chang, L., Tao, Y.: Querying communities in relational databases. In: Proceedings of ICDE’09 (2009)
Tao, Y., Papadopoulos, S., Sheng, C., Stefanidis, K.: Nearest keyword search in xml documents. In: Proceedingss of SIGMOD’11 (2011)
Tao, Y., Yu, J.X.: Finding frequent co-occurring terms in relational keyword search. In: Proceedingss of EDBT’09, pp. 839–850 (2009)
Tran, T., Wang, H., Rudolph, S., Cimiano, P.: Top-K exploration of query candidates for efficient keyword search on graph-shaped (Rdf) data. In: Proceedings of ICDE’09 (2009)
Wang, H., Aggarwal, C.C.: A survey of algorithms for keyword search on graph data. In: Managing and Mining Graph Data. https://doi.org/10.1007/978-1-4419-6045-0_8, pp 249–273 (2010)
Wu, Y., Yang, S., Srivatsa, M., Iyengar, A., Yan, X.: Summarizing answer graphs induced by keyword queries. PVLDB 6(14) (2013)
Yang, M., Ding, B., Chaudhuri, S., Chakrabarti, K.: Finding patterns in a knowledge base using keywords to compose table answers. PVLDB 7(14) (2014)
Yu, J.X., Qin, L., Chang, L., Keyword Search in Databases (2009)
Zhu, Y., He, J., Ye, J., Qin, L., Huang, X., Yu, J.X.: When Structure Meets Keywords: Cohesive Attributed Community Search, pp 1913–1922. Association for Computing Machinery, New York (2020). https://doi.org/10.1145/3340531.3412006
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
This article belongs to the Topical Collection: Special Issue on Large Scale Graph Data Analytics
Guest Editors: Xuemin Lin, Lu Qin, Wenjie Zhang, and Ying Zhang
Rights and permissions
About this article
Cite this article
Zhang, Z., Yu, J.X., Wang, G. et al. Key-core: cohesive keyword subgraph exploration in large graphs. World Wide Web 25, 831–856 (2022). https://doi.org/10.1007/s11280-021-00926-y
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11280-021-00926-y