Abstract
Keyword query processing over graph structured data is beneficial across various real world applications. The basic unit, of search and retrieval, in keyword search over graph, is a structure (interconnection of nodes) that connects all the query keywords. This new answering paradigm, in contrast to single web page results given by search engines, brings forth new challenges for ranking. In this paper, we propose a simple but effective Fuzzy set theory based Ranking measure, called FRank. Fuzzy sets acknowledge the contribution of each individual query keyword, discretely, to enumerate node relevance. A novel aggregation operator is defined, to combine the content relevance based fuzzy sets and, compute query dependent edge weights. The final rank, of an answer, is computed by non-monotonic addition of edge weights, as per their relevance to keyword query. FRank evaluates each answer based on the distribution of query keywords and structural connectivity between those keywords. An extensive empirical analysis shows superior performance by our proposed ranking measure as compared to the ranking measures adopted by current approaches in the literature.
Similar content being viewed by others
References
Aditya, B., Bhalotia, G., Chakrabarti, S., Hulgeri, A., Nakhe, C., Parag, P. and Sudarshan, S., “BANKS: Browsing and Keyword Searching in Relational Databases,” in Proc. VLDB, pp. 1083–1086, 2002.
Agrawal, S., Chaudhuri, S., Das, G., “DBXplorer: A System for Keyword-Based Search over Relational Databases,” in Proc. ICDE, pp. 5–16, 2002.
Arora, N., Lee, W., Park, S., Leung, C., Kim, J. and Kumar, H., “Efficient Fuzzy Ranking for Keyword Search on Graphs,” in Proc. DEXA, pp. 502–510, 2012.
Bompada, T., Chang, C. C., Chen, J., Kumar, R. and Shenoy, R., “On the Robustness of Relevance Measures with Incomplete Judgments,” in Proc. SIGIR, pp. 359–366, 2007.
Bruno, N. and Wang, H., “The Threshold Algorithm: From Middleware Systems to the Relational Engine,” in IEEE Trans. on Knowledge and Data Eng., pp. 523–537, April 2007.
Dalvi, B. B., Kshirsagar, M. and Sudarshan, S., “Keyword Search on External Memory Data Graphs,” in Proc. VLDB, pp. 1189–1204, 2008.
Dupret, G. and Piwowarski, B., “Model Based Comparison of Discounted Cumulative Gain and Average Precision,” Journal of Discrete Algorithms, pp. 49–62, 2013.
Fagin, R., “Combining Fuzzy Information from Multiple Systems,” in Proc. PODS, pp. 83–99, 1996.
Guo, L., Shao, F., Botev, C. and Shanmugasundaram, J., “XRANK: Ranked Keyword Search Over XML Documents,” in Proc. SIGMOD, pp.16–27, 2003.
He, H., Wang, H., Yang, J. and Philip, S. Y., “BLINKS: Ranked Keyword Searches on Graphs,” in Proc. SIGMOD, pp. 305–316, 2007.
Hristidis, V. and Papakonstantinou, Y., “Discover: Keyword Search in Relational Databases,” in Proc. VLDB, pp. 670–681, 2002.
Jarvelin, K., Price, S., Delcambre, L. and Nielsen, M., “Discounted Cumulated Gain Based Evaluation of Multiple-Query Information Retrieval Sessions,” in Proc. ECIR, pp. 4–15, 2008.
Kacholia, V., Pandit, S., Chakrabarti, S., Sudarshan, S., Desai, R. and Karambelkar, H., “Bidirectional Expansion for Keyword Search on Graph Databases,” in Proc. VLDB, pp. 505–516, 2005.
Kim, S., Lee, W., Arora, N. R., Jo, T. C., Kang, S. H., “Retrieving Keyworded Subgraphs with Graph Ranking Score,” Expert Systems with Applications, pp. 4647–4656, 2011.
Lee, W., Leung, C. and Lee, J., “MobileWeb Navigation in Digital Ecosystems Using Rooted Directed Trees,” IEEE Trans. Indus. Elect., 58, 6, pp. 2154–2162, 2011.
Lee, W., Leung, C., Song, J. and Eom, C., “A Network-Flow Based Influence Propagation Model for Social Networks,” in Proc. CGC, pp. 601–608, 2012.
Li, W. S., Candan, K. S., Vu, Q. and Agrawal, D., “Retrieving and Organizing Web Pages by Information Unit,” in Proc. WWW, pp. 230–244, 2001.
Li, C. and Chiang, T. W., “Function Approximation with Complex Neuro-Fuzzy System Using Complex Fuzzy Sets; A New Approach,” New Generation Computing, pp. 151–156, 2011.
Li, G., Ooi, B. C., Feng, J., Wang, J. and Zhou, L., “EASE: an Effective 3-in-1 Keyword Search Method for Unstructured, Semi-structured and Structured Data,” in Proc. SIGMOD, pp. 903–914, 2008.
Liu, F., Yu, C., Meng, W. and Chowdhury, A., “Effective Keyword Search in Relational Databases,” in Proc. SIGMOD, pp. 563–574, 2006.
Luo, Y., Lin, X., Wang, W. and Zhou, X., “Spark: Top-k Keyword Query in Relational Databases,” in Proc. SIGMOD, pp. 115–126, 2007.
Ma, Z. M. and Yan, L., “Fuzzy XML Data Modeling with the UML and Relational Data Models,” Data Knowledge Engineering, pp. 972–996, December 2007.
Matveeva, I., Burges, C., Burkard, T., Laucius, A., Wong, L., “High Accuracy Retrieval with Multiple Nested Ranker,” in Proc. SIGIR, pp. 437–444, 2006.
Moussalli, R., Salloum, M., Najjar, W. and Tsotras, V., “Massively Parallel XML Twig Filtering Using Dynamic Programming on FPGAs,” in Proc. ICDE, pp. 948–959, 2011.
Qin, L., Jeffrey, X. Y., Chang, L. and Yufei, T., “Querying Communities in Relational Databases,” in Proc. ICDE, pp. 724–735, 2009.
Sanchez, D., Castella-Roca, J. and Viejo, A., “Knowledge-based Scheme to Create Privacy-preserving but Semantically-related Queries for Web Search Engines,” Information Science, pp.17–30, 2013.
Setek, M. and Trawinski, B., “Selection of Heterogeneous Fuzzy Model Ensembles Using Self-adaptive Genetic Algorithms,” New Generation Computing, pp. 309–327, 2011.
Song, Y., Zhou, D. and He, L., “Post-ranking Query Suggestion by Diversifying Search Results,” in Proc. SIGIR, pp. 815–824, 2011.
Suganuma, S., Huynh, V. N., Nakamori, Y. and Wang, S., “A Fuzzy Set based Approach to Generalized Landscape Theory of Aggregation,” New Generation Computing, pp. 57–66, 2005.
Talukdar, P. P., Jacob, M., Mehmood, M. S., Crammer, K., Ives, Z. G., Pereira, F. and Guha, S., “Learning to Create Data-integrating Queries,” in Proc. VLDB, pp. 785–796, 2008.
Yildirim, Y., Yazici, A. and Yilmaz, T., “Automatic Semantic Content Extraction in Videos Using a Fuzzy Ontology and Rule-Based Model,” IEEE Trans. Knowl. Data Eng., 25, 1, pp.47–61, 2013.
Author information
Authors and Affiliations
Corresponding author
About this article
Cite this article
Arora, N.R., Lee, W. Graph based Ranked Answers for Keyword Graph Structure. New Gener. Comput. 31, 115–134 (2013). https://doi.org/10.1007/s00354-013-0203-6
Received:
Revised:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00354-013-0203-6