Skip to main content
Log in

Interactive search over Web scale RDF data using predicates as constraints

  • Published:
Journal of Intelligent Information Systems Aims and scope Submit manuscript

Abstract

RDF (Resource Description Framework) data are more and more prevalent in the applications of semantic web and web data publication. The search over Web scale RDF data is essential for users to retrieve desired information from the huge RDF datasets, which typically applied as knowledge bases supporting many advanced information seeking tasks. In this paper, we propose some techniques that allow users to interactively search over the Web scale RDF data by using keywords as well as their predicates as additional constraints. We observe that the straightforward way of keyword search over the Web scale RDF data often generates a huge number of matching sub-structures (i.e., graphs containing the query keywords) due to the ambiguity of query intention generated from a small number of query keywords, although most of them are false interpretations of the query intention. To effectively interpret the semantics of queries, we define a novel keyword query called structure-aware keyword query that utilizes the predicates of RDF triples to assist users in clarifying their query intention. The challenge of such queries is to effectively and efficiently find a proper set of predicate-keyword pairs for query interpretation, to reduce the manual cost of user feedbacks as much as possible. To verify the novel query mechanism, we implement a system, and test it over the DBPedia 3.7 dataset. Results show that, for most queries, users can often efficiently get desired results by providing a small number of simple feedbacks on the constraints of predicates automatically generated.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

Notes

  1. http://dbpedia.org

  2. http://www.inex.otago.ac.nz/tracks/entity-ranking/entity-ranking.asp

References

  • Auer, S., Bizer, C., Kobilarov, G., Lehmann, J., Cyganiak, R., Ives, Z.G. (2007). Dbpedia: a nucleus for a web of open data. In ISWC/ASWC (pp. 722–735).

  • Bhalotia, G., Hulgeri, A., Nakhe, C., Chakrabarti, S., Sudarshan, S. (2002). Keyword searching and browsing in databases using banks. In ICDE (pp. 431–440).

  • Bollacker, K.D., Evans, C., Paritosh, P., Sturge, T., Taylor, J. (2008). Freebase: a collaboratively created graph database for structuring human knowledge. In SIGMOD conference (pp. 1247–1250).

  • Elbassuoni, S., & Blanco, R. (2011). Keyword search over rdf graphs. In CIKM (pp. 237–242).

  • Ferré, S., & Hermann, A. (2011). Semantic search: reconciling expressive querying and exploratory search. In International semantic web conference (Vol. 1, pp. 177–192).

  • He, H., Wang, H., Yang, J., Yu, P.S. (2007). Blinks: ranked keyword searches on graphs. In SIGMOD conference (pp. 305–316).

  • Heath, T., & Bizer, C. (2011). Linked data: evolving the web into a global data space. Synthesis lectures on the semantic web. Morgan & Claypool Publishers.

  • Huang, J., Abadi, D.J., Ren, K. (2011). Scalable sparql querying of large rdf graphs. PVLDB, 4(11), 1123–1134.

    Google Scholar 

  • Kacholia, V., Pandit, S., Chakrabarti, S., Sudarshan, S., Desai, R., Karambelkar, H. (2005). Bidirectional expansion for keyword search on graph databases. In VLDB (pp. 505–516).

  • Li, G., Ooi, B.C., Feng, J., Wang, J., Zhou, L. (2008). Ease: an effective 3-in-1 keyword search method for unstructured, semi-structured and structured data. In SIGMOD conference (pp. 903–914).

  • Neumann, T., & Weikum, G. (2010). The rdf-3x engine for scalable management of rdf data. The VLDB Journal, 19(1), 91–113.

    Article  Google Scholar 

  • Pérez, J., Arenas, M., Gutierrez, C. (2006). Semantics and complexity of sparql. In International semantic web conference (pp. 30–43).

  • Pound, J., Ilyas, I.F., Weddell, G.E. (2010). Expressive and flexible access to web-extracted data: a keyword-based structured query language. In SIGMOD conference (pp. 423–434).

  • Tran, T., Wang, H., Rudolph, S., Cimiano, P. (2009). Top-k exploration of query candidates for efficient keyword search on graph-shaped (rdf) data. In ICDE (pp 405–416).

  • Unger, C., Bühmann, L., Lehmann, J., Ngomo, A.C.N., Gerber, D., Cimiano, P. (2012). Template-based question answering over rdf data. In WWW (pp. 639–648).

  • Weikum, G., & Theobald, M. (2010). From information to knowledge: harvesting entities and relationships from web sources. In PODS (pp. 65–76).

  • Wilkinson, K., Sayers, C., Kuno, H.A., Reynolds, D. (2003). Efficient rdf storage and retrieval in jena2. In SWDB (pp. 131–150).

  • Yahya, M., Berberich, K., Elbassuoni, S., Ramanath, M., Tresp, V., Weikum, G. (2012). Deep answers for naturally asked questions on the web of data. In WWW (Companion Volume) (pp. 445–449).

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Mingyan Teng.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Teng, M., Zhu, G. Interactive search over Web scale RDF data using predicates as constraints. J Intell Inf Syst 44, 381–395 (2015). https://doi.org/10.1007/s10844-014-0336-1

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10844-014-0336-1

Keywords

Navigation