Skip to main content
Log in

Keyword query with structure: towards semantic scoring of XML search results

  • Published:
Information Technology and Management Aims and scope Submit manuscript

Abstract

Keyword search is an effective paradigm for information discovery and has been introduced recently to query XML documents. Scoring of XML search results is an important issue in XML keyword search. Traditional “bag-of-words” model cannot differentiate the roles of keywords as well as the relationship between keywords, thus is not proper for XML keyword queries. In this paper, we present a new scoring method based on a novel query model, called keyword query with structure (QWS), which is specially designed for XML keyword query. The method is based on a totally new view taken by the QWS model on a keyword query that, a keyword query is a composition of several query units, each representing a query condition. We believe that this method captures the semantic relevance of the search results. The paper first introduces an algorithm reformulating a keyword query to a QWS. Then, a scoring method is presented which measures the relevance of search results according to how many and how well the query conditions are matched. The scoring method is also extended to clusters of search results. Experimental results verify the effectiveness of our methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3

Similar content being viewed by others

References

  1. Chen Y, Wang W, Liu Z et al (2009) Keyword search on structured and semi-structured data. In: Proceedings of SIGMOD, pp 1005–1010

  2. Liu Z, Chen Y (2011) Processing keyword search on XML: a survey. World Wide Web 14(5–6):671–707

    Article  Google Scholar 

  3. Manning CD, Raghavan P, Schütze H (2008) Introduction to information retrieval. Cambridge University Press, Cambridge

    Book  Google Scholar 

  4. Liu Z, Chen Y (2007) Identifying meaningful return information for XML keyword search. In: Proceedings of SIGMOD, pp 329–340

  5. Liu Z, Chen Y (2010) Return specification inference and result clustering for keyword search on XML. ACM Trans Database Syst 35(2):1–47

    Google Scholar 

  6. Bao Z, Ling TW, Chen B et al (2009) Effective XML keyword search with relevance oriented ranking. In: Proceedings of ICDE, pp 517–528

  7. Bao Z, Lu J, Ling TW et al (2010) Towards an effective XML keyword search. IEEE Trans Knowl Data Eng 22(8):1077–1092

    Article  Google Scholar 

  8. Paparizos S (2013) Entwining structure into web search. In: Proceedings of DBRank, pp 1–4

  9. Patil R, Chen Z (2012) STRUCT: incorporating contextual information for english query search on relational databases. In: Proceedings of KEYS, pp 11–22

  10. Zeng Z, Bao Z, Le TN et al (2014) ExpressQ: identifying keyword context and search target in relational keyword queries. In: Proceedings of CIKM, pp 31–40

  11. Guo L, Shao F, Botev C et al (2003) XRANK: ranked keyword search over XML documents. In: Proceedings of SIGMOD, pp 16–27

  12. Cohen S, Mamou J, Kanza Y et al (2003) XSEarch: a semantic search engine for XML. In: Proceedings of VLDB, pp 45–56

  13. Li G, Ooi BC, Feng J et al (2008) EASE: an effective 3-in-1 keyword search method for unstructured, semi-structured and structured data. In: Proceedings of SIGMOD, pp 903–914

  14. Li J, Liu C, Zhou R et al (2014) XML keyword search with promising result type recommendations. World Wide Web 17(1):127–159

    Article  Google Scholar 

  15. Carpineto C, Osiński S, Romano G et al (2009) A survey of web clustering engines. ACM Comput Surv 1(3):17

    Google Scholar 

  16. Moreno JG, Dias G, Cleuziou G (2014) Query log driven web search results clustering. In: Proceedings of SIGIR, pp 777–786

  17. Liu X, Wan C, Chen L (2011) Returning Clustered results for keyword search on XML documents. IEEE Trans Knowl Data Eng 23(12):1811–1825

    Article  Google Scholar 

  18. Wu Y, Yang S, Srivatsa M et al (2013) Summarizing answer graphs induced by keyword queries. Proc VLDB Endow 6(14):1774–1785

    Article  Google Scholar 

  19. Liu X, Chen L, Wan C et al (2013) Exploiting structures in keyword queries for effective XML search. Inf Sci 240:56–71

    Article  Google Scholar 

  20. Goldman R, Widom J (1997) DataGuides: enabling query formulation and optimization in semistructured databases. In: Proceedings of VLDB, pp 436–445

  21. Scaiella U, Ferragina P, Marino A et al (2012) Topical clustering of search results. In: Proceedings of WSDM, pp 223–232

  22. (2013) The lemur toolkit for language modelling and information retrieval. www.lemurproject.org/

  23. Ley M (2013) DBLP bibliography. www.informatik.uni-trier.de/~ley/db/

  24. Mikla G (2002) The Mondial dataset. http://www.cs.washington.edu/research/xmldatasets/

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Xiping Liu.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Liu, X., Wan, C. & Liu, D. Keyword query with structure: towards semantic scoring of XML search results. Inf Technol Manag 17, 151–163 (2016). https://doi.org/10.1007/s10799-015-0247-z

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10799-015-0247-z

Keywords

Navigation