Abstract
In recent years, more and more XML repositories are emerging, e.g., XML digital library, SIGMOD and DBLP document collections. Since XML is good at representing both structured and unstructured data, to facilitate the usage of this kind of information, it is necessary to support structure-based and content-based (full-text) queries/retrievals over XML repositories. With existing XPath/XQuery Full-Text, user could do search based on cardinality, proximity or distance predicates. In this paper, we propose an efficient approach for the Information Retrieval (IR) style search, especially distance predicates search, on XML documents. Numbering technique is employed to encode XML documents, and then three algorithms are designed to evaluate queries with distance predicates. To improve the performance, some optimization techniques are introduced. Extensive experiments show the effectiveness and efficiency of the proposed approach.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Amer-Yahia, S., Lakshmanan, L.V.S., Pandit, S.: FleXPath: Flexible Structure and Full-Text Querying for XML. In: SIGMOD 2004, pp. 83–94 (2004)
Amer-Yahia, S., Fernndez, M.F., Srivastava, D., Xu, Y.: PIX: Exact and Approximate Phrase Matching in XML. In: SIGMOD 2003, pp. 664–664 (2003)
Theobald, A., Weikum, G.: The XXL Search Engine: Ranked Retrieval of XML Data Using Indexes and Ontologies. In: SIGMOD 2002, p. 615 (2002)
Fuhr, N., Grojohann, K.: XIRQL: An XML Query Language Based on Information Retrieval Concepts. In: TOIS 2004, pp. 313–356 (2004)
Clark, J., DeRose, S.: XML Path Language (XPath) Version 1.0 (1999), http://www.w3.org/TR/xpath
Chamberlin, D., Berglund, A., Boag, S.: XQuery 1.0: An XML Query Language (2005), http://www.w3.org/TR/xquery/
Case, P., Amer-Yahia, S., Botev, C.: XQuery 1.0 and XPath 2.0 Full-Text (2005), http://www.w3.org/TR/xquery-Full-Text/
Buxton, S., Rys, M.: Uery and XPath Full-Text Requirements (2003), http://www.w3.org/TR/xquery-Full-Text-requirements/
Amer-Yahia, S., Case, P.: XQuery 1.0 and XPath 2.0 Full-Text Use Cases (2005), http://www.w3.org/TR/xmlquery-Full-Text-use-cases/
Guo, L., Shao, F., Botev, C., Shanmugasundaram, J.: XRANK: Ranked Keyword Search over XML Documents. In: SIGMOD 2003, pp. 16–27 (2003)
Hristidis, V., Papakonstantinou, Y., Balmin, A.: Keyword Proximity Search on XML Graphs. In: ICDE 2003, pp. 367–378 (2003)
Deutsch, A., Fernandez, M., Florescu, D., Levy, A., Suciu, D.: XML-QL: A Query Language for XML (1998), http://www.w3.org/TR/NOTE-xml-ql/
Amer-Yahia, S., Botev, C., Shanmugasundaram, J.: Texquery: A Full-Text Search Extension to XQuery. In: WWW 2004, pp. 583–594 (2004)
XKSearch, http://www.db.ucsd.edu/projects/xksearch
Bremer, J.M., Gert, M.: XQuery/IR: Integrating XML Document and Data Retrieval. In: WebDB 2002, pp. 1–6 (2002)
TREC, http://trec.nist.gov
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Chen, H., Wang, X., Zhou, A. (2006). Efficient Evaluation of Distance Predicates in XPath Full-Text Query. In: Shen, H.T., Li, J., Li, M., Ni, J., Wang, W. (eds) Advanced Web and Network Technologies, and Applications. APWeb 2006. Lecture Notes in Computer Science, vol 3842. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11610496_9
Download citation
DOI: https://doi.org/10.1007/11610496_9
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-31158-4
Online ISBN: 978-3-540-32435-5
eBook Packages: Computer ScienceComputer Science (R0)