Efficient Evaluation of Distance Predicates in XPath Full-Text Query

  • Conference paper
Advanced Web and Network Technologies, and Applications (APWeb 2006)

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 3842)

Abstract

In recent years, a growing number of XML repositories have emerged, e.g., XML digital libraries and the SIGMOD and DBLP document collections. Because XML can represent both structured and unstructured data, using this kind of information effectively requires support for both structure-based and content-based (full-text) queries over XML repositories. With the existing XPath/XQuery Full-Text, users can search using cardinality, proximity, or distance predicates. In this paper, we propose an efficient approach to Information Retrieval (IR) style search on XML documents, focusing on distance predicates. A numbering technique is used to encode XML documents, and three algorithms are designed to evaluate queries with distance predicates. Several optimization techniques are introduced to improve performance. Extensive experiments show the effectiveness and efficiency of the proposed approach.
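To make the notion of a distance predicate over a numbered XML document concrete, the following is a minimal sketch. It is not one of the paper's three algorithms, and it is not their numbering scheme, neither of which is described in the abstract. It assumes a simple encoding in which every word receives a global position and every element receives the half-open range of word positions it contains, and it counts distance as the number of words lying between two distinct terms. The names `number_document` and `within_distance` are hypothetical.

```python
import xml.etree.ElementTree as ET
from bisect import bisect_left


def number_document(xml_text):
    """Assumed encoding: global word positions plus a word range per element."""
    root = ET.fromstring(xml_text)
    positions = {}   # term -> ascending list of word positions
    intervals = []   # (tag, start, end) with end exclusive, in post-order
    counter = 0

    def index_text(text):
        nonlocal counter
        for raw in (text or "").lower().split():
            token = raw.strip(".,;:!?")
            if token:
                positions.setdefault(token, []).append(counter)
                counter += 1

    def visit(elem):
        start = counter
        index_text(elem.text)
        for child in elem:
            visit(child)
            index_text(child.tail)
        intervals.append((elem.tag, start, counter))

    visit(root)
    return positions, intervals


def within_distance(positions, interval, term_a, term_b, max_gap):
    """True if two distinct terms occur inside the element's word range
    with at most max_gap words between them."""
    _, start, end = interval
    pos_b = positions.get(term_b, [])
    for a in positions.get(term_a, []):
        if not start <= a < end:
            continue
        # Only the nearest occurrence of term_b on each side of a can be
        # the closest one that still lies inside the element's range.
        i = bisect_left(pos_b, a)
        for b in pos_b[max(i - 1, 0):i + 1]:
            if start <= b < end and abs(a - b) - 1 <= max_gap:
                return True
    return False


doc = "<a><t>alpha beta gamma</t><p>delta beta. epsilon</p></a>"
positions, intervals = number_document(doc)
whole_document = intervals[-1]  # the root element is appended last
print(within_distance(positions, whole_document, "alpha", "gamma", 1))
```

Because positions are kept sorted per term, each check needs only a binary search, and containment in an element reduces to a range comparison on the numbers. This is the general benefit of a numbering-based encoding; the optimizations reported in the paper may differ.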

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Chen, H., Wang, X., Zhou, A. (2006). Efficient Evaluation of Distance Predicates in XPath Full-Text Query. In: Shen, H.T., Li, J., Li, M., Ni, J., Wang, W. (eds) Advanced Web and Network Technologies, and Applications. APWeb 2006. Lecture Notes in Computer Science, vol 3842. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11610496_9

  • DOI: https://doi.org/10.1007/11610496_9

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-31158-4

  • Online ISBN: 978-3-540-32435-5

  • eBook Packages: Computer Science, Computer Science (R0)
