skip to main content
10.1145/1142473.1142588acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
Article

Quark: an efficient XQuery full-text implementation

Published:27 June 2006Publication History

ABSTRACT

The XQuery 1.0 and XPath 2.0 Full-text (XQFT) language has been developed by the W3C to extend XQuery and XPath with full-text search capabilities. XQFT allows users to specify a mix of structured and complex full-text predicates, and also allows users to score/rank such queries. The power and flexibility of XQFT gives rise to two interesting questions. First, is it possible to efficiently integrate a full-function XML query language with sophisticated full-text search? Second, is it possible to score and rank arbitrary XQuery and XQFT queries? In this demonstration, we present evidence that it is indeed possible to achieve the above goals. We demonstrate the Quark open-source data management system and show how we can seamlessly and efficiently integrate structured and unstructured search over XML data. In particular, we demonstrate (a) techniques for efficiently evaluating keyword search over virtual XML views, and (b) a framework for scoring both structured and full-text predicates.

References

  1. R. J. Bayardo, D. Gruhl, V. Josifovski, and J. Myllymaki. An evaluation of binary xml encoding optimizations for fast stream based xml processing. In WWW'04. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. C. Botev and J. Shanmugasundaram. Context-sensitive keyword search and ranking for xml. In WebDB'2005 Poster.Google ScholarGoogle Scholar
  3. Z. Chen, J. Gehrke, F. Korn, N. Koudas, J. Shanmugasundaram, and D. Srivastava. Index structures for matching xml twigs using relational query processors. In XSDM'2005.Google ScholarGoogle Scholar
  4. M. Fernandez, W.-C. Tan, and D. Suciu. Silkroute: Trading between relations and xml. In WWW'1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. N. Fuhr and K. Groejohann. Xirql: a query language for information retrieval in xml documents. In SIGIR'2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. N. Fuhr and T. Rolleke. A probabilistic relational algebra for the integration of information retrieval and database systems. ACM Trans. Inf. Syst., 15(1):32--66, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. J. Shanmugasundaram, J. Kiernan, E. Shekita, C. Fan, and J. Funderburk. Querying xml views of relational data. In VLDB'2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. I. Tatarinov, S. Viglas, K. Beyer, J. Shanmugasundaram, E. Shekita, and C. Zhang. Storing and querying ordered xml using a relational database system. In SIGMOD'2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. A. Theobald and G. Weikum. The index-based xxl search engine for querying xml data with relevance ranking. In EDBT'2002. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Quark: an efficient XQuery full-text implementation

              Recommendations

              Comments

              Login options

              Check if you have access through your login credentials or your institution to get full access on this article.

              Sign in
              • Published in

                cover image ACM Conferences
                SIGMOD '06: Proceedings of the 2006 ACM SIGMOD international conference on Management of data
                June 2006
                830 pages
                ISBN:1595934340
                DOI:10.1145/1142473

                Copyright © 2006 ACM

                Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

                Publisher

                Association for Computing Machinery

                New York, NY, United States

                Publication History

                • Published: 27 June 2006

                Permissions

                Request permissions about this article.

                Request Permissions

                Check for updates

                Qualifiers

                • Article

                Acceptance Rates

                Overall Acceptance Rate785of4,003submissions,20%

              PDF Format

              View or Download as a PDF file.

              PDF

              eReader

              View online with eReader.

              eReader