Skip to main content

On Efficient Matching of Streaming XML Documents and Queries

  • Conference paper
  • First Online:
Advances in Database Technology — EDBT 2002 (EDBT 2002)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2287))

Included in the following conference series:

Abstract

Applications such as online shopping, e-commerce, and supply-chain management require the ability to manage large sets of specifications of products and/or services as well as of consumer requirements, and call for efficient matching of requirements to specifications.

Requirements are best viewed as “queries” and specifications as data, often represented in XML. We present a framework where requirements and specifications are both registered with and are maintained by a registry. On a periodical basis, the registry matches new incoming specifications, e.g., of products and services, against requirements, and notifies the owners of the requirements of matches found. This problem is dual to the conventional problem of database query processing in that the size of data (e.g., a document that is streaming by) is quite small compared to the number of registered queries (which can be very large). For performing matches efficiently, we propose the notion of a “requirements index”, a notion that is dual to a traditional index. We provide efficient matching algorithms that use the proposed indexes. Our prototype MatchMaker system implementation uses our requirements index-based matching algorithms as a core and provides timely notification service to registered users. We illustrate the effectiveness and scalability of the techniques developed with a detailed set of experiments.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Mehmet Altinel, Michael J. Franklin. Efficient Filtering of XML Documents for Selective Dissemination of Information. In Proc. VLDB, 2000.

    Google Scholar 

  2. Chee-Yong Chan, Pascal Felber, Minos Garofalakis, and Rajeev Rastogi. Efficient Filtering of XML Documents with XPath Expressions. Proc. ICDE, San Jose, CA, Feb. 2002. To appear.

    Google Scholar 

  3. J. Chen, D. DeWitt, F. Tian, and Y. Wang. NiagaraCQ: A scalable continuous query System for Internet Dtatabases. In ACM SIGMOD, May 2000.

    Google Scholar 

  4. A.L. Diaz and D. Lovell. XML Generator. http://www.alphaworks/ibm.com/tech/xmlgnerator, Sept. 1999.

  5. Françoise Fabret, Hans-Arno Jacobsen, François LLirbat, João Pereira, Kenneth A. Ross, and Dennis Shasha. Filtering Algorithms and Implementation for Very Fast Publish/Subscribe. In ACM SIGMOD, May 2001.

    Google Scholar 

  6. F.B Fabret et al. Efficient matching for content-based publish/subscribe systems. In Proc. CoopIS, 2000.

    Google Scholar 

  7. Hector Garcia-Molina A. Crespo, and O. Buyukkokten. Efficient Query subscription Processing in a Multicast Environment. In Proc. ICDE, 2000.

    Google Scholar 

  8. Himanshu Gupta and Divesh Srivastava. Data Warehouse of Newsgroups. In Proc. ICDT, 1999.

    Google Scholar 

  9. Eric N. Hanson Chris Carnes, Lan Huang, Mohan Konyala, Lloyd Noronha, Sashi Parthasarathy, J. B. Park, and Albert Vernon. Scalable Trigger Processing. In Proc. ICDE, pages 266–275, April 1999.

    Google Scholar 

  10. The Intel Corporation. Intel Netstructure XML Accelerators. http://www.intel.com/netstructure/products/xml_accelerators.htm, 2000.

  11. H.V. Jagadish, Laks V.S. Lakshmanan, Divesh Srivastava, and Keith Thompson. TAX: A Tree Algebra for XML. Proc. DBPL, Roma, Italy, September 2001.

    Google Scholar 

  12. Laks V.S. Lakshmanan and P. Sailaja. On Efficient Matching of Streaming XML Documents and Queries. Tech. Report, Univ. Of British Columbia, December 2001. http://www.cs.ubc.ca/laks/matchmaker-edbt02-full.ps.gz.

  13. Laks V.S. Lakshmanan and P. Sailaja. MatchMaker: A system for matching XML documents and queries. Demo paper, Proc. ICDE, San Jose, CA, Feb. 2002. To appear.

    Google Scholar 

  14. L. Liu C. Pu, and W. Tang. Continual queries for internet-scale event-driven information delivery. IEEE Trans. on Knowledge and Data Eng. 11(4): 610–628 (1999).

    Article  Google Scholar 

  15. Benjamin Nguyen, Serge Abiteboul, Gregory Cobena, and Mihai Preda. Monitoring XML Data on the Web. ACM SIGMOD, 2001.

    Google Scholar 

  16. Douglas Terry David Goldberg, David Nichols, and Brian Oke. Continuous queries over Append-only databases. In ACM SIGMOD, June 1992.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2002 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Lakshmanan, L.V.S., Parthasarathy, S. (2002). On Efficient Matching of Streaming XML Documents and Queries. In: Jensen, C.S., et al. Advances in Database Technology — EDBT 2002. EDBT 2002. Lecture Notes in Computer Science, vol 2287. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45876-X_11

Download citation

  • DOI: https://doi.org/10.1007/3-540-45876-X_11

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-43324-8

  • Online ISBN: 978-3-540-45876-0

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics