Abstract
Applications such as online shopping, e-commerce, and supply-chain management require the ability to manage large sets of specifications of products and/or services as well as of consumer requirements, and call for efficient matching of requirements to specifications.
Requirements are best viewed as “queries” and specifications as data, often represented in XML. We present a framework where requirements and specifications are both registered with and are maintained by a registry. On a periodical basis, the registry matches new incoming specifications, e.g., of products and services, against requirements, and notifies the owners of the requirements of matches found. This problem is dual to the conventional problem of database query processing in that the size of data (e.g., a document that is streaming by) is quite small compared to the number of registered queries (which can be very large). For performing matches efficiently, we propose the notion of a “requirements index”, a notion that is dual to a traditional index. We provide efficient matching algorithms that use the proposed indexes. Our prototype MatchMaker system implementation uses our requirements index-based matching algorithms as a core and provides timely notification service to registered users. We illustrate the effectiveness and scalability of the techniques developed with a detailed set of experiments.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Mehmet Altinel, Michael J. Franklin. Efficient Filtering of XML Documents for Selective Dissemination of Information. In Proc. VLDB, 2000.
Chee-Yong Chan, Pascal Felber, Minos Garofalakis, and Rajeev Rastogi. Efficient Filtering of XML Documents with XPath Expressions. Proc. ICDE, San Jose, CA, Feb. 2002. To appear.
J. Chen, D. DeWitt, F. Tian, and Y. Wang. NiagaraCQ: A scalable continuous query System for Internet Dtatabases. In ACM SIGMOD, May 2000.
A.L. Diaz and D. Lovell. XML Generator. http://www.alphaworks/ibm.com/tech/xmlgnerator, Sept. 1999.
Françoise Fabret, Hans-Arno Jacobsen, François LLirbat, João Pereira, Kenneth A. Ross, and Dennis Shasha. Filtering Algorithms and Implementation for Very Fast Publish/Subscribe. In ACM SIGMOD, May 2001.
F.B Fabret et al. Efficient matching for content-based publish/subscribe systems. In Proc. CoopIS, 2000.
Hector Garcia-Molina A. Crespo, and O. Buyukkokten. Efficient Query subscription Processing in a Multicast Environment. In Proc. ICDE, 2000.
Himanshu Gupta and Divesh Srivastava. Data Warehouse of Newsgroups. In Proc. ICDT, 1999.
Eric N. Hanson Chris Carnes, Lan Huang, Mohan Konyala, Lloyd Noronha, Sashi Parthasarathy, J. B. Park, and Albert Vernon. Scalable Trigger Processing. In Proc. ICDE, pages 266–275, April 1999.
The Intel Corporation. Intel Netstructure XML Accelerators. http://www.intel.com/netstructure/products/xml_accelerators.htm, 2000.
H.V. Jagadish, Laks V.S. Lakshmanan, Divesh Srivastava, and Keith Thompson. TAX: A Tree Algebra for XML. Proc. DBPL, Roma, Italy, September 2001.
Laks V.S. Lakshmanan and P. Sailaja. On Efficient Matching of Streaming XML Documents and Queries. Tech. Report, Univ. Of British Columbia, December 2001. http://www.cs.ubc.ca/laks/matchmaker-edbt02-full.ps.gz.
Laks V.S. Lakshmanan and P. Sailaja. MatchMaker: A system for matching XML documents and queries. Demo paper, Proc. ICDE, San Jose, CA, Feb. 2002. To appear.
L. Liu C. Pu, and W. Tang. Continual queries for internet-scale event-driven information delivery. IEEE Trans. on Knowledge and Data Eng. 11(4): 610–628 (1999).
Benjamin Nguyen, Serge Abiteboul, Gregory Cobena, and Mihai Preda. Monitoring XML Data on the Web. ACM SIGMOD, 2001.
Douglas Terry David Goldberg, David Nichols, and Brian Oke. Continuous queries over Append-only databases. In ACM SIGMOD, June 1992.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Lakshmanan, L.V.S., Parthasarathy, S. (2002). On Efficient Matching of Streaming XML Documents and Queries. In: Jensen, C.S., et al. Advances in Database Technology — EDBT 2002. EDBT 2002. Lecture Notes in Computer Science, vol 2287. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45876-X_11
Download citation
DOI: https://doi.org/10.1007/3-540-45876-X_11
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-43324-8
Online ISBN: 978-3-540-45876-0
eBook Packages: Springer Book Archive