An Effective XML Filtering Method for High-Performance Publish/Subscribe System

  • Conference paper

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 3842)

Abstract

When large-scale XPath queries are processed against fast XML streams on the Internet, a bottleneck arises from the lack of memory available for filtering. This paper presents an effective automaton-based method that lowers the memory requirement by reducing the costly operators (such as “//” and “*”) in XPath expressions. The method uses a product automaton to convert XPath expressions into actual paths, thereby reducing the complexity of the search space. The proposed method was implemented in the MFSA (Multi-Finite States Automata) system for filtering subscribers' queries. Empirical evidence shows that it remains efficient and stable when the number of queries is large.
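
The abstract does not spell out how the product automaton is built, so the following is only a minimal sketch of the general idea, not the MFSA implementation. It assumes a known, acyclic element hierarchy (for example one derived from a DTD) and walks the query steps and that hierarchy together, so that every “//” and “*” is resolved into concrete root-to-element paths. The names `parse_xpath`, `expand`, and the sample `schema` are hypothetical and introduced purely for illustration.

```python
import re


def parse_xpath(xpath):
    """Split a simple XPath into (axis, name) steps; axis is '/' or '//'."""
    steps = re.findall(r"(//|/)([^/]+)", xpath)
    if "".join(axis + name for axis, name in steps) != xpath:
        raise ValueError(f"unsupported XPath: {xpath!r}")
    return steps


def expand(xpath, schema, root):
    """Return the sorted concrete paths that `xpath` can match.

    Assumption: `schema` maps each element name to its allowed child
    names and contains no cycles, so the set of paths is finite.
    """
    steps = parse_xpath(xpath)
    results = set()

    def walk(path, i):
        # Product state: (position in the schema, index of the next query step).
        if i == len(steps):
            results.add("/" + "/".join(path))
            return
        axis, name = steps[i]
        candidates = schema.get(path[-1], []) if path else [root]
        for child in candidates:
            if name == "*" or name == child:
                walk(path + [child], i + 1)
            if axis == "//":
                # Descendant axis: descend past this element, same query step.
                walk(path + [child], i)

    walk([], 0)
    return sorted(results)


# Hypothetical element hierarchy used only for this example.
schema = {
    "catalog": ["book", "journal"],
    "book": ["title", "author"],
    "journal": ["title", "article"],
    "article": ["title", "author"],
}

print(expand("/catalog//title", schema, "catalog"))
print(expand("/catalog/*/author", schema, "catalog"))
```

Once each subscription is rewritten this way, a filter only needs to match wildcard-free paths, which is the kind of search-space reduction the abstract refers to; how the original system stores and shares those paths is not stated here.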

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Wang, T., Liu, DX., Sun, W., Zhang, Ws. (2006). An Effective XML Filtering Method for High-Performance Publish/Subscribe System. In: Shen, H.T., Li, J., Li, M., Ni, J., Wang, W. (eds) Advanced Web and Network Technologies, and Applications. APWeb 2006. Lecture Notes in Computer Science, vol 3842. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11610496_123

  • DOI: https://doi.org/10.1007/11610496_123

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-31158-4

  • Online ISBN: 978-3-540-32435-5

  • eBook Packages: Computer Science, Computer Science (R0)
