Skip to main content

Content-Based Overlay Networks for XML Peers Based on Multi-level Bloom Filters

  • Conference paper
Databases, Information Systems, and Peer-to-Peer Computing (DBISP2P 2003)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2944))

Abstract

Peer-to-peer systems are gaining popularity as a means to effectively share huge, massively distributed data collections. In this paper, we consider XML peers, that is, peers that store XML documents. We show how an extension of traditional Bloom filters, called multi-level Bloom filters, can be used to route path queries in such a system. In addition, we propose building content-based overlay networks by linking together peers with similar content. The similarity of the content (i.e., the local documents) of two peers is defined based on the similarity of their filters. Our experimental results show that overlay networks built based on filter similarity are very effective in retrieving a large number of relevant documents, since peers with similar content tend to be clustered together.

This work was partially funded by the Information Society Technologies programme of the European Commission, Future and Emerging Technologies under the IST-2001-32645 DBGlobe project and by a Greek Ministry of Education program for Supporting Graduate Studies in Computer Science (EPEAEK II).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Bloom, B.: Space/time trade-offs in hash coding with allowable errors. Communications of the ACM 13(7), 422–426 (1970)

    Article  MATH  Google Scholar 

  2. Fan, L., Cao, P., Almeida, J., Broder, A.: Summary cache: A scalable wide-area Web cache sharing protocol. In: Procs of ACM SIGCOMM Conference, September 1998, pp. 254–265 (1988)

    Google Scholar 

  3. Gribble, S.D., Brewer, E.A., Hellerstein, J.M., Culler, D.: Scalable Distributed Data Structures for Internet Service Construction. In: Procs. of the Fourth Symposium on Operating Systems Design and Implementation (2000)

    Google Scholar 

  4. Hodes, T.D., Czerwinski, S.E., Zhao, B.Y., Joseph, A.D., Katz, R.H.: Architecture for Secure Wide-Area Service Discovery. In: Mobicom 1999 (1999)

    Google Scholar 

  5. Ramakrishna, M.V.: Practical performance of Bloom Filters and parallel free-text searching. Communications of the ACM 32(10), 1237–1239

    Google Scholar 

  6. Crespo, A., Garcia-Molina, H.: Routing Indices for Peer-to-peer Systems. In: ICDCS (2002)

    Google Scholar 

  7. Rhea, S.C., Kubiatowicz, J.: Probabilistic Location and Routing. In: INFOCOM (2002)

    Google Scholar 

  8. Crespo, A., Garcia-Molina, H.: Semantic Overlay Networks for P2P Systems (submitted for publication)

    Google Scholar 

  9. Koloniari, G., Pitoura, E.: Bloom-Based Filters for Hierarchical Data. In: WDAS 2003 (2003)

    Google Scholar 

  10. Triantafillou, P., Xiruhaki, C., Koubarakis, M., Ntarmos, N.: Towards High Performance Peer-to-Peer Content and Resource Sharing Systems. In: CIDR 2003 (2003)

    Google Scholar 

  11. Mohan, A., Kalogeraki, V.: Speculative Routing and Update Propagation: A Kundali Centric Approach. In: IEEE International Conference on Communications (ICC 2003) (May 2003)

    Google Scholar 

  12. Stoica, I., Morris, R., Karger, D., Kaashoek, M.F., Balakrishnan, H.: Chord: A Scalable Peer-to-Peer Lookup Service for Internet Applications. In: Procs. of the 2001 ACM SIGCOMM Conference (2001)

    Google Scholar 

  13. The MD5 Message-Digest Algorithm. RFC1321

    Google Scholar 

  14. The Niagara generator, http://www.cs.wisc.edu/niagara

  15. Nejdl, W., Wolpers, M., Siberski, W., Schmitz, C., Schlosser, M., Brunkhorst, I., Loser, A.: Super-Peer-Based Routing and Clustering Strategies for RDF-Based Peer-To-Peer Networks. In: WWW 2003, Budapest, Hungary, ACM, New York (2003) 1-58113-680-3/03/0005

    Google Scholar 

  16. Khambatti, M., Ryu, K., Dasgupta, P.: Peer-to-Peer Communities: Formation and Discovery. In: Fourteenth IASTED International Conference on Parallel and Distributed Computing and Systems, Cambridge (2002)

    Google Scholar 

  17. Milojicic, D.S., Kalogeraki, V., Lukose, R., Nagaraja, K., Pruyne, J., Richard, B., Rollins, S., Xu, Z.: Peer-to-Peer Computing, HP Technical Report, HPL-2002-57

    Google Scholar 

  18. Abiteboul, S., Bonifati, A., Cobéna, G., Manolescu, I., Milo, T.: Dynamic XML Documents with Distribution and Replication. In: SIGMOD 2003, San Diego, CA (June 2003)

    Google Scholar 

  19. Saroiu, S., Gummadi, K., Gribble, S.: A measurement study of peer-to-peer file sharing systems. In: Proceedings of Multimedia Conferencing and Networking (2002)

    Google Scholar 

  20. Annexstein, F.S., Berman, K.A., Jovanovic, M.A.: Latency Effects on Reachability in Large-scale Peer-to-Peer Networks. In: Procs. of the 13th annual ACM symposium on Parallel algorithms and architectures, Crete, Greece (2001)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2004 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Koloniari, G., Petrakis, Y., Pitoura, E. (2004). Content-Based Overlay Networks for XML Peers Based on Multi-level Bloom Filters. In: Aberer, K., Koubarakis, M., Kalogeraki, V. (eds) Databases, Information Systems, and Peer-to-Peer Computing. DBISP2P 2003. Lecture Notes in Computer Science, vol 2944. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24629-9_17

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-24629-9_17

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-20968-3

  • Online ISBN: 978-3-540-24629-9

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics