Skip to main content
Log in

Fast retrieval of high-dimensional feature vectors in P2P networks using compact peer data summaries

  • Regular Paper
  • Published:
Multimedia Systems Aims and scope Submit manuscript

Abstract

The retrieval facilities of most peer-to-peer (P2P) systems are limited to queries based on a unique identifier or a small set of keywords. The techniques used for this purpose are hardly applicable for content based image retrieval (CBIR) in a P2P network. Furthermore, we will argue that the curse of dimensionality and the high communication overhead prevent the adaptation of multidimensional search trees or fast sequential scan techniques for P2P CBIR. In the present paper we will propose two compact data representations that can be distributed in a P2P network and used as the basis for a source selection. This allows for communicating with only a small fraction of all peers during query processing without deteriorating the result quality significantly. We will also present experimental results confirming our approach.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Aggarwal, C.C., Hinneburg, A., Keim, D.A.: On the surprising behavior of distance metrics in high dimensional space. In: Proceedings of the 8th International Conference on Database Theory (2001)

  2. Balakrishnan, H., Kasshoek, M.F., Karger, D. Morris, R., Stoica, I.: Looking up data in P2P systems. Commun. ACM 46(2), 43–48 (2003)

    Google Scholar 

  3. Beckmann, N., Kriegel, H.-P., Schneider, R., Seeger, B.: The R*-tree: an efficient and robust access method for points and rectangles. In: Proceedings of ACM SIGMOD, Atlantic City, NJ (1990)

  4. Berchtold, S., Keim, D.A., Kriegel, H.-P.: The X-tree: an index structure for high-dimensional data. In: Proceedings of the 22nd International Conference on Very Large Data Bases, San Francisco (1996)

  5. Callan, J.P., Lu, Z., Croft, W.B.: Searching distributed collections with inference networks. In: Proceedings of the 18th ACM SIGIR, Seattle, WA (1995)

  6. Clarke, I., Sandberg, O., Wiley, B., Hong, T.W.: Freenet: a distributed anonymous information storage and retrieval system. In: Federrath, H. (ed.): Designing Privacy Enhancing Technologies. Lecture notes in computer science, vol. 2009. Berlin, Heidelberg, New York: Springer (2000)

    Google Scholar 

  7. Clip2. The Gnutella Protocol Specification v0.4. URL: http://www9.limewire.com/developer/gnutella_protocol_0.4.pdf (2000)

  8. Cuenca-Acuna, F.M., Nguyen, T.D.: Text-based content search and retrieval in ad hoc P2P communities. Technical Report DCS-TR-483, Department of Computer Science, Rutgers University (2002)

  9. Corel Gallery 4.0, Corel Corp., Ottawa, Canada (1999)

  10. Eisenhardt, M., Müller, W., Henrich, A.: Classifying documents by distributed P2P clustering. In: Dittrich, K.R., König, W., Oberweis, A., Rannenberg, K., Wahlster, W. (eds.): GI Jahrestagung (2), GI-Lecture Notes in Informatics, vol. 35 (2003)

  11. Ganesan, P., Yang, B., Garcia-Molina, H.: One torus to rule them all: multi-dimensional queries in P2P systems. In: Proceedings of 7th International Workshop on the Web and Databases. New York: ACM Press (2004)

    Google Scholar 

  12. Gravano, L., Garcia-Molina, H., Tomasic, A.: The effectiveness of GlOSS for the text database discovery problem. In: Proceedings of ACM SIGMOD, Minneapolis, MN (1994)

  13. Hafner, J., Sawhney, H.S., Equitz, W., Flickner, M., Niblack, W.: Efficient color histogram indexing for quadratic form distance functions. IEEE Trans. Pattern Anal. Mach. Intell. 17(7), 729–736 (1995)

    Article  Google Scholar 

  14. Joseph, S.: Adaptive routing in distributed decentralized systems: Neurogrid, gnutella and freenet. In: Proceedings of Workshop on Infrastructure for Agents, MAS, and Scalable MAS, at Autonomous Agents. Montreal (2001)

  15. Müller, W., Eisenhardt, M., Henrich, A.: Efficient content based P2P image retrieval using peer content descriptions. In: Santini, S., Schettini, R. (eds.): Internet Imaging V. SPIE, no. 5304. SPIE/IS&T (2004) GYG2004, MuH2003

  16. Müller, W., Henrich, A.: Fast retrieval of high-dimensional feature vectors in P2P networks using compact peer data summaries. In: Proceedings of the ACM MIR'03 Workshop, Berkeley, CA (2003)

  17. Ng, C.H., Sia, K.C.: Peer clustering and firework query model. In: Poster Proceedings of the 11th Interational World Wide Web Conference, Honululu, HI (2002)

  18. Ng, C.H., Sia, K.C., Chang, C.H.: Advanced peer clustering and firework query model in the peer-to-peer network. Poster in WWW2003 (2003)

  19. Sakurai, Y., Yoshikawa, M., Uemura, S., Kojima, H.: The a-tree: an index structure for high-dimensional spaces using relative approximation. In: Proceedings of the International Conference on Very Large Data Bases, Cairo, Egypt (2000)

  20. Smeulders, A.W., Worring, M., Santini, S., Gupta, A., Jain, R.: Content based retrieval at the end of the early years. IEEE Trans. Pattern Anal. Mach. Intell. 22(12), 1349–1380 (2000)

    Article  Google Scholar 

  21. Tang, C., Xu, Z., Mahalingam, M.: pSearch: information retrieval in structured overlays. In: Proceedings of the 1st Workshop on Hot Topics in Networks (HotNets-I), Princeton, NJ (2002)

  22. Vasconcelos, N.: Bayesian models for visual information retrieval. PhD Thesis, MIT, Cambrdige, MA (2000)

  23. Weber, R., Schek, H.-J., Blott, S.: A quantitative analysis and performance study for similarity-search methods in high-dimensional spaces. In: Proceedings of the International Conference on Very Large Data Bases, New York (1998)

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Wolfgang Müller.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Müller, W., Eisenhardt, M. & Henrich, A. Fast retrieval of high-dimensional feature vectors in P2P networks using compact peer data summaries. Multimedia Systems 10, 464–474 (2005). https://doi.org/10.1007/s00530-005-0175-6

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00530-005-0175-6

Keywords

Navigation