P2P case storage and retrieval with an unspecified ontology

Artificial Intelligence Review

Abstract

Traditional approaches to similarity-based retrieval of structured data, such as Case-Based Reasoning (CBR), have largely been implemented on centralized storage systems. In such systems, when cases contain both numeric and free-text attributes, similarity-based retrieval cannot exploit standard speedup techniques based on multi-dimensional indexing, so retrieval is implemented as an exhaustive comparison of the case to be solved against the whole set of stored cases. In this work, we review current research on Peer-to-Peer (P2P) and distributed CBR techniques and propose a novel approach that stores the case-base in a decentralized P2P environment, using the notion of Unspecified Ontology, so that CBR systems can scale up to large case-bases. We develop an algorithm for efficient retrieval of approximated most-similar cases, which exploits inherent characteristics of the unspecified ontology to improve the performance of the case retrieval stage of the CBR problem-solving cycle. The experiments show that the algorithm successfully retrieves cases close to the most-similar cases while reducing the number of cases to be compared, and hence improves the performance of the retrieval stage. Moreover, the distributed nature of our approach eliminates the computational bottleneck and single point of failure of centralized storage systems.
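
To make the centralized baseline concrete, the following is a minimal sketch of exhaustive similarity-based retrieval over cases with mixed numeric and free-text attributes. It is not the algorithm proposed in the article. The function names, the exact-match rule for text values, the range-scaled distance for numeric values, and the unweighted average are all illustrative assumptions.

```python
from typing import Dict, List, Tuple, Union

Value = Union[float, str]
Case = Dict[str, Value]


def attribute_similarity(a: Value, b: Value, value_range: float = 1.0) -> float:
    """Local similarity in [0, 1] for one attribute (assumed, illustrative rule)."""
    if isinstance(a, str) or isinstance(b, str):
        # Free-text values: exact match after normalisation, otherwise 0.
        return 1.0 if str(a).strip().lower() == str(b).strip().lower() else 0.0
    # Numeric values: absolute difference scaled by an assumed positive range.
    return max(0.0, 1.0 - abs(a - b) / value_range)


def case_similarity(query: Case, stored: Case, ranges: Dict[str, float]) -> float:
    """Unweighted mean of local similarities; attributes missing in `stored` score 0."""
    if not query:
        return 0.0
    total = 0.0
    for name, value in query.items():
        if name in stored:
            total += attribute_similarity(value, stored[name], ranges.get(name, 1.0))
    return total / len(query)


def retrieve_exhaustive(
    query: Case, case_base: List[Case], ranges: Dict[str, float], k: int = 1
) -> List[Tuple[int, float]]:
    """Compare the query with every stored case and return the k best (index, score)."""
    scored = [(case_similarity(query, c, ranges), i) for i, c in enumerate(case_base)]
    # Highest similarity first; ties broken by position in the case-base.
    scored.sort(key=lambda t: (-t[0], t[1]))
    return [(i, s) for s, i in scored[:k]]


# Hypothetical toy case-base, for illustration only.
case_base = [
    {"price": 100.0, "color": "red"},
    {"price": 150.0, "color": "Red"},
    {"price": 100.0, "color": "blue"},
]
query = {"price": 110.0, "color": "red"}
ranges = {"price": 100.0}
top = retrieve_exhaustive(query, case_base, ranges, k=2)
```

Every query touches every stored case, so the cost grows linearly with the size of the case-base. This is the cost the abstract says the proposed approach reduces by comparing fewer cases.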

Author information

Corresponding author

Correspondence to Shlomo Berkovsky.

About this article

Cite this article

Berkovsky, S., Kuflik, T. & Ricci, F. P2P case storage and retrieval with an unspecified ontology. Artif Intell Rev 28, 227–255 (2007). https://doi.org/10.1007/s10462-009-9100-2

  • DOI: https://doi.org/10.1007/s10462-009-9100-2
