Abstract
When joining information provider peers to a peer-to-peer network, an arbitrary distribution is sub-optimal. In fact, clustering peers by their characteristics, enhances search and integration significantly. Currently super-peer networks, such as the Edutella network, provide no sophisticated means for such a ”semantic clustering” of peers. We introduce the concept of semantic overlay clusters (SOC) for super-peer networks enabling a controlled distribution of peers to clusters. In contrast to the recently announced semantic overlay network approach designed for flat, pure peer-to-peer topologies and for limited meta data sets, such as simple filenames, we allow a clustering of complex heterogeneous schemes known from relational databases and use advantages of super-peer networks, such as efficient search and broadcast of messages. Our approach is based on predefined policies defined by human experts. Based on such policies a fully decentralized broadcast- and matching approach distributes the peers automatically to super-peers. Thus we are able to automate the integration of information sources in super-peer networks and reduce flooding of the network with messages.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Aberer, K., Cudré-Mauroux, P., Hauswirth, M.: The chatty web: Emergent semantics through gossiping. In: Proceedings of the Twelfth International World Wide Web Conference (WWW 2003), Budapest, Hungary (2003)
Halevy, A.Y., Ives, Z.G., Mork, P., Tatarinov, I.: Piazza: Data management infrastructure for semantic web applications. In: Proceedings of the Twelfth International World Wide Web Conference (WWW 2003), Budapest, Hungary (2003)
Bernstein, P.A., Giunchiglia, F., Kementsietsidis, A., Mylopoulos, J., Serafini, L., Zaihrayeu, I.: Data management for peer-to-peer computing: A vision. In: Proceedings of the Fifth International Workshop on the Web and Databases, Madison, Wisconsin (2002)
Nejdl, W., Wolpers, M., Siberski, W., Löser, A., Bruckhorst, I., Schlosser, M., Schmitz, C.: Super-Peer-Based Routing and Clustering Strategies for RDF-Based Peer-To-Peer Networks. In: Proceedings of the Twelfth International World Wide Web Conference (WWW 2003), Budapest, Hungary (2003)
Löser, A., Nejdl, W., Wolpers, M., Siberski, W.: Information Integration in Schema-Based Peer-To-Peer Networks. In: Eder, J., Missikoff, M. (eds.) CAiSE 2003. LNCS, vol. 2681, Springer, Heidelberg (2003)
Ng, C.H., Sia, K.C., King, I.: Peer clustering and firework query model in the peer-to-peer network. Technical report, Chinese University of Hongkong, Department of Computer Science and Engineering (2003)
Crespo, A., Molina, H.G.: Semantic Overlay Networks, Stanford University (2002) (submitted for publication)
Nejdl, W., Wolf, B., Qu, C., Decker, S., Sintek, M., Naeve, A., Nilsson, M., Palmér, M., Risch, T.: EDUTELLA: a P2P Networking Infrastructure based on RDF. In: Proceedings of the Eleventh International World Wide Web Conference (WWW 2002), Hawaii, USA (2002)
Wiederhold, G.: Mediators in the architecture of future information systems. IEEE Computer 25, 38–49 (1992)
Kashyap, V., Sheth, A.: Information Brokering Across Heterogeneous Digital Data A Metadata-based Approach. Kluwer Academic Publishers, Boston (2000)
Mena, E., Kashyap, V., Sheth, A.P., Illarramendi, A.: OBSERVER: An approach for query processing in global information systems based on interoperation across pre-existing ontologies. In: Conference on Cooperative Information Systems, pp. 14–25 (1996)
Fisher, D.: Knowledge acquisition via incremental conceptual clustering. Machine Learning 2, 139–172 (1987)
Thompson, K., Langley, P.: Concept formation in structured domains. In: Fisher, D., Pazzani, M., Langley, P. (eds.) Concept formation: knowledge and experience in unsupervised learning, Morgan Kaufmann, San Francisco (1991)
Garcia-Molina, H., Yang, B.: Efficient search in peer-to-peer networks. In: Proceedings of ICDCS (2002)
Gong, L.: Project JXTA: A technology overview. Technical report, SUN Microsystems (2001), http://www.jxta.org/project/www/docs/TechOverview.pdf
Yang, B., Garcia-Molina, H.: Designing a super-peer network. In: Proccedings of the ICDE (March 2003)
Broekstra, J., et al.: A metadata model for semantics-basd peer-to-peer systems. In: Proceedings of the International Workshop in Conjunction with the WWW 2003, Budapest (2003)
Aberer, K., Hauswirth, M.: Semantic gossiping. In: Database and Information Systems Research for Semantic Web and Enterprises, Invitational Workshop, University of Georgia, Amicalola Falls and State Park, Georgia (2002)
Naumann, F.: Quality-Driven Query Answering for Integrated Information Systems. LNCS, vol. 2261. Springer, Heidelberg (2002)
Mohan, S., Willshire, M.J.: DataBryte: A data warehouse cleansing framework. In: Proceedings of the International Conference on Information Quality (IQ), Cambridge, MA, pp. 77–88 (1999)
Hernández, M.A., Stolfo, S.J.: Real-world data is dirty: Data cleansing and the merge/purge problem. Data Mining and Knowledge Discovery 2(1), 9–37 (1998)
Galhardas, H., Florescu, D., Shasha, D., Simon, E.: An extensible framework for data cleaning. In: ICDE, San Diego, CA, p. 312 (2000)
Gallager, R.G., Humblet, P.A., Spira, M.: A distributed algorithm for minimum weight spanning trees. ACM Transactions on Programming Languages and Systems 5-1, 66–77 (1983)
Schlosser, M., Sintek, M., Decker, S., Nejdl, W.: A scalable and ontology-based P2P infrastructure for semantic web services. In: Proceedings of the Second International Conference on Peer-to-Peer Computing, Linköping, Sweden (2002)
Zhuang, S.Q., Zhao, B.Y., Joseph, A.D., Katz, R.H., Kubiatowicz, J.D.: Bayeux: An architecture for scalable and fault-tolerant wide-area dissemination. In: Proceedings of ACM/NOSSDAV, Port Jefferson, NewYork, USA (2001)
Aberer, K.: P-grid: A self-organizing access structure for p2p information systems. In: Batini, C., Giunchiglia, F., Giorgini, P., Mecella, M. (eds.) CoopIS 2001. LNCS, vol. 2172, p. 179. Springer, Heidelberg (2001)
Buckley, C., Singhal, A., Mitra, M., Salton, G.: New retrieval approaches using SMART: TREC 4. In: Proceedings of the Fourth Text REtrieval Conference (TREC-4), pp. 25–48 (1995)
Tangmunarunkit, H., Decker, S., Kesselman, C.: Ontology-based resource matching the grid meets the semantic web. In: Proceedings of the First Workshop of Semantics in Peer-to-Peer and Grid Computing in Conjunction witrh the 12th WWW Conference, Budapest (2003)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Löser, A., Naumann, F., Siberski, W., Nejdl, W., Thaden, U. (2004). Semantic Overlay Clusters within Super-Peer Networks. In: Aberer, K., Koubarakis, M., Kalogeraki, V. (eds) Databases, Information Systems, and Peer-to-Peer Computing. DBISP2P 2003. Lecture Notes in Computer Science, vol 2944. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24629-9_4
Download citation
DOI: https://doi.org/10.1007/978-3-540-24629-9_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-20968-3
Online ISBN: 978-3-540-24629-9
eBook Packages: Springer Book Archive