Abstract
Databases have used a schema-based approach to store and retrieve structured data for decades. For peer-to-peer (P2P) networks, similar approaches are only beginning to emerge, motivated in part by the fact that sending atomic queries to the appropriate peers fails for queries that need data from more than one peer. While quite a few database techniques can be reused in this new context, a P2P data management infrastructure poses additional challenges that have to be solved before schema-based P2P networks become as common as schema-based databases. Because P2P networks are dynamic, we cannot assume global knowledge about data distribution, and static topologies and static query plans are not suitable for these networks. Unlike in traditional distributed database systems, we cannot assume a complete schema instance; instead, we work with a distributed schema that directs query processing tasks from one node to one or more neighboring nodes.
In this paper, we first discuss a suitable topology for schema-based P2P networks and how distributed knowledge about data distribution can be stored, accessed, and updated based on that topology. Second, we describe how this knowledge can be used to distribute abstract query plans through the P2P network and expand them on the fly, so that query operators can be placed next to data sources and distributed computing resources are used more effectively.
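The idea of expanding an abstract query plan on the fly can be illustrated with a small sketch. It is not the paper's algorithm: the `Node` class, its `routing_index` field, the `scan` and `join` methods, and the property names `title` and `creator` are all hypothetical, chosen only to show a node that has no global view, knows which neighbors to ask per schema property, and executes a join locally after pulling the inputs from the peers that hold them.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    """Hypothetical peer: holds local data plus a per-property list of neighbors."""
    name: str
    # property name -> list of (subject, value) pairs stored at this node
    local_data: dict = field(default_factory=dict)
    # property name -> neighboring nodes known to lead to data for that property
    routing_index: dict = field(default_factory=dict)

    def scan(self, prop, visited=None):
        """Expand an abstract scan(prop): read local data, then forward to neighbors."""
        visited = visited if visited is not None else set()
        if self.name in visited:  # guard against cycles in the topology
            return []
        visited.add(self.name)
        rows = list(self.local_data.get(prop, []))
        for neighbor in self.routing_index.get(prop, []):
            rows.extend(neighbor.scan(prop, visited))
        return rows

    def join(self, left_prop, right_prop):
        """Join two properties on their subject; the join operator runs at this node."""
        right = {}
        for subject, value in self.scan(right_prop):
            right.setdefault(subject, []).append(value)
        return [
            (subject, left_value, right_value)
            for subject, left_value in self.scan(left_prop)
            for right_value in right.get(subject, [])
        ]


# Two peers each hold one half of the data the query needs.
peer_a = Node("A", local_data={"title": [("doc1", "P2P Queries"), ("doc2", "RDF Basics")]})
peer_b = Node("B", local_data={"creator": [("doc1", "Nejdl")]})
# The hub stores no data, only routing knowledge about its neighbors.
hub = Node("hub", routing_index={"title": [peer_a], "creator": [peer_b]})

result = hub.join("title", "creator")
print(result)
```

Here the query needs data from both peers, so sending it to either peer alone would return nothing; the hub resolves each scan through its routing index and returns the single matching row for `doc1`. A fuller design would also decide where to place the join, which this sketch fixes at the hub for brevity.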
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Brunkhorst, I., Dhraief, H., Kemper, A., Nejdl, W., Wiesner, C. (2004). Distributed Queries and Query Optimization in Schema-Based P2P-Systems. In: Aberer, K., Koubarakis, M., Kalogeraki, V. (eds) Databases, Information Systems, and Peer-to-Peer Computing. DBISP2P 2003. Lecture Notes in Computer Science, vol 2944. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24629-9_14
DOI: https://doi.org/10.1007/978-3-540-24629-9_14
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-20968-3
Online ISBN: 978-3-540-24629-9
eBook Packages: Springer Book Archive