Abstract
Data management in P2P Systems is a challenging problem, due to the high number of autonomous and heterogeneous peers. In some Peer Data Management Systems (PDMSs), peers are semantically clustered in the overlay network. A peer joining the system is assigned to an appropriate cluster, and a query issued by a user at a given peer is routed to semantic neighbor clusters which can provide relevant answers. To help matters, semantic knowledge in the form of ontologies and contextual information has been used successfully to support the techniques used to manage data in such systems. Ontologies can be used to solve the heterogeneities between the peers, while contextual information allows a PDMS to deal with information that is acquired dynamically during the execution of a given query. The goal of this paper is to point out how the semantics provided by ontologies and contextual information can be used to enhance the results of two important data management issues in PDMSs, namely, peer clustering and query reformulation. We present a semantic-based approach to support these processes and we report some experimental results which show how semantics can improve them.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Halevy, A.Y., Ives, Z.G., Mork, P., Tatarinov, I.: Piazza: Data Management Infrastructure for Semantic Web Applications. In: World Wide Web Conference, pp. 556–567 (2003)
Valduriez, P., Pacitti, E.: Data Management in Large-Scale P2P Systems. In: Int. Conference on High Performance Computing for Computational Science (VecPar 2004), Valencia, Spain, pp. 104–118 (2004)
Mandreoli, F., Martoglia, R., Penzo, W., Sassatelli, S., Villani, G.: SUNRISE: Exploring PDMS Networks with Semantic Routing Indexes. In: 4th European Semantic Web Conference (ESWC 2007), Innsbruck, Austria (2007)
Lodi, S., Penzo, W., Mandreoli, F., Martoglia, R., Sassatelli, S.: Semantic Peer, Here are the Neighbors You Want? In: 11th Extending Database Technology (EDBT 2008), Nantes, France, pp. 26–37 (2008)
Kantere, V., Tsoumakos, D., Sellis, T., Roussopoulos, N.: GrouPeer: Dynamic clustering of P2P databases. The Information Systems Journal 34(1), 62–86 (2009)
Herschel, S., Heese, R.: Humboldt Discoverer: A Semantic P2P index for PDMS. In: International Workshop Data Integration and the Semantic Web, Porto, Portugal (2005)
Castano, S., Montanelli, S.: Semantic Self-Formation of Communities of Peers. In: ESWC Workshop on Ontologies in Peer-to-Peer Communities, Heraklion, Greece (2005)
Souza, D., Arruda, T., Salgado, A.C., Tedesco, P., Kedad, Z.: Using Semantics to Enhance Query Reformulation in Dynamic Environments. In: 13th East European Conference on Advances in Databases and Information Systems (ADBIS 2009), Riga, Latvia, pp. 78–92 (2009)
Bouquet, P., Kuper, G.M., Scoz, M., Zanobini, S.: Asking and answering semantic queries. In: Workshop on Meaning Coordination and Negotiation Workshop (MCN 2004) in Conjunction with the 3rd International Semantic Web Conference (ISWC 2004), Hiroshima, Japan (2004)
Mandreoli, F., Martoglia, R., Villani, G., Penzo, W.: Flexible query answering on graph-modeled data. In: 12th International Conference on Extending Database Technology (EDBT 2009), Saint-Petersburg, Russia, pp. 216–227 (2009)
Dey, A.: Understanding and Using Context. Personal and Ubiquitous Computing Journal 5(1), 4–7 (2001)
Souza, D., Belian, R., Salgado, A.C., Tedesco, P.: Towards a Context Ontology to Enhance Data Integration Processes. In: 4th Workshop on Ontologies-based Techniques for DataBases in Information Systems and Knowledge Systems (ODBIS), Auckland, New Zealand, pp. 49–56 (2008)
Bucur, O., Beaune, P., Boissier, O.: Steps Towards Making Contextualized Decisions: How to Do What You Can, with What You Have, Where You Are. In: Second International Workshop on Modeling and Retrieval of Context (MRC), Edinburgh, Scotland, pp. 62–85 (2005)
Bolchini, C., Curino, C., Orsi, G., Quintarelli, E., Rossato, R., Schreiber, F., Tanca, L.: And what can context do for data? Communication of the ACM 52(11), 136–140 (2009)
Pires, C.E.: Semantic-based Connectivity in a Peer Data Management System. In: 6th Workshop of Thesis and Dissertation on Data Bases, held in Conjunction with the 22th Brazilian Symposium on Data Bases (SBBD 2008), João Pessoa, Brazil (2007)
The Free Network Project (2009), http://freenetproject.org/
Stoica, I., Morris, R., Karger, D., Kaashoek, M.F., Balakrishnan, H.: Chord: A Scalable Peer-to-Peer Lookup Service for Internet Applications. In: ACM SIGCOMM, San Diego, California, USA, pp. 149–160 (2001)
Yang, B., Garcia-Molina, H.: Designing a Super-Peer Network. In: 19th International Conference on Data Engineering (ICDE 2003), Bangalore, India, pp. 49–60 (2003)
Pires, C.E., Souza, D., Kedad, Z., Bouzeghoub, M., Salgado, A.C.: Using Semantics in Peer Data Management Systems. In: Colloquium of Computation: Brazil / INRIA, Cooperations, Advances and Challenges, Bento Gonçalves, Brazil, pp. 176–179 (2009)
Pires, C.E., Souza, D., Pachêco, T., Salgado, A.C.: A semantic-based ontology matching process for PDMS. In: Hameurlain, A., Tjoa, A.M. (eds.) Globe 2009. LNCS, vol. 5697, pp. 124–135. Springer, Heidelberg (2009)
Pires, C.E., Sousa, P., Kedad, Z., Salgado, A.C.: Summarizing Ontology-based Schemas in PDMS. In: Int. Workshop on Data Engineering meets the Semantic Web (DESWeb 2010) in conjunction with ICDE 2010, Long Beach, CA, USA, pp. 239–244 (2010)
Pires, C.E.: Ontology-based Clustering in a Peer Data Management System. PhD Thesis. Federal University of Pernambuco, Recife, Brazil (2009)
Euzenat, J., Shvaiko, P.: Ontology Matching. Springer, Heidelberg (2007)
Castano, S., Ferrara, A., Montanelli, S.: Matching ontologies in open networked systems: Techniques and applications. Journal on Data Semantics 3870, 25–63 (2006)
Borgida, A., Serafini, L.: Distributed description logics: Assimilating information from peer sources. In: Spaccapietra, S., March, S., Aberer, K. (eds.) Journal on Data Semantics I. LNCS, vol. 2800, pp. 153–184. Springer, Heidelberg (2003)
Aumüller, D., Do, H.H., Massmann, S., Rahm, E.: Schema and ontology matching with COMA++. In: International Conference on Management of Data (SIGMOD), Software Demonstration (2005)
Castano, S., Antonellis, V., Fugini, M.G., Pernici, B.: Conceptual Schema Analysis: Techniques and Applications. ACM Transactions on Database Systems 23(3), 286–333 (1998)
Rijsbergen, C.J.: Information Retrieval, 2nd edn. Stoneham, Butterworths (1979), http://www.dcs.gla.ac.uk/Keith/Preface.html
Baeza-Yates, R., Ribeiro-Neto, B.: Modern Information Retrieval. ACM Press/Addison-Wesley (1999)
Hartigan, J.A.: Clustering Algorithms. John Wiley and Sons, Inc., New York (1975)
Fisher, D.H., Xu, L., Zard, N.: Ordering effects in clustering. In: 9th International Conference on Machine Learning, Aberdeen, Scotland, pp. 163–168 (1992)
Baader, F., Calvanese, D., McGuinness, D., Nardi, D., Patel-Schneider, P. (eds.): The Description Logic Handbook: Theory, Implementation and Applications. Cambridge University Press, Cambridge (2003)
Souza, D.Y.: Using Semantics to Enhance Query Reformulation in Dynamic Distributed Environments. PhD Thesis. Federal University of Pernambuco, Recife, Brazil (2009)
Theodoridis, S., Koutroumbas, K.: Pattern Recognition, 2nd edn. Academic Press, London (2003)
Batistakis, Y., Halkidi, M., Vazirgiannis, M.: Cluster validity methods: Part I. Sigmod Record 31(12), 40–45 (2002)
Fowlkes, E., Mallows, C.: A method for comparing two hierarchical clusterings. Journal of the American Statistical Association 78(383), 569–576 (1983)
Batistakis, Y., Halkidi, M., Vazirgiannis, M.: Clustering validity checking methods: Part II. Sigmod Record 31(3), 19–27 (2002)
Jain, A.K., Murty, M.N., Flynn, P.J.: Data clustering: a review. ACM Computing Survey 31(3), 264–323 (1999)
Rousseeuw, P.J.: Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. Journal of Computational and Applied Mathematics 20, 53–65 (1987)
Halevy, A., Ives, Z., Suciu, D., Tatarinov, I.: Schema mediation for large-scale semantic data sharing. VLDB Journal 14(1), 68–83 (2005)
Löser, A., Naumann, F., Siberski, W., Nejdl, W., Thaden, U.: Semantic Overlay Clusters within Super-Peer Networks. In: International Workshop on Databases, Information Systems and Peer-to-Peer Computing (DBISP2P 2003), Berlin, Germany, pp. 33–47 (2003)
Conforti, G., Ghelli, G., Manghi, P., Sartiani, C.: Scalable Query Dissemination in XPeer. In: IDEAS 2007, pp. 199–207 (2007)
Faye, D., Nachouki, G., Valduriez, P.: Semantic Query Routing in SenPeer, a P2P Data Management System. In: Int. Conf. on Network-Based Information Systems (NBiS), Regensburg, Germany, pp. 365–374 (2007)
Adjiman, P., Goasdoué, F., Rousset, M.-C.: someRDFS in the semantic web. In: Spaccapietra, S., Atzeni, P., Fages, F., Hacid, M.-S., Kifer, M., Mylopoulos, J., Pernici, B., Shvaiko, P., Trujillo, J., Zaihrayeu, I. (eds.) Journal on Data Semantics VIII. LNCS, vol. 4380, pp. 158–181. Springer, Heidelberg (2007)
Li, J., Vuong, S.: OntSum: A Semantic Query Routing Scheme in P2P Networks Based on Concise Ontology Indexing. In: 21st International Conference on Advanced Networking and Applications, Niagara Falls, Canada, pp. 94–101 (2007)
Crespo, A., Garcia-Molina, H.: Semantic Overlay Networks for P2P Systems. Technical Report, Stanford University (2002)
Montanelli, S., Castano, S.: Semantically Routing Queries in Peer-based Systems: the H-Link Approach. Knowledge Eng. Review 23(1), 51–72 (2008)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Souza, D., Pires, C.E., Kedad, Z., Tedesco, P., Salgado, A.C. (2011). A Semantic-Based Approach for Data Management in a P2P System. In: Hameurlain, A., Küng, J., Wagner, R. (eds) Transactions on Large-Scale Data- and Knowledge-Centered Systems III. Lecture Notes in Computer Science, vol 6790. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23074-5_3
Download citation
DOI: https://doi.org/10.1007/978-3-642-23074-5_3
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-23073-8
Online ISBN: 978-3-642-23074-5
eBook Packages: Computer ScienceComputer Science (R0)