Abstract
Data sharing in large-scale Peer Data Management Systems (PDMSs) is challenging because of the large number of data sites, their autonomous nature, and the heterogeneity of their schemas. Existing PDMS query applications have difficulty achieving a high recall rate and scalability at the same time. In this chapter, we propose an ontology-based sharing framework to improve the quality of data sharing and querying over large-scale distributed communities. In particular, we add a semantic layer to the PDMS, which alleviates semantic heterogeneity and helps the system adjust its topology so that semantically related data sources are connected. Moreover, we propose a comprehensive query routing and optimization scheme to enable scalable and efficient query evaluation. Simulation studies show that the proposed ontology-based data sharing framework effectively improves query performance in terms of recall and scalability.
© 2011 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Li, J. et al. (2011). Efficient Data Sharing over Large-Scale Distributed Communities. In: Bouvry, P., González-Vélez, H., Kołodziej, J. (eds) Intelligent Decision Systems in Large-Scale Distributed Environments. Studies in Computational Intelligence, vol 362. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-21271-0_7
DOI: https://doi.org/10.1007/978-3-642-21271-0_7
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-21270-3
Online ISBN: 978-3-642-21271-0
eBook Packages: Engineering, Engineering (R0)