Skip to main content

Part of the book series: Studies in Computational Intelligence ((SCI,volume 362))

Abstract

Data sharing in large-scale Peer Data Management Systems (PDMS) is challenging due to the excessive number of data sites, their autonomous nature, and the heterogeneity of their schema. Existing PDMS query applications have difficulty to simultaneously achieve high recall rate and scalability. In this chapter, we propose an ontology-based sharing framework to improve the quality of data sharing and querying over large-scale distributed communities. In particular, we add a semantic layer to the PDMSs, which alleviates the semantic heterogeneity and assists the system to adjust its topology so that semantically related data sources can be connected. Moreover, we propose a comprehensive query routing and optimization scheme to enable scalable and efficient query evaluation. Simulation studies reveal that our proposed ontology-based data sharing framework effectively improves the query performance in terms of recall and scalability.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Bender, M., Crecelius, T., Kacimi, M., Miche, S., Xavier Parreira, J., Weikum, G.: Peer-to-peer information search: Semantic, social, or spiritual? IEEE Bulletin of Computer Society Technical Committee on Data Engineering 30(2), 51–60 (2007)

    Google Scholar 

  2. Chawathe, Y., Ratnasam, S., Breslau, L., Lanhan, N., Shenker, S.: Making gnutella-like p2p systems scalable. In: Proceedings of ACM SIGCOMM 2003 (2003)

    Google Scholar 

  3. Choi, N., Song, I.Y., Han, H.: A survey on ontology mapping. ACM Sigmod Record 35(3), 34–41 (2006)

    Article  Google Scholar 

  4. Gruber, T.R.: Principles for the design of ontologies used for knowledge sharing. International Journal Human-Computer Studies 43(3-4), 907–928 (1995)

    Article  Google Scholar 

  5. Halevy, A.Y.: Answering queries using views: A survey. The VLDB Journal 10(4) (2001)

    Google Scholar 

  6. Halevy, A., Ives, Z., Madhavan, J., Mork, P., Suciu, D., Tatarinov, I.: The piazza peer data management system. IEEE Transactions on Knowledge & Data Engineering (TKDE) 16(7), 787–798 (2004)

    Article  Google Scholar 

  7. Herschel, S., Heese, R.: Humboldt discoverer: A semantic p2p index for pdms. In: Proc. of the Workshop on Data Integration and the Semantic Web (2005)

    Google Scholar 

  8. Hose, K., Lemke, C., Sattler, K.U.: Processing relaxed skylines in pdms using distributed data summaries. In: Proc. of the 15th ACM International Conference on Information and Knowledge Management, pp. 425–434 (2006)

    Google Scholar 

  9. Hosea, K., Rothb, A., Zeitzc, A., Sattlera, K., Naumannb, F.: A research agenda for query processing in large-scale peer data management systems. Information Systems 33(7-8), 610–797 (2008)

    Google Scholar 

  10. Kalfoglou, Y., Schorlemmer, M.: Ontology mapping: the state of the art. The Knowledge Engineering Review 18, 1–31 (2003)

    Article  Google Scholar 

  11. Li, J., Khan, S.U.: MobiSN: Semantics-based mobile ad hoc social network framework. In: Proc. of the IEEE Global Communications Conference (Globecom 2009), Honolulu, HI, USA, pp. 1–6 (2009)

    Google Scholar 

  12. Li, J., Vuong, S.T.: SOON: A scalable self-organized overlay network for distributed information retrieval. In: De Turck, F., Kellerer, W., Kormentzas, G. (eds.) DSOM 2008. LNCS, vol. 5273, pp. 1–13. Springer, Heidelberg (2008)

    Chapter  Google Scholar 

  13. Li, J., Vuong, S.: Efa: an efficient content routing algorithm in large peer-to-peer overlay networks. In: Proc. of the Third IEEE International Conference on Peer-to-Peer Computing (2003)

    Google Scholar 

  14. Lodi, S., Penzo, W., Mandreoli, F., Martoglia, R., Sassatelli, S.: Semantic peer, here are the neighbors you want! In: Proc. of the 11th Extending Database Technology, pp. 26–37 (2008)

    Google Scholar 

  15. Miller, G.A., Beckwith, R., Fellbaum, C., Gross, D., Miller, K.J.: Introduction to wordnet: an on-line lexical database. In: Proc. of the International Journal of Lexicography (1990)

    Google Scholar 

  16. Pedersen, T., Patwardhan, S., Michelizzi, J.: WordNet: Similarity-measuring the relatedness of concepts. In: Proc. of the 19th National Conference on Artifical Intelligence, AAAI (2004)

    Google Scholar 

  17. Pires, C.E., Souza, D., Pachêco, T., Salgado, A.C.: A semantic-based ontology matching process for PDMS. In: Hameurlain, A., Tjoa, A.M. (eds.) Globe 2009. LNCS, vol. 5697, pp. 124–135. Springer, Heidelberg (2009)

    Chapter  Google Scholar 

  18. Rada, R., Mili, H., Bicknell, E., Blettner, M.: Development and application of a metric on semantic nets. IEEE Transaction on Systems, Man, and Cybernetics 19(1), 17–30 (1989)

    Article  Google Scholar 

  19. Ratnasamy, S., Francis, P., Handley, M., Karp, R., Shenker, S.: A scalable content-addressable network. In: Proc. of the ACM SIGCOMM, pp. 161–172 (2001)

    Google Scholar 

  20. Rowstron, A., Druschel, P.: Pastry: Scalable, distributed object location and routing for large-scale peer-to-peer systems. In: Proc. of the IFIP/ACM International Conference on Distributed Systems Platforms (2001)

    Google Scholar 

  21. Stoica, I., Morris, R., Karger, D., Kaashoek, M.F., Balakrishnan, H.: Chord: A scalable peer-to-peer lookup service for internet applications. In: ACM SIGCOMM, pp. 149–160 (2001)

    Google Scholar 

  22. Tatarinov, I., Ives, Z., Madhavan, J., Halevy, A., Suciu, D.: The Piazza Peer Data Management System. In: ACM Sigmod Record (2003)

    Google Scholar 

  23. Valduriez, P., Pacitti, E.: Data management in large-scale p2p systems. In: Proc. of the International Conference on High Performance Computing for Computational Science, pp. 104–118 (2004)

    Google Scholar 

  24. Zhao, B.Y., Kubiatowicz, J.D., Joseph, A.D.: Tapestry: An infrastructure for fault-tolerant wide-area location and routing. Tech. rep., UCB/CSD-01-1141 (2000)

    Google Scholar 

  25. Zipf, G.K.: Human Behaviour and the Principle of Least-Effort. Addison-Wesley, Cambridge (1949)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Li, J. et al. (2011). Efficient Data Sharing over Large-Scale Distributed Communities. In: Bouvry, P., González-Vélez, H., Kołodziej, J. (eds) Intelligent Decision Systems in Large-Scale Distributed Environments. Studies in Computational Intelligence, vol 362. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-21271-0_7

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-21271-0_7

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-21270-3

  • Online ISBN: 978-3-642-21271-0

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics