Abstract
Knowledge representation and extraction techniques can be efficiently used to improve data modeling and IR functionalities of P2P Information Systems, which have recently attracted a lot of attention from industrial and academic researchers. These functionalities can be achieved by pushing semantics in both data and queries, and exploiting the derived expressiveness to improve file sharing primitives and lookup mechanisms made available from first-generation P2P systems. XML-based P2P Information Systems are a more specific and interesting instance of this class of systems, where the overall data domain is composed by very large, Internet-like distributed XML repositories from which users extract useful knowledge manly by means of IR methodologies implemented on the top of XML join queries. This paper focuses on several aspects of XML-based P2P Information Systems, raging from foundations and definitions to knowledge representation and extraction models and algorithms, along with their experimental evaluation. However, the results presented in this paper can also be adapted to deal with any kind of data format (e.g., HTML).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Aberer, K.: P-Grid: A Self-Organizing Access Structure for P2P Information Systems. In: Batini, C., Giunchiglia, F., Giorgini, P., Mecella, M. (eds.) CoopIS 2001. LNCS, vol. 2172, pp. 179–194. Springer, Heidelberg (2001)
Aberer, K., Despotovic, Z.: Managing Trust in a Peer-2-Peer Information System. In: Proc. of ACM CIKM, pp. 310–317 (2001)
Cai, M., Frank, M.: RDFPeers: A Scalable Distributed RDF Repository based on a Structured Peer-to-Peer Network. In: Proc. of ACM WWW, pp. 650–657 (2004)
Callan, J.: Distributed Information Retrieval. In: Advances in Information Retrieval, pp. 127–150. Kluwer Academic Publishers, Dordrecht (2000)
Crespo, A., Garcia-Molina, H.: Semantic Overlay Networks for P2P Systems. Stanford Technical Report, Computer Science Department, Stanford University (2003)
Deerwester, S., Dumais, S.T., Furnas, G.W., Landauer, T.K., Harshman, R.: Indexing by Latent Semantic Analysis. Journal of the American Society for Information Science 41(6), 391–407 (1999)
The Gnutella File Sharing System. Web pages available at: http://gnutella.wego.com
Gupta, A., Agrawal, D., El Abbadi, A.: Approximate Range Selection Queries in Peer-to-Peer Systems. In: Proc. CIDIR (2003), online edition available at: http://wwwdb.cs.wisc.edu/cidr/cidr2003/program/p13.pdf
Halaschek, C., Aleman-Meza, B., Arpinar, I.B., Sheth, A.P.: Discovering and Ranking Semantic Associations over a Large RDF Metabase. In: Proc. of VLDB, pp. 1317–1320 (2004)
Halevy, A.Y., Ives, Z.G., Mork, P., Tatarinov, I.: Piazza: Data Management Infrastructure for Semantic Web Applications. In: Proc. of ACM WWW, pp. 556–567 (2003)
Kalogeraki, V., Gunopulos, D., Zeinalipour-Yazti, D.: A Local Search Mechanism for Peer-to-Peer Networks. In: Proc. of ACM CIKM 2002, pp. 300–307 (2002)
Li, M., Lee, W.-C., Sivasubramaniam, A.: Neighborhood Signatures for Searching P2P Networks. In: Proc. of IEEE IDEAS, pp. 149–159 (2003)
Lv, Q., Cao, P., Cohen, E., Li, K., Shenker, S.: Search and Replication in Unstructured Peer-to-Peer Networks. In: Proc. of ACM ICS, pp. 84–95 (2002)
Nejdl, W., Wolpers, M., Siberski, W., Schmitz, C., Schlosser, M., Brunkhorst, I., Loser, A.: Super-Peer-based Routing and Clustering Strategies for RDF-based P2P Networks. In: Proc. of the ACM WWW, pp. 536–543 (2003)
Schmidt, A., Waas, F., Kersten, M., Carey, M., Manolescu, I., Busse, R.: XMark: A Benchmark for XML Data Management. In: Proc. of VLDB, pp. 974–985 (2002)
Tsoumakos, D., Roussopoulos, N.: A Comparison of Peer-to-Peer Search Methods. In: Proc. of ACM WebDB, pp. 61–66 (2003)
Zeinalipour-Yazti, D., Kalogeraki, V., Gunopulos, D.: Information Retrieval Techniques for Peer-to-Peer Networks. IEEE CiSE Magazine, Special Issue on Web Engineering 30(4), 12–20 (2004)
Zeinalipour-Yazti, D., Kalogeraki, V., Gunopulos, D.: Exploiting Locality for Scalable Information Retrieval in Peer-to-Peer Systems. Information Systems 30(4), 277–298 (2005)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Cuzzocrea, A. (2006). On Semantically-Augmented XML-Based P2P Information Systems. In: Larsen, H.L., Pasi, G., Ortiz-Arroyo, D., Andreasen, T., Christiansen, H. (eds) Flexible Query Answering Systems. FQAS 2006. Lecture Notes in Computer Science(), vol 4027. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11766254_37
Download citation
DOI: https://doi.org/10.1007/11766254_37
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-34638-8
Online ISBN: 978-3-540-34639-5
eBook Packages: Computer ScienceComputer Science (R0)