Abstract
Classifications are trees in which a link between two nodes encodes the fact that the lower node describes a topic (and contains documents about that topic) more specific than the topic of the node one level above. Multiple classifications can in turn be connected by semantic links, which represent mappings among them and can be computed, e.g., by ontology matching. In this paper, we describe how these two types of links can be used to define a semantic overlay network that can cover any number of peers and that can be flooded to perform a semantic search on documents, i.e., to perform semantic flooding. We evaluated our approach by simulating networks of 10, 100, 1,000 and 10,000 peers containing classifications that are fragments of the DMoz web directory. The results are promising: they show that, in our approach, only a relatively small number of peers needs to be queried to achieve high accuracy.
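The idea of flooding over the two link types can be illustrated with a minimal sketch. It is not the algorithm evaluated in the article, which the abstract does not specify. It only assumes that each classification node holds documents, tree links to more specific child nodes, and mapping links to nodes in other peers' classifications. The names `Node`, `semantic_flood` and `max_peers` are hypothetical and introduced here for illustration.

```python
from collections import deque
from dataclasses import dataclass, field


@dataclass
class Node:
    """A classification node: a topic label, its documents, and outgoing links."""
    peer: str
    label: str
    docs: list = field(default_factory=list)
    children: list = field(default_factory=list)  # tree links to more specific topics
    mappings: list = field(default_factory=list)  # semantic links to nodes on other peers


def semantic_flood(start, max_peers=None):
    """Breadth-first flood from `start` over tree links and mapping links.

    Returns the documents found and the set of peers that were contacted.
    `max_peers` is a hypothetical budget on how many peers may be queried.
    """
    docs, peers, seen = [], set(), {id(start)}
    queue = deque([start])
    while queue:
        node = queue.popleft()
        if node.peer not in peers:
            if max_peers is not None and len(peers) >= max_peers:
                continue  # budget exhausted: skip nodes on peers not yet contacted
            peers.add(node.peer)
        docs.extend(node.docs)
        for nxt in node.children + node.mappings:
            if id(nxt) not in seen:
                seen.add(id(nxt))
                queue.append(nxt)
    return docs, peers


# Toy network (assumed data): peerA's "Jazz" node is mapped to a node on peerB;
# peerC holds an unrelated classification and is never reached.
music = Node("peerA", "Music", ["a1"])
jazz = Node("peerA", "Jazz", ["a2"])
music.children.append(jazz)
jazz_b = Node("peerB", "Jazz recordings", ["b1"])
jazz.mappings.append(jazz_b)
sports = Node("peerC", "Sports", ["c1"])

docs, peers = semantic_flood(music)
local_docs, local_peers = semantic_flood(music, max_peers=1)
```

In this toy network the flood starting at "Music" reaches only the peers connected through tree or mapping links, so the unrelated peer is not queried. How the start node is selected for a query and how results are ranked are left out of the sketch.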
Additional information
A short version of this article with the title “Semantic Flooding: Search Over Semantic Links” was published at the 1st International Workshop on Data Engineering meets the Semantic Web (DESWeb2010).
About this article
Cite this article
Giunchiglia, F., Kharkevich, U. & Hume, A. Semantic flooding. World Wide Web 14, 651–669 (2011). https://doi.org/10.1007/s11280-010-0108-y
DOI: https://doi.org/10.1007/s11280-010-0108-y