
Semantic flooding

Semantic search across distributed lightweight ontologies

World Wide Web

Abstract

Classifications are trees where links between nodes codify the fact that a node lower in the hierarchy describes a topic (and contains documents about this topic) which is more specific than the topic of the node one level above. In turn, multiple classifications can be connected by semantic links which represent mappings among them and which can be computed, e.g. by ontology matching. In this paper, we describe how these two types of links can be used to define a semantic overlay network which can cover any number of peers and which can be flooded to perform a semantic search on documents, i.e. to perform semantic flooding. We have evaluated our approach by simulating networks of 10, 100, 1,000 and 10,000 peers containing classifications which are fragments of the DMoz web directory. The results are promising and show that, in our approach, only a relatively small number of peers needs to be queried in order to achieve high accuracy.
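As a rough illustration of the idea, the two link types can be pictured as edges of one graph that a query traverses. The sketch below is not the algorithm from the paper, which the abstract does not spell out; it is a plain breadth-first traversal under assumed names. `Node`, `semantic_flood`, the `max_peers` cut-off and the sample peers and documents are all hypothetical.

```python
from collections import deque
from dataclasses import dataclass, field


@dataclass
class Node:
    """One classification node hosted by one peer (hypothetical structure)."""
    peer: str
    topic: str
    documents: list = field(default_factory=list)
    children: list = field(default_factory=list)        # more specific nodes, same tree
    semantic_links: list = field(default_factory=list)  # mapped nodes in other trees


def semantic_flood(start, max_peers=None):
    """Breadth-first traversal over child links and semantic links.

    Returns (documents, peers_queried). max_peers is an assumed budget
    on how many distinct peers may be contacted.
    """
    documents, peers = [], []
    seen = {id(start)}
    queue = deque([start])
    while queue:
        node = queue.popleft()
        if node.peer not in peers:
            if max_peers is not None and len(peers) >= max_peers:
                continue  # budget used up: do not contact a new peer
            peers.append(node.peer)
        documents.extend(node.documents)
        for nxt in node.children + node.semantic_links:
            if id(nxt) not in seen:
                seen.add(id(nxt))
                queue.append(nxt)
    return documents, peers


# Two toy peers: A maps its "Physics" node to B's "Physics" node.
a_phys = Node("A", "Physics", documents=["a1.pdf"])
b_phys = Node("B", "Physics", documents=["b1.pdf"])
b_opt = Node("B", "Optics", documents=["b2.pdf"])
b_phys.children.append(b_opt)
a_phys.semantic_links.append(b_phys)

docs, peers = semantic_flood(a_phys)
limited_docs, limited_peers = semantic_flood(a_phys, max_peers=1)
```

In this toy setup the unrestricted flood reaches both peers through the single mapping, while the call with `max_peers=1` stays on the starting peer. A real system would presumably also weigh the kind of semantic relation on each link, which this sketch ignores.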



Author information

Corresponding author

Correspondence to Alethia Hume.

Additional information

A short version of this article with the title “Semantic Flooding: Search Over Semantic Links” was published at the 1st International Workshop on Data Engineering meets the Semantic Web (DESWeb2010).

About this article

Cite this article

Giunchiglia, F., Kharkevich, U. & Hume, A. Semantic flooding. World Wide Web 14, 651–669 (2011). https://doi.org/10.1007/s11280-010-0108-y

  • DOI: https://doi.org/10.1007/s11280-010-0108-y
