Abstract
The web is a vast graph built of hundreds of millions of web pages and over a billion links. Directly or indirectly, each of these links has been written by hand, and, despite the amount of duplication among links, is the result of an enormous effort by web authors.
One has to ask if it is possible that some of this labour can be automated. That is, can we automate some of the effort required to create and maintain links between pages? In recent work, we described FWEB, a system capable of automating link creation using publish/subscribe communication among a peer-to-peer network of web servers. This allowed web servers to match information about link requirements and page content in circumstances where we specify an anchor in terms of what content we want to link to, rather than a specific URL. When such a match is successful, a link between the pages is automatically created.
However, this system relied on simple keyword-based descriptions, and has several drawbacks, verified by experiment. In this paper, we show how the use of shared ontologies can improve the process of matching the content requirements for links and the descriptions of web pages. We report on our experience of using FWEB and, in addition, show how the capabilities of the FWEB architecture can be extended to include link personalization and explicit backlinks.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Carzaniga, A., Rosenblum, D., Wolf, A.: Content-based addressing and routing: A general model and its application (2000)
Carzaniga, A., Rosenblum, D.S., Wolf, A.L.: Design and evaluation of a wide-area event notification service. ACM Transactions on Computer Systems 19(3), 332–383 (2001)
Courtenage, S., Williams, S.: Automatic hyperlink creation using p2p and publish/subscribe. In: Workshop on Peer-to-Peer and Agent Infrastructures for Knowledge Management (PAIKM), Kaiserlautern, Germany (April 2005)
Courtenage, S.A.: Specifying and detecting composite events in content-based publish/subscribe systems. In: 1st International Workshop on Discrete Event-Based Systems (June 2002)
Dabek, F., Brunskill, E., Kaashoek, M.F., Karger, D., Morris, R., Stoica, I., Balakrishnan, H.: Building peer-to-peer systems with Chord, a distributed lookup service. In: IEEE (ed.) Eighth IEEE Workshop on Hot Topics in Operating Systems (HotOS-VIII), Schloss Elmau, Germany, 1109 Spring Street, Suite 300, Silver Spring, MD 20910, USA, May 20–23, 2001, pp. 81–86. IEEE Computer Society Press, Los Alamitos (2001)
Ehrig, M., Haase, P., van Harmelen, F., Siebes, R., Staab, S., Stuckenschmidt, H., Studer, R., Tempich, C.: The swap data and metadata model for semantics-based peer-to-peer systems. In: Schillo, M., Klusch, M., Müller, J., Tianfield, H. (eds.) MATES 2003. LNCS (LNAI), vol. 2831, pp. 144–155. Springer, Heidelberg (2003)
Eugster, P., Felber, P., Guerraoui, R., Kermarrec, A.-M.: The many faces of publish/subscribe (2001)
Fabret, F., Llirbat, F., Pereira, J., Shasha, D.: Efficient matching for content-based publish/subscribe systems. Technical report, INRIA (2000), http://wwwcaravel.inria.fr/pereira/matching.ps
Humphreys, B., Lindberg, D.: The umls project: making the conceptual connection between users and the information they need. Bulletin of the Medical Library Association 81(2), 170–177 (1993)
Idreos, S., Koubarakis, M., Tryfonopoulos, T.: P2P-DIET: an extensible P2P service that unifies ad-hoc and continuous querying in super-peer networks. In: Weikum, G., König, A.C., Dessloch, S. (eds.) Proceedings of the 2004 ACM SIGMOD International Conference on Management of Data (SIGMOD 2004), New York, June 13–18, 2004, pp. 933–934. ACM Press, New York (2004)
Kleinberg, J.M., Kumar, R., Raghavan, P., Rajagopalan, S., Tomkins, A.S.: The web as a graph: Measurements, models, and methods. In: Asano, T., Imai, H., Lee, D.T., Nakano, S.-i., Tokuyama, T. (eds.) COCOON 1999. LNCS, vol. 1627, p. 1. Springer, Heidelberg (1999)
Lewis, D., Feeney, K., Tiropanis, K.T., Courtenage, S.: An active, ontology-driven network service for internet collaboration. In: Workshop on Application of Semantic Web Technologies to Web Communities (SWWC) at ECAI 2004 (August 2004)
Pietzuch, P.R., Bacon, J.M.: Hermes: A Distributed Event-Based Middleware Architecture. In: Proc. of the 1st Int. Workshop on Distributed Event-Based Systems (DEB 2002), Vienna, Austria, July 2002, pp. 611–618 (2002)
Schlosser, M., Sintek, M., Decker, S., Nejdl, W.: A scalable and ontology-based p2p infrastructure for semantic web services. In: Second IEEE International Conference on Peer-to-Peer Computing, P2P 2002 (2002)
Segall, B., Arnold, D.: Elvin has left the building: A publish/subscribe notification service with quenching. In: Proceedings of AUUUG 1997 (1997)
Stoica, I., Morris, R., Karger, D., Kaashock, M., Balakrishman, H.: Chord: A scalable peer-to-peer lookup protocol for internet applications. In: Proceedings of the ACM SIGCOMM, August 2001, pp. 149–160 (2001)
Strom, R., Banavar, G., Chandra, T., Kaplan, M., Miller, K., Mukherjee, B., Sturman, D., Ward, M.: Gryphon: An information flow based approach to message brokering (1998)
Terpstra, W.W., Behnel, S., Fiege, L., Zeidler, A., Buchmann, A.P.: A peer-to-peer approach to content-based publish/subscribe. In: Proceedings of the 2nd international workshop on Distributed event-based systems, pp. 1–8. ACM Press, New York (2003)
Triantafillou, P., Aekaterinidis, I.: Content-based publish/subscribe over structured p2p networks. In: 1st International Workshop on Discrete Event-Based Systems (May 2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Courtenage, S., Williams, S. (2005). Semantic-Based Matching and Personalization in FWEB, a Publish/Subscribe-Based Web Infrastructure. In: Meersman, R., Tari, Z. (eds) On the Move to Meaningful Internet Systems 2005: CoopIS, DOA, and ODBASE. OTM 2005. Lecture Notes in Computer Science, vol 3760. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11575771_25
Download citation
DOI: https://doi.org/10.1007/11575771_25
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29736-9
Online ISBN: 978-3-540-32116-3
eBook Packages: Computer ScienceComputer Science (R0)