Exploring similarity among Web pages using the hyperlink structure | IEEE Conference Publication | IEEE Xplore

Abstract:

Hyperlinks inside HTML pages contain a wealth of information about the relationships among Web pages. Given a set of Web pages, we can explore the hyperlink relationships among these pages. This paper first provides formal definitions of hyperlink relations. We then use this notation to define similarity between two Web pages and between two sets of Web pages. For each case, we provide several definitions of similarity using forward and backward links. Each similarity measure yields a number between 0 and 1. We also demonstrate how to use the similarity measure to study clustering within a set of pages and to determine the "diversity" of a set of Web pages.
Date of Conference: 05-07 April 2004
Date Added to IEEE Xplore: 24 August 2004
Print ISBN: 0-7695-2108-8
Conference Location: Las Vegas, NV, USA
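
The abstract does not give the paper's actual similarity definitions, so the following is only an illustrative sketch of the general idea: a link-based similarity between two pages that falls between 0 and 1. It assumes a Jaccard overlap of forward-link sets and of backward-link sets, averaged together, and it assumes "diversity" can be read as one minus the mean pairwise similarity. The function names (`backward_links`, `jaccard`, `page_similarity`, `diversity`) and the toy graph are hypothetical and are not taken from the paper.

```python
from itertools import combinations
from typing import Dict, List, Set

# A link graph maps each page to the set of pages it links to (forward links).
Graph = Dict[str, Set[str]]


def backward_links(forward: Graph) -> Graph:
    """Invert the forward-link graph: page -> set of pages linking to it."""
    back: Graph = {page: set() for page in forward}
    for src, targets in forward.items():
        for dst in targets:
            back.setdefault(dst, set()).add(src)
    return back


def jaccard(a: Set[str], b: Set[str]) -> float:
    """Jaccard overlap in [0, 1]; two empty sets are treated as 0.0 (assumption)."""
    union = a | b
    if not union:
        return 0.0
    return len(a & b) / len(union)


def page_similarity(p: str, q: str, forward: Graph, back: Graph) -> float:
    """Assumed measure: mean of forward-link and backward-link Jaccard overlap."""
    fwd = jaccard(forward.get(p, set()), forward.get(q, set()))
    bwd = jaccard(back.get(p, set()), back.get(q, set()))
    return (fwd + bwd) / 2


def diversity(pages: List[str], forward: Graph, back: Graph) -> float:
    """Assumed measure: 1 minus the mean pairwise similarity of the set."""
    pairs = list(combinations(pages, 2))
    if not pairs:
        return 0.0
    mean_sim = sum(page_similarity(p, q, forward, back) for p, q in pairs) / len(pairs)
    return 1.0 - mean_sim


# Toy graph (hypothetical): pages a and b both link to c and d; c links to e.
forward = {"a": {"c", "d"}, "b": {"c", "d"}, "c": {"e"}, "d": set(), "e": set()}
back = backward_links(forward)

print(page_similarity("a", "b", forward, back))  # same forward links, no backward links
print(page_similarity("c", "d", forward, back))  # same backward links, different forward links
print(diversity(["a", "b", "c", "d", "e"], forward, back))
```

Averaging the two overlaps is only one possible way to combine forward and backward links; the paper states that it gives several definitions, which could weight or combine the two link directions differently.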
