Abstract
A number of methods exists that measure the distance between two web pages. Average-Clicks is a new measure of distance between web pages which fits user’s intuition of distance better than the traditional measure of clicks between two pages. Average-Clicks however assumes that the probability of the user following any link on a web page is the same and gives equal weights to each of the out-going links. In our method “Usage Aware Average-Clicks” we have taken the user’s browsing behavior into account and assigned different weights to different links on a particular page based on how frequently users follow a particular link. Thus, Usage Aware Average-Clicks is an extension to the Average-Clicks Algorithm where the static web link structure graph is combined with the dynamic Usage Graph (built using the information available from the web logs) to assign different weights to links on a web page and hence capture the user’s intuition of distance more accurately. A new distance metric has been designed using this methodology and used to improve the efficiency of a web recommendation engine.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Broder, A., Kumar, R., Maghoul, F., Raghavan, P., Rajagopalan, S., Stata, R., Tomkins, A., Wiener, J.: Graph Structure in the web. In: Proc. 9th WWW conf. (2000)
Borodin, A., Gareth, O., Roberts, J.S., Rosenthal, P.T.: Finding authorities and hubs from link structures on the world wide web. In: World Wide Web, pp. 415–429 (2001)
Bose, A., Beemanapalli, K., Srivastava, J., Sahar, S.: Incorporating Concept hierarchies into Usage Mining Based Recommendations. In: Nasraoui, O., Spiliopoulou, M., Srivastava, J., Mobasher, B., Masand, B. (eds.) WebKDD 2006. LNCS (LNAI), vol. 4811, pp. 110–126. Springer, Heidelberg (2007)
Oztekin, B.U., Ertoz, L., Kumar, V., Srivastava, J.: Usage Aware PageRank (2003), http://www2003.org
Cooley, R., Srivastava, J., Mobasher, B.: Web Mining – Information and Pattern Discovery on the World wide Web. In: 9th IEEE International Conference on Tools with Artificial Intelligence (November 1997)
Ward Eric.: How Search Engines Use Link Analysis - A special report from the Search Engine Strategies 2001 Conference, November 14-15, Dallas, TX. (December 2001)
Google: http://www.google.com/
Kleinberg, J.M., Kumar, R., Raghavan, P., Rajagopalan, S., Tomkins, A.S.: The web as a Graph: measurements, models, and methods. In: Proc. of the International Conference on Combinatorics and Computing (1999)
Miller, J.C., Rae, G., Schaefer, F., Ward, L.A., LoFaro, T., Farahat, A.: Modifications of Kleinberg’s HITS algorithm using Matrix Exponentiation and Web log Records. In: SIGIR 2001. Proceeding of the 24th annual international ACM SIGIR conference onReseardh and development in information retrieval, pp. 444–445. ACM Press, New York,NY,USA (2001)
Kleinberg, J.M.: Authoritative sources in a hyperlinked environment. Journal of the ACM (JACM) 46(5), 604–632
Page, L., Brin, S., Motwani, R., Winograd, T.: The PageRank Citation Ranking: Bringing order to the Web. In: Stanford Digital Library Technologies Project (1998)
Lee, J.H., Kim, M.H., Lee, Y.J.: Information retrieval based on conceptual distance in IS-A hierarchies. Journal of Documentation 49(2), 188–207 (1993)
Richardson, M., Domingos, P.: The intelligent surfer: Probabilistic combination of link and content information in pagerank. In: Advances in Neural Information Processing Systems, vol. 14, MIT Press, Cambridge, MA (2002)
Brin, S., Page, L.: The anatomy of a large-scale hypertextual Web search engine. Computer Networks and ISDN Systems 30(1–7), 107–117 (1998)
Srivastava, J., Cooley, R., Deshpande, M.: Web usage mining: Discovery and applications of usage patterns from web data. SIGKDD Explorations 1(2), 12–23 (2000)
Haveliwala, T.: Topic-sensitive pagerank: A context-sensitive ranking algorithm for web search. IEEE Transactions on Knowledge and Data Engineering (2003)
University of Minnesota, Computer Science Department website, http://www.cs.umn.edu
Matsuo, Y., Ohsawa, Y., Ishizuka, M.: Average-Clicks. A New Measure on the World Wide Web-Journal of Intelligent Systems (2003)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Beemanapalli, K., Rangarajan, R., Srivastava, J. (2007). Incorporating Usage Information into Average-Clicks Algorithm. In: Nasraoui, O., Spiliopoulou, M., Srivastava, J., Mobasher, B., Masand, B. (eds) Advances in Web Mining and Web Usage Analysis. WebKDD 2006. Lecture Notes in Computer Science(), vol 4811. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-77485-3_2
Download citation
DOI: https://doi.org/10.1007/978-3-540-77485-3_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-77484-6
Online ISBN: 978-3-540-77485-3
eBook Packages: Computer ScienceComputer Science (R0)