Web site personalization based on link analysis and navigational patterns

Published: 01 October 2007

Abstract

The continuous growth in the size and use of the World Wide Web calls for new methods of designing and developing online information services. Predicting users' needs is essential to improving the usability and user retention of a Web site, and personalization is one way to address it. Recommendation algorithms propose “next” pages to users based on their current visit and on the navigational patterns of past users. The vast majority of these algorithms, however, produce recommendations from usage data alone and disregard the structural properties of the Web graph. As a result, pages that are important in terms of PageRank authority score may be underrated. In this work, we present UPR, a PageRank-style algorithm that combines usage data and link analysis techniques to assign probabilities to Web pages based on their importance in the Web site's navigational graph. We propose applying a localized version of UPR (l-UPR) to personalized navigational subgraphs for online Web page ranking and recommendation. Moreover, we propose a hybrid probabilistic predictive model based on Markov models, in which link analysis is used to assign the prior probabilities. We show, through experimentation, that this approach yields more objective and representative predictions than pure usage-based approaches.
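
The abstract does not give the UPR formula, so the following is only a minimal sketch of the general idea of a usage-weighted, PageRank-style ranking. It assumes (this is an illustration, not the paper's definition) that the random surfer follows each link with probability proportional to how often past users traversed it, and teleports uniformly otherwise. The function name `usage_pagerank`, its parameters, and the sample traversal counts are all hypothetical.

```python
def usage_pagerank(transitions, damping=0.85, iterations=200, tol=1e-10):
    """PageRank-style scores over a navigational graph weighted by usage.

    transitions: dict mapping (src, dst) -> number of observed traversals.
    Assumption (not from the paper): link-following probability is
    proportional to the traversal count; teleportation is uniform.
    """
    pages = sorted({page for edge in transitions for page in edge})
    n = len(pages)

    # Keep only edges that were actually traversed, and total the
    # outgoing traversal weight of every page.
    edges = {edge: w for edge, w in transitions.items() if w > 0}
    out_weight = {page: 0.0 for page in pages}
    for (src, _dst), w in edges.items():
        out_weight[src] += w

    rank = {page: 1.0 / n for page in pages}
    for _ in range(iterations):
        # Rank held by pages with no outgoing traversals is spread uniformly.
        dangling = sum(rank[p] for p in pages if out_weight[p] == 0)
        base = (1.0 - damping) / n + damping * dangling / n
        new_rank = {page: base for page in pages}

        # Each page passes its rank along its links, in proportion to usage.
        for (src, dst), w in edges.items():
            new_rank[dst] += damping * rank[src] * w / out_weight[src]

        delta = sum(abs(new_rank[p] - rank[p]) for p in pages)
        rank = new_rank
        if delta < tol:
            break
    return rank


# Hypothetical traversal counts extracted from a Web server log.
sample_log = {
    ("home", "products"): 8,
    ("home", "about"): 2,
    ("products", "home"): 5,
    ("about", "home"): 1,
}
scores = usage_pagerank(sample_log)
for page, score in sorted(scores.items(), key=lambda item: -item[1]):
    print(f"{page}: {score:.4f}")
```

In this sketch the "products" page ends up ranked above "about" only because the link leading to it was traversed more often; with equal counts on every link the computation reduces to ordinary PageRank on the same graph.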


        Published In

        ACM Transactions on Internet Technology, Volume 7, Issue 4
        October 2007
        153 pages
        ISSN: 1533-5399
        EISSN: 1557-6051
        DOI: 10.1145/1278366
        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Author Tags

        1. Markov models
        2. Web personalization
        3. link analysis
        4. recommendations
        5. usage-based PageRank

        Qualifiers

        • Article

        Cited By

        • (2018) Proposing Logical Table Constructs for Enhanced Machine Learning Process. IEEE Access 6, 47751-47769. DOI: 10.1109/ACCESS.2018.2866046. Online publication date: 2018
        • (2015) Using High-Frequency Interaction Events to Automatically Classify Cognitive Load. Human Behavior, Psychology, and Social Interaction in the Digital Era, 210-228. DOI: 10.4018/978-1-4666-8450-8.ch010. Online publication date: 2015
        • (2015) An effective Web page recommender using binary data clustering. Information Retrieval 18:3, 167-214. DOI: 10.1007/s10791-015-9252-4. Online publication date: 1-Jun-2015
        • (2014) Defending Approach against Forceful Browsing in Web Applications. ICT and Critical Infrastructure: Proceedings of the 48th Annual Convention of Computer Society of India- Vol II, 651-659. DOI: 10.1007/978-3-319-03095-1_71. Online publication date: 2014
        • (2013) Network visualisation as a way to the web usage analysis. Aslib Proceedings 65:1, 40-53. DOI: 10.1108/00012531311297177. Online publication date: Jan-2013
        • (2011) New hybrid web personalization framework. 2011 IEEE 3rd International Conference on Communication Software and Networks, 86-92. DOI: 10.1109/ICCSN.2011.6014395. Online publication date: May-2011
        • (2011) A new recommendation algorithm using distributed learning automata and graph partitioning. 2011 11th International Conference on Hybrid Intelligent Systems (HIS), 351-357. DOI: 10.1109/HIS.2011.6122131. Online publication date: Dec-2011
        • (2010) A web personalizing technique using adaptive data structures. Journal of Systems and Software 83:11, 2200-2210. DOI: 10.1016/j.jss.2010.06.026. Online publication date: 1-Nov-2010
        • (2009) Variable Length Markov Chains for Web Usage Mining. Encyclopedia of Data Warehousing and Mining, Second Edition, 2031-2035. DOI: 10.4018/978-1-60566-010-3.ch310. Online publication date: 2009
        • (2009) MUADDIB. ACM Transactions on Information Systems 27:4, 1-41. DOI: 10.1145/1629096.1629102. Online publication date: 30-Nov-2009
