Abstract
Hyperlink, or shortly link, analysis seeks to model the web structures and discover the relations among web sites or Web pages. The extracted models or relations can be used for the web mining applications, including market researches and various online businesses. It is well known that PageRank of Google’s search engine is one of the most successful stories of link analysis. In this paper, we investigate into the link structures among the sites, each of which is the collection of web pages in the same university domain in Korea. However, the PageRank algorithm cannot be directly applied to the ranking of a relatively small number of sites or communities since the transition probabilities from a node with a low out-degree significantly affect the whole rankings among the sites. We modify the original version of the PageRank algorithm in order to make it fit into the site ranking, we propose a site ranking algorithm, which is a modification of the PageRank algorithm. The experimental results show that our approach to the site ranking performs much better than PageRank.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Thewall, M.: Link Analysis: An Information Science Approach. Elsevier academic press, Amsterdam (2004)
Thewall, M.: Three Target Document Pange Metrics for Univerisy Web Sites. Journal of the American Society for Information Science and Technology, 489–496 (2003)
Charkrabarti, S.: Mining the web Discovering Knowledge from Hypertext Data. Morgan Kaufmann, San Francisco (2003)
Brin, S., Page, L.: The anatomy of a large-scale hypertextual Web search engine. In: Proc. 7th Int. World Wide Web Conference, Computer Networks and ISDN Systems, Brisbane, Australia, vol. 30, pp. 107–117 (1998)
Haveliwala, T.: Efficient Computation of PageRank. Technical Report, Stanford University (1999)
Haveliwala, T.: Topic-Sensitive PageRank: A Context-Sensitive Ranking Algorithm for Web Search. IEEE Transactions on Knowledge and Data Engineering 15(4), 784–796 (2003)
Cai, D., He, X., Wen, J.R., Ma, W.Y.: Block-level Link Analysis. In: Proceedings of the 27th Annual ACM SIGIR 04, pp. 440–447. ACM Press, New York (2004)
Jeh, G., Widom, G.: Scaling Personalized Web Search. Technical Report, Stanford University (2002)
Lu, Y., Zhang, B., Xi, W., Chen, Z., Liu, Y., Lyu, M.R., Ma, W.Y.: The PowerRank Web Link Analysis Algorithm. In: 13th WWW conference, pp. 254–255 (2004)
Baldi, P., Frasconi, P., Smyth, P.: Modeling the Internet and the Web. Wiley, Chichester (2003)
Najork, M., Heydon, A.: High-performance Web crawling. Tech. Rep. Research Report 173, Compaq SRC (2001)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kim, K., Kang, M., Choi, Y. (2007). A Site-Ranking Algorithm for a Small Group of Sites. In: Gervasi, O., Gavrilova, M.L. (eds) Computational Science and Its Applications – ICCSA 2007. ICCSA 2007. Lecture Notes in Computer Science, vol 4706. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74477-1_37
Download citation
DOI: https://doi.org/10.1007/978-3-540-74477-1_37
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-74475-7
Online ISBN: 978-3-540-74477-1
eBook Packages: Computer ScienceComputer Science (R0)