Abstract
Web graph is a tool to visualize web information as network. It unfurls inherent connectivity of the web for end users from a different viewpoint. The enlarged size of the web causes the information overload problem and forces the wide use of compression techniques such as filtering and clustering on graphs during presentation of web information. In addition, the Internet users, their intentions and activities on the web differ. User interest-based web graph, which is modulated by user interests during construction, is used to accommodate differences over end users and/or their needs. However, user interest-based web graph features an unorthodox way to present connectivities among nodes by utilizing edge labels. This complicates further operations such as clustering and focused visualization on web graphs. This paper introduces a novel approach to cluster user interest-based web graphs by adopting the divide and conquer strategy. It is demonstrated that, this approach can effectively cluster the user interest-based web graph.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Avrachenkov, K., Dobrynin, V., Nemirovsky, D., Pham, S.K., Smirnova, E.: Pagerank based clustering of hypertext document collections. In: SIGIR, pp. 873–874. ACM (2008)
Bekkerman, R., Zilberstein, S., Allan, J.: Web page clustering using heuristic search in the web graph. In: IJCAI, pp. 2280–2285. MORGKAUF (2007)
Brisaboa, N.R., Ladra, S., Navarro, G.: Compact representation of web graphs with extended functionality. Inf. Syst. 39, 152–174 (2014)
Hartigan, J.A.: Clustering Algorithms, 99th edn. John Wiley & Sons, Inc., New York (1975)
Huang, X., Lai, W.: Clustering graphs for visualization via node similarities. J. Vis. Lang. Comput. 17(3), 225–253 (2006)
Jain, A.K.: Data clustering: 50 years beyond k-means. Pattern Recogn. Lett. 31(8), 651–666 (2010)
Lin, D.: An information-theoretic definition of similarity. In: ICML, pp. 296–304. ACM (1998)
Manning, C.D., Raghavan, P., Schütze, H.: Introduction to information retrieval, vol. 1. Cambridge University Press, Cambridge (2008)
Meyerhenke, H., Sanders, P., Schulz, C.: Partitioning complex networks via size-constrained clustering. CoRR abs/1402.3281 (2014)
Rattigan, M.J., Maier, M., Jensen, D.: Graph clustering with network structure indices. In: ICML, pp. 783–790. ACM (2007)
Saleheen, S., Lai, W.: A new type of web graph for personalized visualization. In: PacificVis, pp. 238–242. IEEE (2014)
Savas, B., Dhillon, I.S.: et al.: Clustered low rank approximation of graphs in information science applications. In: SDM, pp. 164–175. SIAM (2011)
Schaeffer, S.E.: Survey: Graph clustering. Comput. Sci. Rev. 1(1), 27–64 (2007)
Sun, Y., Yu, Y., Han, J.: Ranking-based clustering of heterogeneous information networks with star network schema. In: KDD, pp. 797–806. ACM (2009)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Saleheen, S., Lai, W. (2014). An Interest-Based Clustering Method for Web Information Visualization. In: Luo, X., Yu, J.X., Li, Z. (eds) Advanced Data Mining and Applications. ADMA 2014. Lecture Notes in Computer Science(), vol 8933. Springer, Cham. https://doi.org/10.1007/978-3-319-14717-8_33
Download citation
DOI: https://doi.org/10.1007/978-3-319-14717-8_33
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-14716-1
Online ISBN: 978-3-319-14717-8
eBook Packages: Computer ScienceComputer Science (R0)