Abstract
One of the main issues in Web usage mining is the discovery of patterns in the navigational behavior of Web users. Standard approaches, such as clustering of users’ sessions and discovering association rules or frequent navigational paths, do not generally allow to characterize or quantify the unobservable factors that lead to common navigational patterns. Therefore, it is necessary to develop techniques that can discover hidden and useful relationships among users as well as between users and Web objects. Correspondence Analysis (CO-AN) is particularly useful in this context, since it can uncover meaningful associations among users and pages. We present a model-based cluster analysis for Web users sessions including a novel visualization and interpretation approach which is based on CO-AN.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Baldi, P., Frasconi, P., Smyth, P.: Modeling the Internet and the Web. Wiley Press, USA (2003)
Cadez, I.V., Heckerman, D., Meek, C., Smyth, P., White, S.: Model-based Clustering and Visualization of Navigation Patterns on a Web Site. Journal of Data Mining and Knowledge Discovery 7(4), 399–424 (2003)
Chen, Z., Fu, A., Tong, F.: Optimal Algorithms for Finding User Access Sessions from very Large Web Logs. World Wide Web: Internet and Information Systems 6, 259–279 (2003)
Chen, K., Liu, L.: Validating and Refining Clusters via Visual Rendering. In: Proc. of the International Conference on Data Mining (ICDM 2003), Melbourne, Florida, December 2003, pp. 501–504 (2003)
Dempster, A.P., Lsird, N.M., Rubin, D.B.: Maximum Likelihood from Incomplete Data via the EM Algorithm. Statistics Society B 39, 1–22 (1977)
Fraley, C., Raftery, A.: How Many Clusters? Which Clustering Method? Answers via Model-based Cluster Analysis. Computer Journal 41, 578–588 (1998)
Greenacre, M.J.: Correspondence Analysis in Practice. Academic Press, London (1993)
Gunter, S., Bunke, H.: Validation Indices for Graph Clustering. Pattern Recognition Letters 24(8), 1107–1113 (2003)
Halkidi, M., Batistakis, Y., Vazirgiannis, M.: On Clustering Validation Techniques. J. Intell. Inf. Syst. 17(2-3), 107–145 (2001)
Kohavi, R.: Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection. In: Proc. of the 14th International Joint Conference on A.I., Canada, vol. 2 (1995)
Pallis, G., Angelis, L., Vakali, A., Pokorny, J.: A Probabilistic Validation Algorithm for Web Users’ Clusters. In: Proc. of the IEEE International Conference on Systems, Man and Cybernetics (SMC 2004), Hague, Holland (2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Pallis, G., Angelis, L., Vakali, A. (2005). Model-Based Cluster Analysis for Web Users Sessions. In: Hacid, MS., Murray, N.V., Raś, Z.W., Tsumoto, S. (eds) Foundations of Intelligent Systems. ISMIS 2005. Lecture Notes in Computer Science(), vol 3488. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11425274_23
Download citation
DOI: https://doi.org/10.1007/11425274_23
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-25878-0
Online ISBN: 978-3-540-31949-8
eBook Packages: Computer ScienceComputer Science (R0)