Abstract
The analysis of user behavior on the Web presupposes a reliable reconstruction of the users’ navigational activities. Cookies and server-generated session identifiers have been designed to allow an accurate session reconstruction. However, in the absence of reliable methods, analysts must employ heuristics (a) to identify unique visitors to a site, and (b) to distinguish among the activities of such users during independent sessions. The characteristics of the site, such as the site structure, as well as the methods used for data collection (e.g., the existence of cookies and reliable synchronization across multiple servers) may necessitate the use of different types of heuristics. In this study, we extend our work on the reliability of sessionizing mechanisms, by investigating the impact of site structure on the quality of constructed sessions. Specifically, we juxtapose sessionizing on a frame-based and a frame-free version of a site. We investigate the behavior of cookies, server-generated session identification, and heuristics that exploit session duration, page stay time and page linkage. Different measures of session reconstruction quality, as well as experiments on the impact on the prediction of frequent entry and exit pages, show that different reconstruction heuristics can be recommended depending on the characteristics of the site. We also present first results on the impact of session reconstruction heuristics on predictive applications such as Web personalization.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Berendt, B., Mobasher, B., Spiliopoulou, M., Wiltshire, J.: Measuring the accuracy of sessionizers for web usage analysis. In: Proceedings of the Workshop on Web Mining, First SIAM International Conference on Data Mining, Chicago, IL, pp. 7–14 (2001)
Berendt, B., Spiliopoulou, M.: Analysis of navigation behaviour in web sites integrating multiple information systems. The VLDB Journal 9, 56–75 (2000)
Catledge, L., Pitkow, J.: Characterizing browsing behaviors on the world wide web. Computer Networks and ISDN Systems 26, 1065–1073 (1995)
Cooley, R., Mobasher, B., Srivastava, J.: Data preparation for mining world wide web browsing patterns. Journal of Knowledge and Information Systems 1, 5–32 (1999)
Mobasher, B., Cooley, R., Srivastava, J.: Automatic personalization based on web usage mining. Communications of the ACM 43(8), 142–151 (2000)
Mobasher, B., Dai, H., Luo, T., Nakagawa, M.: Discovery and evaluation of aggregate usage profiles forWeb personalization. Data Mining and Knowledge Discovery 6, 61–82 (2002)
Padmanabhan, B., Zheng, Z., Kimbrough, S.O.: Personalization from incomplete data: What you don’t know can hurt. In: Proceedings of ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD 2001), San Francisco, CA, August 2001, pp. 154–163 (2001)
Spiliopoulou, M., Faulstich, L.C.: WUM: A tool for web utilization analysis. In: Atzeni, P., Mendelzon, A.O., Mecca, G. (eds.) WebDB 1998. LNCS, vol. 1590, pp. 184–203. Springer, Heidelberg (1999)
Spiliopoulou, M., Mobasher, B., Berendt, B., Nakagawa, M.: A framework for the evaluation of session reconstruction heuristics in Webusage analysis. INFORMS Journal on Computing 15, 171–190 (2003)
World Wide Web Committee Web Usage Characterization Activity. W3C Working Draft: Web Characterization Terminology & Definitions Sheet (1999), http://www.w3.org/1999/05/WCA-terms/
Zheng, Z., Padmanabhan, B., Kimbrough, S.: On the existence and significance of data preprocessing biases in Web-usage mining. INFORMS Journal on Computing 15, 148–170 (2003)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Berendt, B., Mobasher, B., Nakagawa, M., Spiliopoulou, M. (2003). The Impact of Site Structure and User Environment on Session Reconstruction in Web Usage Analysis. In: Zaïane, O.R., Srivastava, J., Spiliopoulou, M., Masand, B. (eds) WEBKDD 2002 - Mining Web Data for Discovering Usage Patterns and Profiles. WebKDD 2002. Lecture Notes in Computer Science(), vol 2703. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-39663-5_10
Download citation
DOI: https://doi.org/10.1007/978-3-540-39663-5_10
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-20304-9
Online ISBN: 978-3-540-39663-5
eBook Packages: Springer Book Archive