Abstract
Session reconstruction is an essential step in Web usage mining, and the quality of the reconstructed sessions directly affects the mining results. This paper presents a new approach to reconstructing sessions from Web server logs using a Markov chain model combined with a competitive algorithm. The proposed approach can reconstruct interleaved sessions from server logs, and it remains robust even when the client IP is not available in the log file. This capability distinguishes our work from other session reconstruction methods. Experiments show that our approach significantly improves the reconstruction of interleaved sessions compared with traditional methods.
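To make the general idea concrete, the following is a minimal sketch, not the authors' algorithm, of how a first-order Markov chain can separate interleaved sessions without client IPs. Everything here is an assumption made for illustration: the function names `train_transitions` and `reconstruct`, the `threshold` parameter, the toy page labels, and the greedy rule in which open sessions "compete" for each request by the transition probability from their last page.

```python
from collections import defaultdict

def train_transitions(sessions):
    """Estimate first-order transition probabilities P(next | current)
    from known (training) sessions. Hypothetical helper for illustration."""
    counts = defaultdict(lambda: defaultdict(int))
    for session in sessions:
        for cur, nxt in zip(session, session[1:]):
            counts[cur][nxt] += 1
    probs = {}
    for cur, nxts in counts.items():
        total = sum(nxts.values())
        probs[cur] = {nxt: n / total for nxt, n in nxts.items()}
    return probs

def reconstruct(requests, probs, threshold=0.1):
    """Greedily assign each request to the open session whose last page
    gives the highest transition probability to it. If no session scores
    above the (assumed) threshold, the request starts a new session."""
    sessions = []
    for page in requests:
        best, best_p = None, threshold
        for s in sessions:
            p = probs.get(s[-1], {}).get(page, 0.0)
            if p > best_p:
                best, best_p = s, p
        if best is None:
            sessions.append([page])
        else:
            best.append(page)
    return sessions

# Toy data: two navigation paths observed in training.
training = [["A", "B", "C"], ["A", "B", "C"], ["X", "Y", "Z"], ["X", "Y", "Z"]]
probs = train_transitions(training)

# Requests from two users arrive interleaved, with no IP to tell them apart.
interleaved = ["A", "X", "B", "Y", "C", "Z"]
result = reconstruct(interleaved, probs)
print(result)
```

In this toy case the two paths share no pages, so the greedy rule separates them cleanly. Real logs would need time-outs for closing sessions and smoothing for unseen transitions, neither of which is modelled in this sketch.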
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Lei, J.Z., Ghorbani, A. (2004). The Reconstruction of the Interleaved Sessions from a Server Log. In: Tawfik, A.Y., Goodwin, S.D. (eds) Advances in Artificial Intelligence. Canadian AI 2004. Lecture Notes in Computer Science, vol 3060. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24840-8_10
DOI: https://doi.org/10.1007/978-3-540-24840-8_10
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22004-6
Online ISBN: 978-3-540-24840-8
eBook Packages: Springer Book Archive