Skip to main content

The Reconstruction of the Interleaved Sessions from a Server Log

  • Conference paper
Advances in Artificial Intelligence (Canadian AI 2004)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3060))

  • 1539 Accesses

Abstract

Session reconstruction is an essential step in Web usage mining. The quality of reconstructed sessions affects the result of Web usage mining. This paper presents a new approach of reconstructing sessions from Web server logs using the Markov chain model combined with a competitive algorithm. The proposed approach has the ability to reconstruct interleaved sessions from server logs. It is robust even if the client IP is not available in the log file. This capability makes our work distinct from other session reconstruction methods. The experiments show that our approach provides a significant improvement in regarding interleaved sessions compared to the traditional methods.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Berendt, B., Mobasher, B., Spiliopoulou, M., Wiltshire, J.: Measuring the accuracy of sessionizers for Web usage analysis. In: Workshop on Web Mining at the First SIAM International Conference on Data Mining, April 2001, pp. 7–14 (2001)

    Google Scholar 

  2. Colley, R., Mobasher, B., Srivastava, J.: Data Preparation for Mining World Wide Web Browsing Patterns. In: Knowledge and Information Systems, pp. 5–32 (1999)

    Google Scholar 

  3. Huang, X., Peng, F., An, A., Schuurmans, D., Cercone, N.: Session Boundary Detection for Association Rule Learning Using n-Gram Language Models. In: Proceedings of Sixteenth Conference of the Canadian Society for Computational Studies of the Intelligence, pp. 237–251 (2003)

    Google Scholar 

  4. Kilfoil, M., Ghorbani, A., Xing, W., Lei, Z., Lu, J., Zhang, J., Xu, X.: Toward an Adaptive Web: The State of the Art and Science. In: The 1st Annual Conference on Communication Networks & Services Research, pp. 119–130 (2003)

    Google Scholar 

  5. Pitkow, J. In: search of reliable usage data on the WWW. Computer Networks and ISDN Systems, 1343–1355 (1997)

    Google Scholar 

  6. Sarukkai, R.R.: Link Prediction and Path Analysis Using Markov Chains. Computer Network, 377–386 (2000)

    Google Scholar 

  7. Spiliopoulou, M., Pohle, C., Faulstich, L.: Improving the Effectiveness of a Web Site with Web Usage Mining, WEBKDD, pp. 142–162 (1999)

    Google Scholar 

  8. Spiliopoulou, M., Mobasher, B., Berendt, B., Nakagawa, M.: A Framework for the Evaluation of Session Reconstruction Heuristics inWeb Usage Analysis. INFORMS Journal of Computing, Special Issue on Mining Web-Based Data for E-Business Applications (2003)

    Google Scholar 

  9. Hallam-Baker, P.M., Behlendorf, B.: Extended Log File Format, http://www.w3.org/TR/WD-logfile (accessed August 19, 2003)

  10. Ypma, A., Heskes, T.: Categorization of Web Pages and User Clustering with mixtures of Hidden Markov Models. In: The International Workshop on Web Knowledge Discovery and Data Mining (July 2002)

    Google Scholar 

  11. Zhu, J., Hong, J., Hughes, J.G.: Using Markov models for web site link prediction. In: Proceedings of the thirteenth ACM conference on Hypertext and hypermedia, pp. 169–170 (2002)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2004 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Lei, J.Z., Ghorbani, A. (2004). The Reconstruction of the Interleaved Sessions from a Server Log. In: Tawfik, A.Y., Goodwin, S.D. (eds) Advances in Artificial Intelligence. Canadian AI 2004. Lecture Notes in Computer Science(), vol 3060. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24840-8_10

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-24840-8_10

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-22004-6

  • Online ISBN: 978-3-540-24840-8

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics