Skip to main content

A Latent Usage Approach for Clustering Web Transaction and Building User Profile

  • Conference paper
Advanced Data Mining and Applications (ADMA 2005)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3584))

Included in the following conference series:

  • 2438 Accesses

Abstract

Web transaction data between web visitors and web functionalities usually convey users’ task-oriented behavior patterns. Clustering web transactions, thus, may capture such informative knowledge, in turn, build user profiles, which are associated with different navigational patterns. For some advanced web applications, such as web recommendation or personalization, the aforementioned work is crucial to make web users get their preferred information accurately. On the other hand, the conventional web usage mining techniques for clustering web objects often perform clustering on usage data directly rather than take the underlying semantic relationships among the web objects into account. Latent Semantic Analysis (LSA) model is a commonly used approach for capturing semantic associations among co-occurrence observations.. In this paper, we propose a LSA-based approach for such purpose. We demonstrated usability and scalability of the proposed approach through performing experiments on two real world datasets. The experimental results have validated the method’s effectiveness in comparison with some previous studies.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. Joachims, T., Freitag, D., Mitchell, T.: Webwatcher: A tour guide for the world wide web. In: The 15th International Joint Conference on Artificial Intelligence (ICJAI 1997), Nagoya, Japan (1997)

    Google Scholar 

  2. Lieberman, H.: Letizia: An agent that assists web browsing. In: Proc. of the 1995 International Joint Conference on Artificial Intelligence. Morgan Kaufmann, Montreal (1995)

    Google Scholar 

  3. Perkowitz, M., Etzioni, O.: Adaptive Web Sites: Automatically Synthesizing Web Pages. In: Proceedings of the 15th National Conference on Artificial Intelligence. AAAI, Madison (1998)

    Google Scholar 

  4. Ngu, D.S.W., Wu, X.: Sitehelper: A localized agent that helps incremental exploration of the world wide web. In: Proceedings of 6th International World Wide Web Conference. ACM Press, Santa Clara (1997)

    Google Scholar 

  5. Cohen, E., Krishnamurthy, B., Rexford, J.: Improving end-to-end performance of the web using server volumes and proxy lters. In: Proc. of the ACM SIGCOMM 1998. ACM Press, Vancouver (1998)

    Google Scholar 

  6. Büchner, A.G., Mulvenna, M.D.: Discovering Internet Marketing Intelligence through Online Analytical Web Usage Mining. SIGMOD Record 27(4), 54–61 (1998)

    Article  Google Scholar 

  7. Mobasher, B., Cooley, R., Srivastava, J.: Creating adaptive web sites through usagebased clustering of URLs. In: Proceedings of the 1999 Workshop on Knowledge and Data Engineering Exchange. IEEE Computer Society Press, Los Alamitos (1999)

    Google Scholar 

  8. Han, E., et al.: Hypergraph Based Clustering in High-Dimensional Data Sets: A Summary of Results. IEEE Data Engineering Bulletin 21(1), 15–22 (1998)

    Google Scholar 

  9. Mobasher, B., et al.: Discovery and Evaluation of Aggregate Usage Profiles for Web Personalization. Data Mining and Knowledge Discovery 6(1), 61–82 (2002)

    Article  MathSciNet  Google Scholar 

  10. Agarwal, R., Aggarwal, C., Prasad, V.: A Tree Projection Algorithm for Generation of Frequent Itemsets. Journal of Parallel and Distributed Computing 61(3), 350–371 (1999)

    Article  Google Scholar 

  11. Agrawal, R., Srikant, R.: Jorge B. Bocca and Matthias Jarke and Carlo Zaniolo. In: Proceedings of the 20th International Conference on Very Large Data Bases (VLDB). Morgan Kaufmann, Santiago (1994)

    Google Scholar 

  12. Agrawal, R., Srikant, R.: Mining Sequential Patterns. In: Proceedings of the International Conference on Data Engineering (ICDE). IEEE Computer Society Press, Taipei (1995)

    Google Scholar 

  13. Mobasher, B.: Web Usage Mining and Personalization. In: Singh, M.P. (ed.) Practical Handbook of Internet Computing. CRC Press, Boca Raton (2004)

    Google Scholar 

  14. Perkowitz, M., Etzioni, O.: Adaptive Web sites. Communications of the ACM 43(8), 152–158 (2000)

    Article  Google Scholar 

  15. O’Conner, M., Herlocker, J.: Clustering Items for Collaborative Filtering. In: Proceedings of the ACM SIGIR Workshop on Recommender Systems. ACM Press, Berkeley (1999)

    Google Scholar 

  16. Zhou, Y., Jin, X., Mobasher, B.: A Recommendation Model Based on Latent Principal Factors in Web Navigation Data. In: Proceedings of the 3rd International Workshop on Web Dynamics. ACM Press, New York (2004)

    Google Scholar 

  17. Xu, G., et al.: Discovering User Access Pattern Based on Probabilistic Latent Factor Model. In: Proceeding of 16th Australasian Database Conference. ACS Inc, Newcastle (2004)

    Google Scholar 

  18. Xu, G., Zhang, Y., Zhou, X.: Using Probabilistic Semantic Latent Analysis for Web Page Grouping. In: 15th International Workshop on Research Issues on Data Engineering: Stream Data Mining and Applications (RIDE-SDMA 2005), Tyoko, Japan (2005)

    Google Scholar 

  19. Jin, X., Zhou, Y., Mobasher, B.: A Unified Approach to Personalization Based on Probabilistic Latent Semantic Models of Web Usage and Content. In: Proceedings of the AAAI 2004 Workshop on Semantic Web Personalization (SWP 2004), San Jose (2004)

    Google Scholar 

  20. Cooley, R., Mobasher, B., Srivastava, J.: Data Preparation for Mining World Wide Web Browsing Patterns. Journal of Knowledge and Information Systems 1(1), 5–32 (1999)

    Google Scholar 

  21. Datta, B.N.: Numerical Linear Algebra and Application. Brooks/Cole Publishing Company (1995)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Zhang, Y., Xu, G., Zhou, X. (2005). A Latent Usage Approach for Clustering Web Transaction and Building User Profile. In: Li, X., Wang, S., Dong, Z.Y. (eds) Advanced Data Mining and Applications. ADMA 2005. Lecture Notes in Computer Science(), vol 3584. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11527503_6

Download citation

  • DOI: https://doi.org/10.1007/11527503_6

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-27894-8

  • Online ISBN: 978-3-540-31877-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics