Abstract
Web transaction data between web visitors and web functionalities usually convey users’ task-oriented behavior patterns. Clustering web transactions, thus, may capture such informative knowledge, in turn, build user profiles, which are associated with different navigational patterns. For some advanced web applications, such as web recommendation or personalization, the aforementioned work is crucial to make web users get their preferred information accurately. On the other hand, the conventional web usage mining techniques for clustering web objects often perform clustering on usage data directly rather than take the underlying semantic relationships among the web objects into account. Latent Semantic Analysis (LSA) model is a commonly used approach for capturing semantic associations among co-occurrence observations.. In this paper, we propose a LSA-based approach for such purpose. We demonstrated usability and scalability of the proposed approach through performing experiments on two real world datasets. The experimental results have validated the method’s effectiveness in comparison with some previous studies.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Joachims, T., Freitag, D., Mitchell, T.: Webwatcher: A tour guide for the world wide web. In: The 15th International Joint Conference on Artificial Intelligence (ICJAI 1997), Nagoya, Japan (1997)
Lieberman, H.: Letizia: An agent that assists web browsing. In: Proc. of the 1995 International Joint Conference on Artificial Intelligence. Morgan Kaufmann, Montreal (1995)
Perkowitz, M., Etzioni, O.: Adaptive Web Sites: Automatically Synthesizing Web Pages. In: Proceedings of the 15th National Conference on Artificial Intelligence. AAAI, Madison (1998)
Ngu, D.S.W., Wu, X.: Sitehelper: A localized agent that helps incremental exploration of the world wide web. In: Proceedings of 6th International World Wide Web Conference. ACM Press, Santa Clara (1997)
Cohen, E., Krishnamurthy, B., Rexford, J.: Improving end-to-end performance of the web using server volumes and proxy lters. In: Proc. of the ACM SIGCOMM 1998. ACM Press, Vancouver (1998)
Büchner, A.G., Mulvenna, M.D.: Discovering Internet Marketing Intelligence through Online Analytical Web Usage Mining. SIGMOD Record 27(4), 54–61 (1998)
Mobasher, B., Cooley, R., Srivastava, J.: Creating adaptive web sites through usagebased clustering of URLs. In: Proceedings of the 1999 Workshop on Knowledge and Data Engineering Exchange. IEEE Computer Society Press, Los Alamitos (1999)
Han, E., et al.: Hypergraph Based Clustering in High-Dimensional Data Sets: A Summary of Results. IEEE Data Engineering Bulletin 21(1), 15–22 (1998)
Mobasher, B., et al.: Discovery and Evaluation of Aggregate Usage Profiles for Web Personalization. Data Mining and Knowledge Discovery 6(1), 61–82 (2002)
Agarwal, R., Aggarwal, C., Prasad, V.: A Tree Projection Algorithm for Generation of Frequent Itemsets. Journal of Parallel and Distributed Computing 61(3), 350–371 (1999)
Agrawal, R., Srikant, R.: Jorge B. Bocca and Matthias Jarke and Carlo Zaniolo. In: Proceedings of the 20th International Conference on Very Large Data Bases (VLDB). Morgan Kaufmann, Santiago (1994)
Agrawal, R., Srikant, R.: Mining Sequential Patterns. In: Proceedings of the International Conference on Data Engineering (ICDE). IEEE Computer Society Press, Taipei (1995)
Mobasher, B.: Web Usage Mining and Personalization. In: Singh, M.P. (ed.) Practical Handbook of Internet Computing. CRC Press, Boca Raton (2004)
Perkowitz, M., Etzioni, O.: Adaptive Web sites. Communications of the ACM 43(8), 152–158 (2000)
O’Conner, M., Herlocker, J.: Clustering Items for Collaborative Filtering. In: Proceedings of the ACM SIGIR Workshop on Recommender Systems. ACM Press, Berkeley (1999)
Zhou, Y., Jin, X., Mobasher, B.: A Recommendation Model Based on Latent Principal Factors in Web Navigation Data. In: Proceedings of the 3rd International Workshop on Web Dynamics. ACM Press, New York (2004)
Xu, G., et al.: Discovering User Access Pattern Based on Probabilistic Latent Factor Model. In: Proceeding of 16th Australasian Database Conference. ACS Inc, Newcastle (2004)
Xu, G., Zhang, Y., Zhou, X.: Using Probabilistic Semantic Latent Analysis for Web Page Grouping. In: 15th International Workshop on Research Issues on Data Engineering: Stream Data Mining and Applications (RIDE-SDMA 2005), Tyoko, Japan (2005)
Jin, X., Zhou, Y., Mobasher, B.: A Unified Approach to Personalization Based on Probabilistic Latent Semantic Models of Web Usage and Content. In: Proceedings of the AAAI 2004 Workshop on Semantic Web Personalization (SWP 2004), San Jose (2004)
Cooley, R., Mobasher, B., Srivastava, J.: Data Preparation for Mining World Wide Web Browsing Patterns. Journal of Knowledge and Information Systems 1(1), 5–32 (1999)
Datta, B.N.: Numerical Linear Algebra and Application. Brooks/Cole Publishing Company (1995)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Zhang, Y., Xu, G., Zhou, X. (2005). A Latent Usage Approach for Clustering Web Transaction and Building User Profile. In: Li, X., Wang, S., Dong, Z.Y. (eds) Advanced Data Mining and Applications. ADMA 2005. Lecture Notes in Computer Science(), vol 3584. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11527503_6
Download citation
DOI: https://doi.org/10.1007/11527503_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-27894-8
Online ISBN: 978-3-540-31877-4
eBook Packages: Computer ScienceComputer Science (R0)