Abstract
Web usage mining aims at the discovery of interesting usage patterns from Web server log files. “Interestingness” relates to the business goals of the site owner. However, business goals refer to business objects rather than to the page hits and script invocations recorded by the site server. Hence, Web usage analysis requires a preparatory mechanism that incorporates the business goals, the concepts reflecting them and the expert’s background knowledge on them into the mining process. To this end, we present a methodology and a mechanism for the establishment and exploitation of application-oriented concept hierarchies in Web usage analysis. We demonstrate our approach on a real data set and show how it can substantially improve both the search for interesting patterns by the mining algorithm and the interpretation of the mining results by the analyst.
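To make the idea of a concept hierarchy over log entries concrete, the following is a minimal sketch and not the mechanism presented in the paper. It assumes a small, made-up hierarchy (`CONCEPT_PARENT`) and made-up URL patterns (`URL_RULES`); all concept names, URLs, and function names are hypothetical illustrations. It shows how raw requests could be mapped to business concepts and then generalized to a coarser level before mining.

```python
import re
from collections import Counter

# Hypothetical concept hierarchy: each concept maps to its parent (None = root).
CONCEPT_PARENT = {
    "Site": None,
    "Catalog": "Site",
    "ProductPage": "Catalog",
    "SearchResult": "Catalog",
    "Order": "Site",
    "Checkout": "Order",
}

# Hypothetical rules assigning raw requested URLs to leaf concepts.
URL_RULES = [
    (re.compile(r"^/product/\d+"), "ProductPage"),
    (re.compile(r"^/search\.cgi"), "SearchResult"),
    (re.compile(r"^/checkout"), "Checkout"),
]


def to_concept(url):
    """Return the first matching leaf concept, or the root if none matches."""
    for pattern, concept in URL_RULES:
        if pattern.search(url):
            return concept
    return "Site"


def generalize(concept, levels=1):
    """Climb up to `levels` steps in the hierarchy, stopping at the root."""
    for _ in range(levels):
        parent = CONCEPT_PARENT.get(concept)
        if parent is None:
            break
        concept = parent
    return concept


def abstract_session(urls, levels=0):
    """Replace each requested URL by its concept at the chosen abstraction level."""
    return [generalize(to_concept(u), levels) for u in urls]


# A made-up session of requests as they might appear in a server log.
session = ["/product/17", "/search.cgi?q=lamp", "/product/42", "/checkout/pay"]
leaf_view = abstract_session(session)              # concepts at leaf level
coarse_view = abstract_session(session, levels=1)  # one level more general
counts = Counter(coarse_view)                      # concept frequencies for analysis
```

In this sketch the analyst chooses the abstraction level per analysis; a pattern miner would then operate on the concept sequences rather than on individual page hits.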
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Pohle, C., Spiliopoulou, M. (2002). Building and Exploiting Ad Hoc Concept Hierarchies for Web Log Analysis. In: Kambayashi, Y., Winiwarter, W., Arikawa, M. (eds) Data Warehousing and Knowledge Discovery. DaWaK 2002. Lecture Notes in Computer Science, vol 2454. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-46145-0_9
DOI: https://doi.org/10.1007/3-540-46145-0_9
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44123-6
Online ISBN: 978-3-540-46145-6
eBook Packages: Springer Book Archive