Abstract
The emerging WWW poses new technological challenges for information processing. The scale of WWW is expected to keep growing as more devices, such as mobile phones and PDAs are equipped with the ability to access internet. Here we report the application of data mining techniques on large scale web data of a directory service for users of i- Mode, a major mobile phone internet access in Japan. We develop tool to visualize the behavior of web site visitors. We also report experiments on PC cluster as promising platform for large scale web mining. Parallel algorithms for generalized association rules are implemented on PC cluster with 100 PCs.
NTT Information Sharing Platform Laboratories, Midori-cho 3-9-11, Musashino-shi, Tokyo 180-8585, Japan
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
R. Agrawal and R. Srikant. “Fast Algorithms for Mining Association Rules”. In Proc. of VLDB, pp. 487–499, Sep. 1994.
R. Agrawal and J. C. Shafer. “Parallel Mining of Associaton Rules”. In IEEE TKDE, Vol. 8
E.-H. Han and G. Karypis and Vipin Kumar “Scalable Parallel Data Mining for Association Rules.” In Proc. of SIGMOD, pp. 277–288, May. 1997
J.I. Hong and J.A. Landay “WebQuilt: A Framework for Capturing and Visualizing the Web Experience” In Proc. of WWW10, 2001.
J.S. Park, M.-S. Chen, P.S. Yu “Efficient Parallel Algorithms for Mining Association Rules” In Proc. of CIKM, pp. 31–36, Nov. 1995
R. Srikant, R. Agrawal. “Mining Generalized Association Rules”. In Proc. of VLDB, 1995.
T. Shintani and M. Kitsuregawa “Hash Based Parallel Algorithms for Mining Association Rules”. In Proc. of PDIS, pp. 19–30, Dec. 1996.
T. Shintani, M. Kitsuregawa “Parallel Mining Algorithms for Generalized Association Rules with Classification Hierarchy.” In Proc. of SIGMOD, pp. 25–36, 1998.
T. Shintani, M. Oguchi, M. Kitsuregawa. “Performance Analysis for Parallel Generalized Association Rule Mining on a Large Scale PC Cluster”. In Proc. of Euro-par, 1999.
M. Spiliopoulou and L.C. Faulstich “WUM:A tool for Web Utilization Analysis” In Proc. of EDBT Workshop WebDB’98, 1998.
M. Tamura, M. Kitsuregawa. “Dynamic Load Balancing for Parallel Association Rule Mining on Heterogeneous PC Cluster System”. In Proc. of VLDB, 1999.
T. Tamura, M. Oguchi, M. Kitsuregawa “Parallel Database Processing on a 100 Node PC Cluster: Cases for Decision Support Query Processing and Data Mining.” In Proc of Super Computing 97::High Performance Networking and Computing, 1997
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2001 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kitsuregawa, M., Pramudiono, I., Takahashi, K., Prasetyo, B. (2001). Web Mining Is Parallel. In: Monien, B., Prasanna, V.K., Vajapeyam, S. (eds) High Performance Computing — HiPC 2001. HiPC 2001. Lecture Notes in Computer Science, vol 2228. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45307-5_34
Download citation
DOI: https://doi.org/10.1007/3-540-45307-5_34
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-43009-4
Online ISBN: 978-3-540-45307-9
eBook Packages: Springer Book Archive