Abstract
We introduce the notion of dense region as distinct and meaningful patterns from given data. Efficient and effective algorithms for identifying such regions are presented. Next, we discuss extensions of the algorithms for handling data streams. Finally, experiments on large-scale data streams such as clickstreams are given which confirm that the usefulness of our algorithms.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Huang, J., Ng, M., Ching, W., Ng, J., Cheung, D.: A cube model and cluster analysis for web access sessions. In: Kohavi, R., Masand, B., Spiliopoulou, M., Srivastava, J. (eds.) WebKDD 2001. LNCS (LNAI), vol. 2356, p. 48. Springer, Heidelberg (2002)
Wu, E.H., Ng, M.K., Huang, J.Z.: On improving website connectivity by using web-log data streams. In: Lee, Y., Li, J., Whang, K.-Y., Lee, D. (eds.) DASFAA 2004. LNCS, vol. 2973, pp. 352–364. Springer, Heidelberg (2004)
Wu, E.H., Ng, M.K.: A graph-based optimization algorithm for Website topology using interesting association rules. In: Proc. of PAKDD 2003, Seoul, Korea (2003)
Yang, C., Fayyad, U., Bradley, P.S.: Efficient discovery of error-tolerant frequent itemsets in high dimensions. In: Proc. of ACM SIGKDD Intl. Conf. on Knowledge Discovery and Data Mining: San Francisco, California, pp. 194–203 (2001)
Yip, A.M., Wu, E.H., Ng, M.K., Chan, T.F.: An efficient algorithm for dense regions discovery from large-scale data streams (extended version), UCLA CAM Reports 03-76, Math. Dept., University of California, Los Angeles, CA (2003)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Yip, A.M., Wu, E.H., Ng, M.K., Chan, T.F. (2004). An Efficient Algorithm for Dense Regions Discovery from Large-Scale Data Streams. In: Dai, H., Srikant, R., Zhang, C. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2004. Lecture Notes in Computer Science(), vol 3056. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24775-3_14
Download citation
DOI: https://doi.org/10.1007/978-3-540-24775-3_14
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22064-0
Online ISBN: 978-3-540-24775-3
eBook Packages: Springer Book Archive