Abstract
In recent years, data mining over uncertain data stream has attracted a lot of attentions along with the imprecise data widely generated. In many cases, the estimated error of the data stream is available. The estimated error is very useful for the clustering process, since it can be used to improve the quality of the cluster results. In this paper, we try to resolve the problem of clustering uncertain data stream over sliding windows. The tuple expected value and uncertainty are considered meanwhile in the clustering process. We therefore propose the algorithm based on Voronoi diagram to reduce the number of expected distance calculation over sliding windows. Finally, our performance study with both real and synthetic data sets demonstrates the efficiency and effectiveness of our proposed method.
This research was supported by the National Natural Science Foundation of China (Grant No. 61073063, 61173029, 60803026 and 61173030).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Aggarwal, C.C., Yu, P.S.: A survey of uncertain data algorithms and applications. Knowledge and Data Engineering 21(5), 609–623 (2009)
Zhang, C., Gao, M., Zhou, A.: Tracking high quality clusters over uncertain data streams. In: ICDE 2009, pp. 1641–1648 (2009)
Zhang, T., Ramakrishnan, R., Livny, M.: BIRCH: An efficient data clustering method for very large databases. In: SIGMOD 1996, vol. 25(2), pp. 103–114 (1996)
Chang, J.L., Cao, F., Zhou, A.Y.: Clustering evolving data streams over sliding windows. Journal of Software 18(4), 905–918 (2007)
Aggarwal, C.C., Yu, P.S.: A framework for clustering uncertain data streams. In: ICDE, pp. 150–159 (2008)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Cao, K., Wang, G., Han, D., Ma, Y., Ma, X. (2012). A Framework for High-Quality Clustering Uncertain Data Stream over Sliding Windows. In: Gao, H., Lim, L., Wang, W., Li, C., Chen, L. (eds) Web-Age Information Management. WAIM 2012. Lecture Notes in Computer Science, vol 7418. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-32281-5_30
Download citation
DOI: https://doi.org/10.1007/978-3-642-32281-5_30
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-32280-8
Online ISBN: 978-3-642-32281-5
eBook Packages: Computer ScienceComputer Science (R0)