Skip to main content

A Framework for High-Quality Clustering Uncertain Data Stream over Sliding Windows

  • Conference paper
Web-Age Information Management (WAIM 2012)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 7418))

Included in the following conference series:

Abstract

In recent years, data mining over uncertain data stream has attracted a lot of attentions along with the imprecise data widely generated. In many cases, the estimated error of the data stream is available. The estimated error is very useful for the clustering process, since it can be used to improve the quality of the cluster results. In this paper, we try to resolve the problem of clustering uncertain data stream over sliding windows. The tuple expected value and uncertainty are considered meanwhile in the clustering process. We therefore propose the algorithm based on Voronoi diagram to reduce the number of expected distance calculation over sliding windows. Finally, our performance study with both real and synthetic data sets demonstrates the efficiency and effectiveness of our proposed method.

This research was supported by the National Natural Science Foundation of China (Grant No. 61073063, 61173029, 60803026 and 61173030).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Aggarwal, C.C., Yu, P.S.: A survey of uncertain data algorithms and applications. Knowledge and Data Engineering 21(5), 609–623 (2009)

    Article  Google Scholar 

  2. Zhang, C., Gao, M., Zhou, A.: Tracking high quality clusters over uncertain data streams. In: ICDE 2009, pp. 1641–1648 (2009)

    Google Scholar 

  3. Zhang, T., Ramakrishnan, R., Livny, M.: BIRCH: An efficient data clustering method for very large databases. In: SIGMOD 1996, vol. 25(2), pp. 103–114 (1996)

    Google Scholar 

  4. Chang, J.L., Cao, F., Zhou, A.Y.: Clustering evolving data streams over sliding windows. Journal of Software 18(4), 905–918 (2007)

    Article  Google Scholar 

  5. Aggarwal, C.C., Yu, P.S.: A framework for clustering uncertain data streams. In: ICDE, pp. 150–159 (2008)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Cao, K., Wang, G., Han, D., Ma, Y., Ma, X. (2012). A Framework for High-Quality Clustering Uncertain Data Stream over Sliding Windows. In: Gao, H., Lim, L., Wang, W., Li, C., Chen, L. (eds) Web-Age Information Management. WAIM 2012. Lecture Notes in Computer Science, vol 7418. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-32281-5_30

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-32281-5_30

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-32280-8

  • Online ISBN: 978-3-642-32281-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics