Abstract
We present a single-pass, (1.8 + ε)-factor, O(1/ε)-space data stream algorithm for the Euclidean 2-center problem for any fixed d ≥ 1. This is an improvement on the approximation factor over the (2 + ε)-factor and O(1/ε)-space algorithms of Ahn et al. [3] and Guha [8]. It can also be considered as an improvement on the space over the (1 + ε)-factor and O(1/ε d)-space algorithm of Zarrabi-Zadeh [11], while sacrificing the approximation factor a little bit. To our best knowledge, this is the first breakthrough with an approximation factor below 2 using O(1/ε) space for any fixed d. Our algorithm also extends to the k-center problem with k > 2.
This work was supported by the National Research Foundation of Korea(NRF) grant funded by the Korea government(MSIP) (No. 2011-0030044).
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Agarwal, P.K., Sharathkumar, R.: Streaming algorithms for extent problems in high dimensions. In: Proc. of the 21st ACM-SIAM Sympos. Discrete Algorithms, pp. 1481–1489 (2010)
Aggarwal, C.C.: Data streams: models and algorithms. Springer (2007)
Ahn, H.-K., Kim, H.-S., Kim, S.-S., Son, W.: Computing k-center over streaming data for small k. In: Proc. of the 23rd Int. Sympos. Algorithms and Computation, pp. 54–63 (2012)
Bonnell, I., Bate, M., Vine, S.: The hierarchical formation of a stellar cluster. Monthly Notices of the Royal Astronomical Society 343(2), 413–418 (2003)
Chan, T.M., Pathak, V.: Streaming and dynamic algorithms for minimum enclosing balls in high dimensions. In: Proc. of the 12th Int. Conf. on Algorithms and Data Structures, pp. 195–206 (2011)
Clarke, C., Bonnell, I., Hillenbrand, L.: The formation of stellar clusters. In: Mannings, V., Boss, A., Russell, S. (eds.) Protostars and Planets IV, pp. 151–177. University of Arizona Press, Tucson (2000)
Fayyad, U., Piatetsky-Shapiro, G., Smyth, P.: From data mining to knowledge discovery in databases. AI Magazine 17, 37–54 (1996)
Guha, S.: Tight results for clustering and summarizing data streams. In: Proc. of the 12th Int. Conf. on Database Theory, pp. 268–275. ACM (2009)
Han, J., Kamber, M.: Data mining: concepts and techniques. Morgan Kaufmann (2006)
Hershberger, J., Suri, S.: Adaptive sampling for geometric problems over data streams. Computational Geometry 39(3), 191–208 (2008)
Zarrabi-Zadeh, H.: Core-preserving algorithms. In: Proc. of 20th Canadian Conf. on Computational Geometry, pp. 159–162 (2008)
Poon, C.K., Zhu, B.: Streaming with minimum space: An algorithm for covering by two congruent balls. In: Lin, G. (ed.) COCOA 2012. LNCS, vol. 7402, pp. 269–280. Springer, Heidelberg (2012)
Sonka, M., Hlavac, V., Boyle, R.: Image processing, analysis, and machine vision, 3rd edn. Thomson Learning (2007)
Zarrabi-Zadeh, H., Chan, T.: A simple streaming algorithm for minimum enclosing balls. In: Proc. of 18th Canadian Conf. on Computational Geometry, pp. 139–142 (2006)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kim, SS., Ahn, HK. (2014). An Improved Data Stream Algorithm for Clustering. In: Pardo, A., Viola, A. (eds) LATIN 2014: Theoretical Informatics. LATIN 2014. Lecture Notes in Computer Science, vol 8392. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-54423-1_24
Download citation
DOI: https://doi.org/10.1007/978-3-642-54423-1_24
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-54422-4
Online ISBN: 978-3-642-54423-1
eBook Packages: Computer ScienceComputer Science (R0)