Abstract
We consider the following clustering with outliers problem: Given a set of points X ⊂ { − 1,1}n, such that there is some point z ∈ { − 1,1}n for which \(\Pr_{x\in X}[\left<{x,z}\right>\ge \varepsilon ]\ge \delta\), find z. We call such a point z a (δ,ε)-center of X.
In this work we give lower and upper bounds for the task of finding a (δ,ε)-center. We first show that for δ = 1 − ν close to 1, i.e. in the “unique decoding regime”, given a (1 − ν,ε)-centered set our algorithm can find a (1 − (1 + o(1))ν,(1 − o(1))ε)-center. More interestingly, we study the “list decoding regime”, i.e. when δ is close to 0. Our main upper bound shows that for values of ε and δ that are larger than 1/polylog(n), there exists a polynomial time algorithm that finds a (δ − o(1),ε − o(1))-center. Moreover, our algorithm outputs a list of centers explaining all of the clusters in the input.
Our main lower bound shows that given a set for which there exists a (δ,ε)-center, it is hard to find even a (δ/n c, ε)-center for some constant c and ε = 1/poly(n), δ = 1/poly(n).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Arora, S., Barak, B.: Complexity theory: A modern approach (2008)
Badoiu, M., Har-Peled, S., Indyk, P.: Approximate clustering via core-sets. In: Proc. 34th Annu. ACM Sympos. Theory Comput., pp. 250–257 (2002)
Feldman, V., Gopalan, P., Khot, S., Ponnuswami, A.K.: On agnostic learning of parities, monomials, and halfspaces. SIAM J. Comput. 39(2), 606–645 (2009)
Goldreich, O., Levin, L.A.: A hard-core predicate for all one-way functions. In: Proceedings of the Twenty-First Annual ACM Symposium on Theory of Computing, STOC 1989, pp. 25–32. ACM, New York (1989)
Guruswami, V., Raghavendra, P.: Hardness of learning halfspaces with noise. In: Proceedings of FOCS, pp. 543–552 (2006)
Impagliazzo, R., Jaiswal, R., Kabanets, V.: Approximately list-decoding direct product codes and uniform hardness amplification. In: Proceedings of the 47th Annual IEEE Symposium on Foundations of Computer Science, pp. 187–196. IEEE Computer Society, Washington, DC (2006)
Impagliazzo, R., Jaiswal, R., Kabanets, V., Wigderson, A.: Uniform direct product theorems: Simplified, optimized, and derandomized, vol. 39, pp. 1637–1665 (2010)
Johnson, S.M.: A new upper bound for error-correcting codes. IRE Transactions on Information Theory 8(2), 203–207 (1962)
Trevisan, L., Sorkin, G.B., Sudan, M., Williamson, D.P.: Gadgets approximation, and linear programming. In: IEEE Annual Symposium on Foundations of Computer Science, p. 617 (1996)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Dinur, I., Goldenberg, E. (2013). Clustering in the Boolean Hypercube in a List Decoding Regime. In: Fomin, F.V., Freivalds, R., Kwiatkowska, M., Peleg, D. (eds) Automata, Languages, and Programming. ICALP 2013. Lecture Notes in Computer Science, vol 7965. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-39206-1_35
Download citation
DOI: https://doi.org/10.1007/978-3-642-39206-1_35
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-39205-4
Online ISBN: 978-3-642-39206-1
eBook Packages: Computer ScienceComputer Science (R0)