Abstract
We study k-party number-in-hand set disjointness in the simultaneous message-passing model and show that, even if each element \(i\in [n]\) is guaranteed either to belong to all k parties or to belong to at most O(1) parties in expectation (and to at most \(O(\log n)\) parties with high probability), any \(\delta \)-error communication protocol for this problem requires \(\varOmega (n \min (\log 1/\delta , \log k) / k )\) communication (assuming \(k = \varOmega (\log n)\)).
We use the strong promise of our lower bound, together with a recent characterization of turnstile streaming algorithms as linear sketches, to obtain new lower bounds for the well-studied data-stream problem of approximating the frequency moments. We obtain a space lower bound of \(\varOmega (n^{1-2/p} \varepsilon ^{-2} \log M \log 1/\delta )\) bits for any algorithm giving a \((1+\varepsilon )\)-approximation to the p-th moment \(\sum _{i=1}^n |x_i|^p\) of an n-dimensional vector \(x\in \{\pm M\}^n\) with probability \(1-\delta \), for any \(\delta \ge 2^{-o(n^{1/p})}\). Our lower bound improves upon a prior \(\varOmega (n^{1-2/p} \varepsilon ^{-2} \log M)\) lower bound, which did not capture the dependence on \(\delta \), and it is optimal whenever \(\varepsilon \le 1/\text {poly}(\log n)\). This is the first example of a lower bound in data streams that uses a characterization in terms of linear sketches to obtain stronger lower bounds than those obtainable via the one-way communication model; indeed, our set disjointness lower bound provably cannot hold in the one-way model.
O. Weinstein and D.P. Woodruff—Research supported by a Simons Fellowship in Theoretical Computer Science and NSF Award CCF-1215990.
Copyright information
© 2015 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Weinstein, O., Woodruff, D.P. (2015). The Simultaneous Communication of Disjointness with Applications to Data Streams. In: Halldórsson, M., Iwama, K., Kobayashi, N., Speckmann, B. (eds) Automata, Languages, and Programming. ICALP 2015. Lecture Notes in Computer Science, vol 9134. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-47672-7_88
DOI: https://doi.org/10.1007/978-3-662-47672-7_88
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-47671-0
Online ISBN: 978-3-662-47672-7
eBook Packages: Computer Science, Computer Science (R0)