Skip to main content

The Simultaneous Communication of Disjointness with Applications to Data Streams

  • Conference paper
  • First Online:
Book cover Automata, Languages, and Programming (ICALP 2015)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 9134))

Included in the following conference series:

Abstract

We study k-party number-in-hand set disjointness in the simultaneous message-passing model, and show that even if each element \(i\in [n]\) is guaranteed to either belong to all k parties or to at most O(1) parties in expectation (and to at most \(O(\log n)\) parties with high probability), then \(\varOmega (n \min (\log 1/\delta , \log k) / k )\) communication is required by any \(\delta \)-error communication protocol for this problem (assuming \(k = \varOmega (\log n)\)).

We use the strong promise of our lower bound, together with a recent characterization of turnstile streaming algorithms as linear sketches, to obtain new lower bounds for the well-studied problem in data streams of approximating the frequency moments. We obtain a space lower bound of \(\varOmega (n^{1-2/p} \varepsilon ^{-2} \log M \log 1/\delta )\) bits for any algorithm giving a \((1+\varepsilon )\)-approximation to the p-th moment \(\sum _{i=1}^n |x_i|^p\) of an n-dimensional vector \(x\in \{\pm M\}^n\) with probability \(1-\delta \), for any \(\delta \ge 2^{-o(n^{1/p})}\). Our lower bound improves upon a prior \(\varOmega (n^{1-2/p} \varepsilon ^{-2} \log M)\) lower bound which did not capture the dependence on \(\delta \), and our bound is optimal whenever \(\varepsilon \le 1/\text {poly}(\log n)\). This is the first example of a lower bound in data streams which uses a characterization in terms of linear sketches to obtain stronger lower bounds than obtainable via the one-way communication model; indeed, our set disjointness lower bound provably cannot hold in the one-way model.

O. Weinstein and D.P. Woodruff—Research supported by a Simons Fellowship in Theoretical Computer Science and NSF Award CCF-1215990.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Andoni, A., Krauthgamer, R., Onak, K.: Streaming algorithms from precision sampling. CoRR, abs/1011.1263 (2010)

    Google Scholar 

  2. Alon, N., Matias, Y., Szegedy, M.: The space complexity of approximating the frequency moments. JCSS 58(1), 137–147 (1999)

    MATH  MathSciNet  Google Scholar 

  3. Andoni, A.: High frequency moment via max stability. http://web.mit.edu/andoni/www/papers/fkStable.pdf

  4. Andoni, A., Nguyên, H.L., Polyanskiy, Y., Wu, Y.: Tight lower bound for linear sketches of moments. In: Fomin, F.V., Freivalds, R., Kwiatkowska, M., Peleg, D. (eds.) ICALP 2013, Part I. LNCS, vol. 7965, pp. 25–32. Springer, Heidelberg (2013)

    Chapter  Google Scholar 

  5. Bhuvanagiri, L., Ganguly, S., Kesh, D., Saha, C.: Simpler algorithm for estimating frequency moments of data streams. In: SODA, pp. 708–713 (2006)

    Google Scholar 

  6. Braverman, M., Garg, A., Pankratov, D., Weinstein, O.: Information lower bounds via self-reducibility. In: Bulatov, A.A., Shur, A.M. (eds.) CSR 2013. LNCS, vol. 7913, pp. 183–194. Springer, Heidelberg (2013)

    Chapter  Google Scholar 

  7. Braverman, V., Katzman, J., Seidell, C., Vorsanger, G.: An optimal algorithm for large frequency moments using bits. In: APPROX/RANDOM (2014)

    Google Scholar 

  8. Braverman, V., Ostrovsky, R.: Recursive sketching for frequency moments. CoRR, abs/1011.2571 (2010)

    Google Scholar 

  9. Braverman, V., Ostrovsky, R.: Approximating large frequency moments with pick-and-drop sampling. CoRR, abs/1212.0202 (2012)

    Google Scholar 

  10. Braverman, M., Oshman, R.: The communication complexity of number-in-hand set disjointness with no promise. Electronic Colloquium on Computational Complexity (ECCC) 22(2) (2015)

    Google Scholar 

  11. Bar-Yossef, Z., Jayram, T.S., Kumar, R., Sivakumar, D.: An information statistics approach to data stream and communication complexity. Journal of Computer and System Sciences 68(4), 702–732 (2004)

    Article  MATH  MathSciNet  Google Scholar 

  12. Clarkson, K.L., Drineas, P., Magdon-Ismail, M., Mahoney, M.W., Meng, X., Woodruff, D.P.: The fast cauchy transform and faster robust linear regression. In: SODA (2013)

    Google Scholar 

  13. Coppersmith, D., Kumar, R.: An improved data stream algorithm for frequency moments. In: SODA (2004)

    Google Scholar 

  14. Chakrabarti, A., Khot, S., Sun, X.: Near-optimal lower bounds on the multi-party communication complexity of set disjointness. In: CCC, pp. 107–117 (2003)

    Google Scholar 

  15. Chakrabarti, A., Kondapally, R., Wang, Z.: Information complexity versus corruption and applications to orthogonality and gap-hamming. In: Gupta, A., Jansen, K., Rolim, J., Servedio, R. (eds.) APPROX 2012 and RANDOM 2012. LNCS, vol. 7408, pp. 483–494. Springer, Heidelberg (2012)

    Chapter  Google Scholar 

  16. Chattopadhyay, A., Pitassi, T.: The story of set disjointness. SIGACT News 41(3), 59–85 (2010)

    Article  Google Scholar 

  17. Ganguly, S.: Estimating frequency moments of data streams using random linear combinations. In: Jansen, K., Khanna, S., Rolim, J.D.P., Ron, D. (eds.) RANDOM 2004 and APPROX 2004. LNCS, vol. 3122, pp. 369–380. Springer, Heidelberg (2004)

    Chapter  Google Scholar 

  18. Ganguly, S.: A hybrid algorithm for estimating frequency moments of data streams, Manuscript (2004)

    Google Scholar 

  19. Ganguly, S.: Polynomial estimators for high frequency moments. CoRR, abs/1104.4552 (2011)

    Google Scholar 

  20. Ganguly, S.: A lower bound for estimating high moments of a data stream. CoRR, abs/1201.0253 (2012)

    Google Scholar 

  21. Indyk, P., Woodruff, D.: Optimal approximations of the frequency moments of data streams. In: STOC. ACM (2005)

    Google Scholar 

  22. Jayram, T.S., Woodruff, D.P.: Optimal bounds for johnson-lindenstrauss transforms and streaming problems with subconstant error. ACM Transactions on Algorithms 9(3), 26 (2013)

    Article  MathSciNet  Google Scholar 

  23. Kane, D.M., Nelson, J., Woodruff, D.P.: On the exact space complexity of sketching and streaming small norms. In: SODA, pp. 1161–1178 (2010)

    Google Scholar 

  24. Kalyanasundaram, B., Schnitger, G.: The probabilistic communication complexity of set intersection. SIAM Journal on Discrete Mathematics 5(4), 545–557 (1992)

    Article  MATH  MathSciNet  Google Scholar 

  25. Li, Y., Nguyen, H.L., Woodruff, D.P.: Turnstile streaming algorithms might as well be linear sketches. In: STOC, pp. 174–183 (2014)

    Google Scholar 

  26. Li, Y., Woodruff, D.P.: A tight lower bound for high frequency moment estimation with small error. In: Raghavendra, P., Raskhodnikova, S., Jansen, K., Rolim, J.D.P. (eds.) RANDOM 2013 and APPROX 2013. LNCS, vol. 8096, pp. 623–638. Springer, Heidelberg (2013)

    Chapter  Google Scholar 

  27. Monemizadeh, M., Woodruff, D.P.: \(1\)-pass relative-error \(l_p\)-sampling with applications. In: SODA (2010)

    Google Scholar 

  28. Sherstov, A.A.: Communication lower bounds using directional derivatives. J. ACM 61(6), 34 (2014)

    Article  MathSciNet  Google Scholar 

  29. Saks, M., Sun, X.: Space lower bounds for distance approximation in the data stream model. In: STOC (2002)

    Google Scholar 

  30. Sohler, C., Woodruff, D.P.: Subspace embeddings for the l\({}_{\text{1 }}\)-norm with applications. In: STOC, pp. 755–764 (2011)

    Google Scholar 

  31. Woodruff, D.P.: Optimal space lower bounds for all frequency moments. In: SODA, pp. 167–175 (2004)

    Google Scholar 

  32. Woodruff, D.P., Zhang, Q.: Tight bounds for distributed functional monitoring. In: STOC, pp. 941–960 (2012)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Omri Weinstein .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Weinstein, O., Woodruff, D.P. (2015). The Simultaneous Communication of Disjointness with Applications to Data Streams. In: Halldórsson, M., Iwama, K., Kobayashi, N., Speckmann, B. (eds) Automata, Languages, and Programming. ICALP 2015. Lecture Notes in Computer Science(), vol 9134. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-47672-7_88

Download citation

  • DOI: https://doi.org/10.1007/978-3-662-47672-7_88

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-662-47671-0

  • Online ISBN: 978-3-662-47672-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics