skip to main content
10.1145/3212734.3212795acmotherconferencesArticle/Chapter ViewAbstractPublication PagespodcConference Proceedingsconference-collections
keynote

Data Summarization and Distributed Computation

Published: 23 July 2018 Publication History

Abstract

The notion of summarization is to provide a compact representation of data which approximately captures its essential characteristics. If such summaries can be created, they can lead to efficient distributed algorithms which exchange summaries in order to compute a desired function. In this talk, I'll describe recent efforts in this direction for problems inspired by machine learning: building graphical models over evolving, distributed training examples, and solving robust regression problems over large, distributed data sets.

References

[1]
Graham Cormode, Charlie Dickens, and David Woodruff. Leveraging well-conditioned bases: Streaming and distributed summaries in minkowski p-norms. In International Conference on Machine Learning, 2018.
[2]
Zengfeng Huang, Ke Yi, and Qin Zhang. Randomized algorithms for tracking distributed count, frequencies, and ranks. In ACM Principles of Database Systems, pages 295--306, 2012.
[3]
Yu Zhang, Srikanta Tirthapura, and Graham Cormode. Learning graphical models from a distributed stream. In IEEE International Conference on Data Engineering, 2018.

Cited By

View all
  • (2024)CodingSketch: A Hierarchical Sketch with Efficient Encoding and Recursive Decoding2024 IEEE 40th International Conference on Data Engineering (ICDE)10.1109/ICDE60146.2024.00130(1592-1605)Online publication date: 13-May-2024
  • (2023)TreeSensing: Linearly Compressing Sketches with FlexibilityProceedings of the ACM on Management of Data10.1145/35889101:1(1-28)Online publication date: 30-May-2023

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
PODC '18: Proceedings of the 2018 ACM Symposium on Principles of Distributed Computing
July 2018
512 pages
ISBN:9781450357951
DOI:10.1145/3212734
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

In-Cooperation

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 23 July 2018

Check for updates

Author Tags

  1. data summarization
  2. distributed computation

Qualifiers

  • Keynote

Funding Sources

  • European Research Council

Conference

PODC '18

Acceptance Rates

PODC '18 Paper Acceptance Rate 41 of 163 submissions, 25%;
Overall Acceptance Rate 740 of 2,477 submissions, 30%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)4
  • Downloads (Last 6 weeks)0
Reflects downloads up to 07 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2024)CodingSketch: A Hierarchical Sketch with Efficient Encoding and Recursive Decoding2024 IEEE 40th International Conference on Data Engineering (ICDE)10.1109/ICDE60146.2024.00130(1592-1605)Online publication date: 13-May-2024
  • (2023)TreeSensing: Linearly Compressing Sketches with FlexibilityProceedings of the ACM on Management of Data10.1145/35889101:1(1-28)Online publication date: 30-May-2023

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media