research-article

Technical Perspective: Efficient Logspace Classes for Enumeration, Counting, and Uniform Generation

Author:
Reinhard Pichler

TU Wien, Austria

TU Wien, Austria
View Profile

Authors Info & Claims

ACM SIGMOD Record Volume 49 Issue 1March 2020pp 51https://doi.org/10.1145/3422648.3422660

Published:04 September 2020Publication History

ACM SIGMOD Record

Abstract

Traditionally, by query answering we mean the problem of finding all answers to a given query over a given database. But what happens if the number of answers is prohibitively big - which may easily occur in a Big Data context? In such situations, it seems preferable to have a mechanism that produces one answer after the other with certain guarantees on the time between any two outputs and to let the user decide when to stop. This leads us to the enumeration problem, which has received a lot of interest recently [1]. However, in order for the user to get a "realistic" picture of the entirety of answers, two crucial questions arise: first, how big is the portion of output answers compared with the total number of answers? And second, do the output answers reflect the variety of the complete set of answers? The first question refers to the counting problem, where we are interested in the total number of answers. The second question leads us to the problem of uniform generation, where we request that the answers be uniformly generated and thus form an unbiased sample of the complete set of answers.

References

E. Boros, B. Kimelfeld, R. Pichler, and N. Schweikardt. Enumeration in data management (dagstuhl seminar 19211). Dagstuhl Reports, 9(5):89--109, 2019.Google Scholar
M. Jerrum, L. G. Valiant, and V. V. Vazirani. Random generation of combinatorial structures from a uniform distribution. Theor. Comput. Sci., 43:169--188, 1986. Google ScholarDigital Library

Recommendations

Technical Perspective: Efficient Query Processing for Dynamically Changing Datasets

The paper Efficient Query Processing for Dynamically Changing Datasets, by Muhammad Idris, Mart´n Ugarte, Stijn Vansummeren, Hannes Voigt, and Wolfgang Lehner studies two central aspects of answering queries: (1) enumerating the answers to a query and (...
Read More
Technical Perspective: Query Answers - Fewer is Faster

We often write queries using LIMIT k, indicating that only k answers are to be returned. This feature is present in most query languages, for different data models: SQL, SPARQL, Cypher etc. For example, in a repository of about 250M SPARQL queries, ...
Read More
Technical Perspective for: Query Games in Databases

When a data analyst runs some query to analyze her data, she often wants to ask some follow-up questions, about the result of the query. Why-questions take many shapes, and occur in many scenarios. Why is a particular tuple in the answer? Why is it not ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

Published in
ACM SIGMOD Record Volume 49, Issue 1
March 2020
72 pages
ISSN:0163-5808
DOI:10.1145/3422648
Editors:
Rada Chirkova
North Carolina State University
,
Vanessa Braganholo
Universidade Federal Fluminense
,
Wim Martens
University of Bayreuth
,
Divesh Srivastava
ATT research
,
Pinar Tözü
IBM Almaden Research Center
,
Marianne Winslett
University of Illinois
,
Jun Yang
Duke University
,
Azza Abouzied
NYU
,
Lyublena Antova
Datometry
,
Aaron J. Elmore
University of Chicago
,
Kyriakos Mouratidis
Singapore Management University
,
Dan Olteanu
University of Oxford
,
Immanuel Trummer
Cornell University
,
Yannis Velegrakis
Utrecht University
Issue’s Table of Contents
Copyright © 2020 Copyright is held by the owner/author(s)
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 4 September 2020
Check for updates
Qualifiers
- research-article
Conference
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 0
  Total Citations
  View Citations
- 28
  Total Downloads
- Downloads (Last 12 months)2
- Downloads (Last 6 weeks)0
Other Metrics
View Author Metrics
Cited By
This publication has not been cited yet

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Technical Perspective: Efficient Logspace Classes for Enumeration, Counting, and Uniform Generation

ACM SIGMOD Record

Abstract

References

Cited By

Recommendations

Technical Perspective: Efficient Query Processing for Dynamically Changing Datasets

Technical Perspective: Query Answers - Fewer is Faster

Technical Perspective for: Query Games in Databases