Generating private synthetic databases for untrusted system evaluation | IEEE Conference Publication | IEEE Xplore

Generating private synthetic databases for untrusted system evaluation


Abstract:

Evaluating the performance of database systems is crucial when database vendors or researchers are developing new technologies. But such evaluation tasks rely heavily on ...Show More

Abstract:

Evaluating the performance of database systems is crucial when database vendors or researchers are developing new technologies. But such evaluation tasks rely heavily on actual data and query workloads that are often unavailable to researchers due to privacy restrictions. To overcome this barrier, we propose a framework for the release of a synthetic database which accurately models selected performance properties of the original database. We improve on prior work on synthetic database generation by providing a formal, rigorous guarantee of privacy. Accuracy is achieved by generating synthetic data using a carefully selected set of statistical properties of the original data which balance privacy loss with relevance to the given query workload. An important contribution of our framework is an extension of standard differential privacy to multiple tables.
Date of Conference: 31 March 2014 - 04 April 2014
Date Added to IEEE Xplore: 19 May 2014
Electronic ISBN:978-1-4799-2555-1

ISSN Information:

Conference Location: Chicago, IL, USA

Contact IEEE to Subscribe

References

References is not available for this document.