Skip to main content

Using the Jackknife Method to Produce Safe Plots of Microdata

  • Conference paper
Book cover Privacy in Statistical Databases (PSD 2006)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 4302))

Included in the following conference series:

  • 767 Accesses

Abstract

We discuss several methods for producing plots of uni- and bivariate distributions of confidential numeric microdata so that no single value is disclosed even in the presence of detailed additional knowledge, using the jackknife method of confidentiality protection. For histograms (as for frequency tables) this is similar to adding white noise of constant amplitude to all frequencies. Decreasing the bin size and smoothing, leading to kernel density estimation in the limit, gives more informative plots which need less noise for protection. Detail can be increased by choosing the bandwidth locally. Smoothing also the noise (i.e. using correlated noise) gives more visual improvement. Additional protection comes from robustifying the kernel density estimator or plotting only classified densities as in contour plots.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Gopal, R., Garfinkel, R., Goes, P.: Confidentiality Via Camouflage: The CVC Approach to Disclosure Limitation When Answering Queries to Databases. Operations Research 50, 501–516 (2002)

    Article  MATH  MathSciNet  Google Scholar 

  2. Gottschalk, S.: Microdata disclosure by resampling – empirical findings for business survey data. J. German Statist. Soc. 88(3), 279–302 (2004)

    MATH  MathSciNet  Google Scholar 

  3. Heitzig, J.: The Jackknife Method: Confidentiality Protection for Complex Statistical Analyses. In: Joint UNECE/Eurostat work session on statistical data confidentiality Geneva, Switzerland, November 9-11 (2005), URL: http://www.unece.org/stats/documents/ece/ces/ge.46/2005/wp.39.e.pdf

  4. Izenman, A.: Recent Developments in Nonparametric Density Estimation. J. Amer. Statist. Ass. 86, 205 (1991)

    Article  MATH  MathSciNet  Google Scholar 

  5. Scheffler, M.: Jackknife-Geheimhaltung im Vergleich zur Nutzung von Scientific-Use-Files am Beispiel der Kostenstrukturerhebung. Methoden–Verfahren–Entwicklungen, Vol. I/2006, Federal Statistical Office Germany, Wiesbaden (2006)

    Google Scholar 

  6. Silverman, B.W.: Density Estimation for Statistics and Data Analysis. Monographs on Statistics and Applied Probability. Chapman and Hall, London (1986)

    Google Scholar 

  7. Steel, Ph., Reznek, A.: Issues in Designing a Confidentiality Preserving Model Server. In: Joint UNECE/Eurostat work session on statistical data confidentiality, Geneva, Switzerland, November 9-11 (2005), URL: http://www.unece.org/stats/documents/ece/ces/ge.46/2005/wp.4.e.pdf

  8. Wand, M.P., Jones, M.C.: Comparison of Smoothing Parameterizations in Bivariate Kernel Density Estimation. J. Amer. Statist. Ass. 88, 520–528 (1993)

    Article  MATH  MathSciNet  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Heitzig, J. (2006). Using the Jackknife Method to Produce Safe Plots of Microdata. In: Domingo-Ferrer, J., Franconi, L. (eds) Privacy in Statistical Databases. PSD 2006. Lecture Notes in Computer Science, vol 4302. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11930242_13

Download citation

  • DOI: https://doi.org/10.1007/11930242_13

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-49330-3

  • Online ISBN: 978-3-540-49332-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics