Skip to main content

Microdata Protection through Noise Addition

  • Chapter
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2316))

Abstract

Microdata protection by adding noise is being discussed for more than 20 years now. Several algorithms were developed that have different characteristics. The simplest algorithm consists of adding white noise to the data. More sophisticated methods use more or less complex transformations of the data and more complex error-matrices to improve the results. This contribution gives an overview over the different algorithms and discusses their properties in terms of analytical validity and level of protection. Therefore some theoretical considerations are shown and an illustrating empirical example is given.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Brand, R. (2000): Anonymität von Betriebsdaten, Beiträge zur Arbeitsmarktund Berufsforschung Beitrag 237, Institut für Arbeitsmarkt-und Berufsforschung, Nürnberg.

    Google Scholar 

  2. Brand, R., Bender, S. and S. Kohaut (1999): Possibilities for the Creation of a Scientific Use File for the IAB-Establishment-Panel, Statistical Data Confidentiality, Proceedings of the Joint Eurostat/UN-ECE Work session on Statistical Data Confidentiality in March 1999.

    Google Scholar 

  3. Chauduri, A. and R. Mukerjee (1988): Randomized Response: Theory and Techniques, New York, Base: Marcel Dekker.

    Google Scholar 

  4. de Waal, A.G. and L.C.R.J. Willenborg (1997): Statistical Disclosure Control and Sampling Weights, Journal of Official Statistics 13, 417–434.

    Google Scholar 

  5. Domingo-Ferrer, J. and J.M. Mateo-Sanz (2001): Comparing SDC Methods for Microdata on the Basis of Information Loss and Disclosure Risk, paper presented at the Joint ECE/Eurostat Worksession on Statistical Confidentiality in Skopie (The former Yugoslav Republic of Macedonia), 14–16 March 2001

    Google Scholar 

  6. Fienberg, S.E. (1997): Confidentiality and Disclosure Limitation Methodology: Challenges for National Statistics and Sstatistical Research, Technical Report No. 611, Carnegie Mellon university, Pittsburgh.

    Google Scholar 

  7. Fuller, W.A. (1993): Masking Procedures for Microdata Disclosure Limitation, Journal of Official Statistics 9, 383–406.

    Google Scholar 

  8. Gouweleeuw, J.M., Kooiman, P., Willenborg, L.C.R.J. and P.-P. de Wolf (1998): Post Randomisation for Statistical Disclosure Control: Theory and Implementation, Journal of Official Statistics 14, 463–478.

    Google Scholar 

  9. Harho., D. and G. Licht (1993): Anonymisation of Innovation Survey Data, the Disguise Factor Method, Zentrum für Europäische Wirtschaftsforschung, Mannheim.

    Google Scholar 

  10. Kim, J.J. (1986): A Method for Limiting Disclosure in Microdata based on Random Noise and Transformation, Proceedings of the Section on Survey Research Methods 1986, American Statistical Association, 303–308.

    Google Scholar 

  11. Kim, J.J. (1990): Subpopulation Estimation for the Masked Data, Proceedings of the Section on Survey Research Methods 1990, American Statistical Association, 456–461.

    Google Scholar 

  12. Kim, J.J. and W.E. Winkler (1995): Masking Microdata Files, Proceedings of the Section on Survey Research Methods 1995, American Statistical Association, 114–119.

    Google Scholar 

  13. Kim, J.J. and W.E. Winkler (1997): Masking Microdata Files, Statistical Research Division RR97/03, US Bureau of the Census, Washington, DC.

    Google Scholar 

  14. Kim, J.J. and W.E. Winkler (2001): Multiplicative Noise for Masking Continouus Data, unpublished manuscript.

    Google Scholar 

  15. McGuckin, R. H. and S.V. Nguyen (1990), Public Use Microdata: Disclosure and Usefulness, Journal of Economic and Social Development 16, 19–39.

    Google Scholar 

  16. McGuckin, R. H. (1993): Analytic Use of Economic Microdata: A Model for Researcher Access with Confidentiality Protection, Proceedings of the International Seminar on Statistical Confidentiality, 08–10. Sept. 1992, Dublin, Ireland; Eurostat, 83–97.

    Google Scholar 

  17. Moore, R.A. (1996): Analysis of the Kim-Winkler Algorithm for Masking Microdata files—How Much Masking is Necessary and Sufficient? Conjectures for the Development of a Controllable Algorithm, Statistical Research Division RR96/05, US Bureau of the Census, Washington D.C.

    Google Scholar 

  18. Paass, G. (1988): Disclosure Risk and Disclosure Avoidance for Microdata, Journal of Business & Economic Statistics 6, 487–500.

    Article  Google Scholar 

  19. Roque, G. M. (2000): Masking Microdata Files with Mixtures of Multivariate Normal Distributions, unpublished PhD-Thesis, University of California, Riverside.

    Google Scholar 

  20. Spruill, N.L (1983) The Confidentiality and Analytic Usefulness of Masked Business Microdata, Proceedings of the Section on Survey Research Methods 1983, American Statistical Association, 602–610.

    Google Scholar 

  21. Sullivan, G.R. (1989): The Use of Added Error to Avoid Disclosure in Microdata Releases, unpublished PhD-Thesis, Iowa State University.

    Google Scholar 

  22. Sullivan, G.R. and W.A. Fuller (1989): The Use of Measurement Error to Avoid Disclosure, Proceedings of the Section on Survey Research Methods 1989, American Statistical Association, 802–807.

    Google Scholar 

  23. Sullivan, G.R. and W.A. Fuller (1990): Construction of Masking Error for Categorial Variables, Proceedings of the Section on Survey Research Methods 1990, American Statistical Association, 453–439.

    Google Scholar 

  24. Tendick, P. (1988): Bias Avoidance and Measures of Confidentiality for the Noise Addition Method of Database Disclosure Control, unpublished PhD-Thesis, University of California, Davis.

    Google Scholar 

  25. Tendick, P. (1991): Optimal Noise Addition for Preserving Confidentiality in Multivariate Data, Journal of Statistical Planning and Inference 27, 341–353.

    Article  MATH  MathSciNet  Google Scholar 

  26. Tendick, P. and N. Matlo. (1994): A Modified Random Perturbation Method for Databse Security, ACM Transactions on Database Systems 19, 47–63.

    Article  Google Scholar 

  27. Willenborg, L. and T. de Waal (1996): Statistical Disclosure Control in Practice, New York, Berlin, Heidelberg: Springer.

    MATH  Google Scholar 

  28. Winkler, W.E. (1995): Matching and Record Linkage, Cox, B.G., Binder, D.A., Chinnappa, B.N., Christianson, A., Colledge, M.J. und P.S. Kott (ed.): Business Survey Methods, Wiley Series in Probability and Mathematical Statistics, New York: John Wiley & Sons, 335–384.

    Google Scholar 

  29. Winkler, W.E. (1998): Re-identification Methods for Evaluating the Confidentiality of Analytically Valid Microdata, Statistical Data Protection 1998, Lisbon, Portugal.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2002 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Brand, R. (2002). Microdata Protection through Noise Addition. In: Domingo-Ferrer, J. (eds) Inference Control in Statistical Databases. Lecture Notes in Computer Science, vol 2316. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-47804-3_8

Download citation

  • DOI: https://doi.org/10.1007/3-540-47804-3_8

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-43614-0

  • Online ISBN: 978-3-540-47804-1

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics