Skip to main content

Multiply-Imputing Confidential Characteristics and File Links in Longitudinal Linked Data

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 3050))

Abstract

This paper describes ongoing research to protect confidentiality in longitudinal linked data through creation of multiply-imputed, partially synthetic data. We present two enhancements to the methods of [2]. The first is designed to preserve marginal distributions in the partially synthetic data. The second is designed to protect confidential links between sampling frames.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Abowd, J.M., Lane, J.I., Prevost, R.: Design and conceptual issues in realizing analytical enhancements through linkages of employer and employee data. In: Proceedings of the Federal Committee on Statistical Methodology (2000)

    Google Scholar 

  2. Abowd, J.M., Woodcock, S.D.: Disclosure limitation in longitudinal linked data. In: Doyle, P., Lane, J.I., Theeuwes, J.J., Zayatz, L.V. (eds.) Confidentiality, Disclosure, and Data Access: Theory and Practical Applications for Statistical Agencies, pp. 215–278. North-Holland, Amsterdam (2000)

    Google Scholar 

  3. Fienberg, S.E.: A radical proposal for the provision of micro-data samples and the preservation of confidentiality. Carnegie Mellon University Department of Statistics Technical Report No. 611 (1994)

    Google Scholar 

  4. Fienberg, S.E., Makov, U.E.: Confidentiality, uniqueness, and disclosure limitation for categorical data. J. Official Statistics 14(4), 385–397 (1998)

    Google Scholar 

  5. Fienberg, S.E., Makov, U.E., Steele, R.J.: Disclosure limitation using perturbation and related methods for categorical data. J. Official Statistics 14(4), 485–502 (1998)

    Google Scholar 

  6. Kennickell, A.B.: Multiple imputation and disclosure protection: the case of the 1995 Survey of Consumer Finances. SCF Working Paper (1997)

    Google Scholar 

  7. LEHD Program: The Longitudinal Employer-Household Dynamics program: Employment Dynamics Estimates project versions 2.2 and 2.3. LEHD Technical Paper 2002-05 (2002)

    Google Scholar 

  8. Raghunathan, T.J., Reiter, J.P., Rubin, D.: Multiple imputation for statistical disclosure limitation. J. Official Statistics 19(1), 1–16 (2003)

    Google Scholar 

  9. Reiter, J.P.: Satisfying disclosure restrictions with synthetic data sets. J. Official Statistics 18(4), 531–544 (2002)

    Google Scholar 

  10. Reiter, J.P.: Inference for partially synthetic, public use microdata sets. Survey Methodology 29, 181–188 (2003)

    Google Scholar 

  11. Reiter, J.P.: Releasing multiply-imputed, synthetic public use microdata: An illustration and empirical study. J. Royal Statistical Society, Series A (forthcoming)

    Google Scholar 

  12. Reiter, J.P.: Significance tests for multi-component estimands from multiplyimputed, synthetic microdata. J. Statistical Planning and Inference (forthcoming)

    Google Scholar 

  13. Reiter, J.P.: Using CART to generate partially synthetic public use microdata. Duke University working paper (2003)

    Google Scholar 

  14. Rubin, D.B.: Multiple Imputation for Nonresponse in Surveys. Wiley, Chichester (1987)

    Google Scholar 

  15. Rubin, D.B.: Discussion of statistical disclosure limitation. J. Official Statistics 9(2), 461–468 (1993)

    Google Scholar 

  16. Statistical Disclosure Control in Practice. Springer, Heidelberg (1996)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2004 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Abowd, J.M., Woodcock, S.D. (2004). Multiply-Imputing Confidential Characteristics and File Links in Longitudinal Linked Data. In: Domingo-Ferrer, J., Torra, V. (eds) Privacy in Statistical Databases. PSD 2004. Lecture Notes in Computer Science, vol 3050. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-25955-8_23

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-25955-8_23

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-22118-0

  • Online ISBN: 978-3-540-25955-8

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics