Skip to main content

Synthetic Decimal Numbers as a Flexible Tool for Suppression of Post-published Tabular Data

  • Conference paper
  • First Online:
Privacy in Statistical Databases (PSD 2022)

Abstract

Cell suppression is a widely used statistical disclosure control method for tabular data. Commonly, several linked tables are suppressed simultaneously. After publication, additional tables may be requested. In many contexts, new tables mean new ways of grouping and aggregating data that has already been published. The suppression of the new tables must be coordinated with the tables that have already been disseminated. A certain type of synthetic decimal numbers has proven to be very useful for this purpose. Based on the aggregation of these decimal numbers, one can decide whether a cell should be suppressed or not. An aggregation summing up to a whole number means the same as non-suppression. This article describes the theoretical basis for such decimal numbers. This is based on standard methodology from ordinary linear regression. The method is illustrated by a small example. In addition, two practical applications at Statistics Norway are presented, where one involves large hierarchical and linked tables where more than 50000 unique cells were primarily suppressed.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Bates, D., R Development Core Team: Comparing Least Squares Calculations (2022). https://cran.r-project.org/package=Matrix (r Vignette)

  2. Burridge, J.: Information preserving statistical obfuscation. Stat. Comput. 13(4), 321–327 (2003). https://doi.org/10.1023/A:1025658621216

  3. Castro, J., Via, A.: Revisiting interval protection, a.k.a. partial cell suppression, for tabular data. In: Domingo-Ferrer, J., Pejić-Bach, M. (eds.) PSD 2016. LNCS, vol. 9867, pp. 3–14. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-45381-1_1

    Chapter  Google Scholar 

  4. Chan, T.F.: Rank revealing QR factorizations. Linear Alg. Appl. 88, 67–82 (1987). https://doi.org/10.1016/0024-3795(87)90103-0

  5. Fischetti, M., Salazar, J.J.: Solving the cell suppression problem on tabular data with linear constraints. Manag. Sci. 47(7), 1008–1027 (2001). http://www.jstor.org/stable/822485

  6. Giessing, S., de Wolf, P.P., Reiffert, M., Geyer, F.: Considerations to deal with the frozen cell problem in Tau-Argus Modular. In: Joint UNECE/Eurostat Expert Meeting on Statistical Data Confidentiality, 1–3 December 2021, Poznań, Poland (2021)

    Google Scholar 

  7. Hundepool, A., et al.: Statistical Disclosure Control. John Wiley & Sons, Ltd., Boca Raton (2012). https://doi.org/10.1002/9781118348239.ch1

  8. Langsrud, Ø.: Information preserving regression-based tools for statistical disclosure control. Statist. Comput. 29(5), 965–976 (2019). https://doi.org/10.1007/s11222-018-9848-9

    Article  MathSciNet  MATH  Google Scholar 

  9. Langsrud, Ø.: easySdcTable: Easy Interface to the Statistical Disclosure Control Package ’sdcTable’ Extended with the ’GaussSuppression’ Method (2022). https://CRAN.R-project.org/package=easySdcTable (r package version 1.0.3)

  10. Langsrud, Ø., Lupp, D.: GaussSuppression: Tabular Data Suppression using Gaussian Elimination (2022). https://CRAN.R-project.org/package=GaussSuppression (r package version 0.4.0)

  11. Lupp, D.P., Langsrud, Ø.: Suppression of directly-disclosive cells in frequency tables. In: Joint UNECE/Eurostat Expert Meeting on Statistical Data Confidentiality, 1–3 December 2021, Poznań, Poland (2021)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Øyvind Langsrud .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Langsrud, Ø., Bøvelstad, H.M. (2022). Synthetic Decimal Numbers as a Flexible Tool for Suppression of Post-published Tabular Data. In: Domingo-Ferrer, J., Laurent, M. (eds) Privacy in Statistical Databases. PSD 2022. Lecture Notes in Computer Science, vol 13463. Springer, Cham. https://doi.org/10.1007/978-3-031-13945-1_8

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-13945-1_8

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-13944-4

  • Online ISBN: 978-3-031-13945-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics