Abstract
Minimum-distance controlled tabular adjustment (CTA) methods, and their variants, are considered an emerging perturbative approach for tabular data protection. Given a table to be protected, the purpose of CTA is to find the closest table that guarantees protection levels for the sensitive cells. We consider the most general CTA formulation, which includes binary variables and thus provides protected tables with higher data utility, at the expense of a longer solution time. The resulting model is a Mixed Integer Linear Problem (MILP). The purpose of this work is twofold. First, it presents and describes the main features of a package for CTA which is linked to both commercial (Cplex and Xpress) and open-source (Glpk, Cbc and Symphony) MILP solvers. The particular design of the package allows easy integration with additional solvers. The second objective is to perform a computational evaluation of the above two commercial and three open-source MILP solvers for CTA, using both standard instances from the literature and real-world ones. Users of tabular data confidentiality techniques in National Statistical Agencies may find this information useful when weighing the trade-off between the (more efficient but expensive) commercial and the (slower but free) open-source MILP solvers.
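For orientation, the MILP mentioned in the abstract can be sketched as follows. This is a common way of writing CTA with an L1 distance, not a formulation quoted from the abstract, and every symbol is an assumption introduced here: a_i are the original cell values, A is the matrix of table relations, w_i are cell weights, [l_i, u_i] are cell bounds, S is the set of sensitive cells with lower and upper protection levels lpl_i and upl_i, z_i^+ and z_i^- are the upward and downward deviations of the protected table x = a + z^+ - z^-, and the binary y_i selects the protection direction of sensitive cell i.

```latex
\begin{aligned}
\min_{z^+,\,z^-,\,y}\quad & \sum_{i=1}^{n} w_i \,\bigl(z_i^+ + z_i^-\bigr) \\
\text{s.t.}\quad & A\,(z^+ - z^-) = 0, \\
& 0 \le z_i^+ \le u_i - a_i, \quad 0 \le z_i^- \le a_i - l_i, && i \notin S, \\
& upl_i\, y_i \le z_i^+ \le (u_i - a_i)\, y_i, && i \in S, \\
& lpl_i\,(1 - y_i) \le z_i^- \le (a_i - l_i)\,(1 - y_i), && i \in S, \\
& y_i \in \{0, 1\}, && i \in S.
\end{aligned}
```

With y_i = 1 the sensitive cell is pushed up by at least upl_i, and with y_i = 0 it is pushed down by at least lpl_i. Fixing all y_i in advance would leave a linear program, which is faster to solve but typically yields a table farther from the original.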
Supported by grants MTM2009-08747 of the Spanish Ministry of Science and Innovation, SGR-2009-1122 of the Government of Catalonia, and INFRA-2010-262608 of the European Union.
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Castro, J. (2012). A Computational Evaluation of Optimization Solvers for CTA. In: Domingo-Ferrer, J., Tinnirello, I. (eds) Privacy in Statistical Databases. PSD 2012. Lecture Notes in Computer Science, vol 7556. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-33627-0_2
DOI: https://doi.org/10.1007/978-3-642-33627-0_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-33626-3
Online ISBN: 978-3-642-33627-0
eBook Packages: Computer Science, Computer Science (R0)