research-article

Public Access

Privacy-Preserving Publishing of Multilevel Utility-Controlled Graph Datasets

Authors:

Balaji Palanisamy,

Qingyang WangAuthors Info & Claims

ACM Transactions on Internet Technology (TOIT), Volume 18, Issue 2

Article No.: 24, Pages 1 - 21

https://doi.org/10.1145/3125622

Published: 22 February 2018 Publication History

Abstract

Conventional private data publication schemes are targeted at publication of sensitive datasets either after the k-anonymization process or through differential privacy constraints. Typically these schemes are designed with the objective of retaining as much utility as possible for the aggregate queries while ensuring the privacy of the individual records. Such an approach, though suitable for publishing aggregate information as public datasets, is inapplicable when users have different levels of access to the same data. We argue that existing schemes either result in increased disclosure of private information or lead to reduced utility when some users have more access privileges than the others. In this article, we present an anonymization framework for publishing large datasets with the goals of providing different levels of utility to the users based on their access privilege levels. We design and implement our proposed multilevel utility-controlled anonymization schemes in the context of large association graphs considering three levels of user utility, namely, (1) users having access to only the graph structure, (2) users having access to the graph structure and aggregate query results, and (3) users having access to the graph structure, aggregate query results, and individual associations. Our experiments on real large association graphs show that the proposed techniques are effective and scalable and yield the required level of privacy and utility for each user privacy and access privilege level.

References

[1]

C. Aggarwal. 2005. On k-anonymity and the curse of dimensionality. In International Conference on Very Large Databases (VLDB’5).

Digital Library

[2]

L. Backstrom, C. Dwork, and J. Kleinberg. 2007. Wherefore are thou R3579X? Anonymized social networks, hiddern patterns and structural steganography. In International Worldwide Web Conference (WWW’07).

Digital Library

[3]

S. Bhagat, G. Cormode, B. Krishnamurthy, and D. Srivastava. 2009. Class-based graph anonymization for social network data. In International Conference on Very Large Databases (VLDB’09).

Digital Library

[4]

R. Chen. 2011. Publishing set-valued data via differential privacy. In International Conference on Very Large Databases (VLDB’11).

[5]

G. Cormode, D. Srivastava, N. Li, and T. Li. 2010. Minimizing and maximizing utility: Analyzing method-based attacks on anonymized data. In International Conference on Very Large Databases (VLDB’10).

Digital Library

[6]

G. Cormode, D. Srivastava, T. Yu, and Q. Zhang. 2008. Anonymizing bipartite graph data using safe groupings. In International Conference on Very Large Databases (VLDB’08).

Digital Library

[7]

R. A. Fisher and F. Yates. 1938. Statistical tables for biological, agricultural, and medical research. Oliver and Boyd, London, 20, Example 12.

[8]

A. Friedman and A. Schuster. 2010. Data mining with differential privacy. In International Conference on Knowledge Discovery and Data Mining (SIGKDD’10).

Digital Library

[9]

Samarati. 2001. Protecting respondents identities in microdata release. In Transactions on Knowledge and Data Engineering (TKDE’01).

Digital Library

[10]

L. Sweeney. 2002. k-Anonymity: A model for protecting privacy. In International Journal on Uncertainty, Fuzziness and Knowledge-Based Systems.

Digital Library

[11]

G. Ghinita, Y. Tao, and P. Kalnis. 2008. On the anonymization of sparse high-dimensional data. In International Conference on Data Engineering (ICDE’08).

Digital Library

[12]

A. Korolova, R. Motwani, S. Nabar, and Y. Xu. 2008. Link privacy in social networks. In International Conference on Data Engineering (ICDE’08).

Digital Library

[13]

A. Machanavajjhala, J. Gehrke, D. Kifer, and M. Venkitasubramaniam. 2006. l-Diversity: Privacy beyond k-anonymity. In International Conference on Data Engineering (ICDE’06).

Digital Library

[14]

V. Karwa, S. Raskhodnikova, A. Smith, and G. Yaroslavtsev. 2001. Private analysis of graph structure. In International Conference on Very Large Databases (VLDB’01).

[15]

S. Kasiviswanathan, K. Nissim, S. Raskhodnikova, and A. Smith. 2013. Analyzing graphs with node differential privacy. In Theory of Cryptography (TCC’13).

Digital Library

[16]

K. LeFevre, D. DeWitt, and R. Ramakrishnan. 2005. Incognito: Efficient full-domain K-anonymity. In Special Interest Group on Management of Data (SIGMOD’05).

Digital Library

[17]

N. Li, T. Li, and S. Venkatasubramanian. 2007. t-Closeness: Privacy beyond k- anonymity and l-diversity. In International Conference on Data Engineering (ICDE’05).

[18]

A. Sala, X. Zhao, C. Wilson, H. Zheng, and B. Y. Zhao. 2011. Sharing graphs using differentially private graph models. In Internet Measurement Conference (IMC’11).

Digital Library

[19]

A. Serjantov and G. Danezis. 2002. Towards an information theoretic metric for anonymity. In Privacy Enhancing Technologies Symposium (PETS’02).

Digital Library

[20]

C. Task and C. Clifton. 2013. What should we protect? Defining differential privacy for social network analysis. In Social Network Analysis and Mining.

[21]

G. Toth, Z. Hornak, and F. Vajda. 2004. Measuring anonymity revisited. In Nordic Workshop on Secure IT Systems (Nordsec).

[22]

R. C. Wong, A. W. Fu, K. Wang, and J. Pei. 2007. Attack in privacy preserving data publishing. In International Conference on Very Large Databases (VLDB’07).

Digital Library

[23]

R. Wong, J. Li, A. Fu, and K. Wang. 2006. (α, k)-Anonymity: An enhanced k-anonymity model for privacy-preserving data publishing. In International Conference on Knowledge Discovery and Data Mining (SIGKDD’06).

Digital Library

[24]

C. Dwork. 2006. Differential privacy. In International Colloquium on Automata, Languages, and Programming (ICALP’06).

Digital Library

[25]

X. Xiao and Y. Tao. 2006. Anatomy: Simple and effective privacy preservation. In International Conference on Very Large Databases (VLDB’06).

Digital Library

[26]

Y. Yang, Z. Zhang, G. Miklau, M. Winslett, and X. Xiao et al. 2012. Differential privacy in data publication and analysis. In Special Interest Group on Management of Data (SIGMOD’12).

Digital Library

[27]

Q. Zhang, N. Koudas, D. Srivastava, and T. Yu. 2007. Aggregate query answering on anonymized tables. In International Conference on Very Large Databases (VLDB’07).

[28]

B. Zhou and J. Pei. 2008. Preserving privacy in social networks against neighborhood attacks. In International Conference on Data Engineering (ICDE’08).

Digital Library

[29]

W. Day, Ni. Li, and M. Lyu. 2016. Publishing graph degree distribution with node differential privacy. In Special Interest Group on Management of Data (SIGMOD’16).

Digital Library

[30]

J. Zhang, G. Cormode, C. Procopiuc, D. Srivastava, and X. Xiao. 2015. Private release of graph statistics using ladder functions. In Special Interest Group on Management of Data (SIGMOD’15).

Digital Library

[31]

J. Blocki, A. Blum, A. Datta, and O. Sheffet. 2013. Differentially private data analysis of social networks via restricted sensitivity. In Innovations in Theoretical Computer Science (ITCS’13).

Digital Library

[32]

S. Chen and S. Zhou. 2013. Recursive mechanism: Towards node differential privacy and unrestricted joins. In Special Interest Group on Management of Data (SIGMOD’13).

Digital Library

[33]

E. Barker, M. Smid, D. Branstad, and S. Chokhani. 2013. NIST Special Publication 800 -130: A framework for designing cryptographic key management systems. In National Institute of Standards and Technology Report.

[34]

D. Turner. 2016. What is key management? A CISO perspective. In Cryptomathic.

Cited By

Bou Abdo JZeadally S(2024)Disposable identities: Solving web trackingJournal of Information Security and Applications10.1016/j.jisa.2024.10382184(103821)Online publication date: Aug-2024
https://doi.org/10.1016/j.jisa.2024.103821
Ling CZhang WHe H(2023)K-anonymity privacy-preserving algorithm for IoT applications in virtualization and edge computingCluster Computing10.1007/s10586-022-03755-426:2(1495-1510)Online publication date: 1-Apr-2023
https://dl.acm.org/doi/10.1007/s10586-022-03755-4
Zhang ZZhou YZhao XChe TLyu LKoyejo SMohamed SAgarwal ABelgrave DCho KOh A(2022)Prompt certified machine unlearning with randomized gradient smoothing and quantizationProceedings of the 36th International Conference on Neural Information Processing Systems10.5555/3600270.3601247(13433-13455)Online publication date: 28-Nov-2022
https://dl.acm.org/doi/10.5555/3600270.3601247
Show More Cited By

Index Terms

Privacy-Preserving Publishing of Multilevel Utility-Controlled Graph Datasets
1. Security and privacy
  1. Security services
    1. Privacy-preserving protocols

Recommendations

An effective value swapping method for privacy preserving data publishing

Privacy is an important concern in the society, and it has been a fundamental issue when to analyze and publish data involving human individual's sensitive information. Recently, the slicing method has been popularly used for privacy preservation in data ...
A Novel Differential Privacy Approach that Enhances Classification Accuracy
C3S2E '16: Proceedings of the Ninth International C* Conference on Computer Science & Software Engineering

In the recent past, there has been a tremendous increase of large repositories of data, examples being in healthcare data, consumer data from retailers, and airline passenger data. These data are continually being shared with interested parties, either ...
Knowledge Reserving in Privacy Preserving Data Mining
IITA '08: Proceedings of the 2008 Second International Symposium on Intelligent Information Technology Application - Volume 02

We present in this paper a novel method to protect data privacy in data mining. Nowadays, privacy is becoming an increasingly important issue in many data mining applications. Among the current privacy preserving techniques, data anonymization provides ...

Comments

Information & Contributors

Information

Published In

cover image ACM Transactions on Internet Technology

ACM Transactions on Internet Technology Volume 18, Issue 2

Special Issue on Internetware and Devops and Regular Papers

May 2018

294 pages

ISSN:1533-5399

EISSN:1557-6051

DOI:10.1145/3182619

Editor:
Munindar P. Singh
Department of Computer Science, North Carolina State University

Issue’s Table of Contents

Copyright © 2018 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 22 February 2018

Accepted: 01 July 2017

Revised: 01 May 2017

Received: 01 March 2017

Published in TOIT Volume 18, Issue 2

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article
Research
Refereed

Funding Sources

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

20
Total Citations
View Citations
389
Total Downloads

Downloads (Last 12 months)50
Downloads (Last 6 weeks)11

Reflects downloads up to 18 Jan 2025

Other Metrics

View Author Metrics

Citations

Cited By

Bou Abdo JZeadally S(2024)Disposable identities: Solving web trackingJournal of Information Security and Applications10.1016/j.jisa.2024.10382184(103821)Online publication date: Aug-2024
https://doi.org/10.1016/j.jisa.2024.103821
Ling CZhang WHe H(2023)K-anonymity privacy-preserving algorithm for IoT applications in virtualization and edge computingCluster Computing10.1007/s10586-022-03755-426:2(1495-1510)Online publication date: 1-Apr-2023
https://dl.acm.org/doi/10.1007/s10586-022-03755-4
Zhang ZZhou YZhao XChe TLyu LKoyejo SMohamed SAgarwal ABelgrave DCho KOh A(2022)Prompt certified machine unlearning with randomized gradient smoothing and quantizationProceedings of the 36th International Conference on Neural Information Processing Systems10.5555/3600270.3601247(13433-13455)Online publication date: 28-Nov-2022
https://dl.acm.org/doi/10.5555/3600270.3601247
Che TZhang ZZhou YZhao XLiu JJiang ZYan DJin RDou D(2022)Federated Fingerprint Learning with Heterogeneous Architectures2022 IEEE International Conference on Data Mining (ICDM)10.1109/ICDM54844.2022.00013(31-40)Online publication date: Nov-2022
https://doi.org/10.1109/ICDM54844.2022.00013
Yan YEyeleko AMahmood ALi JDong ZXu F(2022)Privacy preserving dynamic data release against synonymous linkage based on microaggregationScientific Reports10.1038/s41598-022-06182-y12:1Online publication date: 11-Feb-2022
https://doi.org/10.1038/s41598-022-06182-y
Su JCao YChen YLiu YSong J(2021)Privacy protection of medical data in social networkBMC Medical Informatics and Decision Making10.1186/s12911-021-01645-021:S1Online publication date: 18-Oct-2021
https://doi.org/10.1186/s12911-021-01645-0
Zhou YZhang ZWu SSheng VHan XZhang ZJin R(2021)Robust Network Alignment via Attack Signal Scaling and Adversarial Perturbation EliminationProceedings of the Web Conference 202110.1145/3442381.3449823(3884-3895)Online publication date: 19-Apr-2021
https://dl.acm.org/doi/10.1145/3442381.3449823
Yan YHerman EMahmood AFeng TXie P(2021)A weighted K-member clustering algorithm for K-anonymizationComputing10.1007/s00607-021-00922-0103:10(2251-2273)Online publication date: 1-Oct-2021
https://dl.acm.org/doi/10.1007/s00607-021-00922-0
Novgorodov SGuy IElad GRadinsky K(2020)Descriptions from the CustomersACM Transactions on Internet Technology10.1145/341820220:4(1-31)Online publication date: 6-Oct-2020
https://dl.acm.org/doi/10.1145/3418202
Kolomvatsos KAnagnostopoulos C(2020)An Intelligent Edge-centric Queries Allocation Scheme based on Ensemble ModelsACM Transactions on Internet Technology10.1145/341729720:4(1-25)Online publication date: 15-Oct-2020
https://dl.acm.org/doi/10.1145/3417297
Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

Media

Figures

Other

Tables

View Issue’s Table of Contents