ABSTRACT
The massive data collected from buildings provide opportunities for data- and information-based building management. Furthermore, to benefit from collective efforts in research communities, there arises a need for methods to share building-related data in a privacy-preserving manner while being able to ensure the utility of published datasets. In this demo abstract, we present PAD, an open-sourced data publication system that offers k-anonymity guarantee. The novelty of this system is to incorporate data recipients' feedbacks into the publication process in order to improve data utility. We demonstrate the interface of PAD and highlight how participants (as data publishers) can generate sanitized datasets using this interface. Also, we demonstrate how participants (as data users) can provide feedback to PAD for improving data quality.
- Josep Domingo-Ferrer and Josep Maria Mateo-Sanz. 2002. Practical data-oriented microaggregation for statistical disclosure control. IEEE Transactions on Knowledge and data Engineering 14, 1 (2002), 189--201. Google ScholarDigital Library
- Ruoxi Jia, Fisayo Caleb Sangogboye, Tianzhen Hong, Costas Spanos, and Mikkel Baun Kjærgaard. 2017. PAD: Protecting Anonymity in Publishing Building Related Datasets. In Proceedings of the 4th ACM Conference on Embedded Systems for Energy-Efficient Buildings. ACM. Google ScholarDigital Library
Recommendations
PAD: protecting anonymity in publishing building related datasets
BuildSys '17: Proceedings of the 4th ACM International Conference on Systems for Energy-Efficient Built EnvironmentsThe diffusion of low-cost sensor network technologies in smart buildings has enabled the collection of massive amounts of data regarding indoor environments, energy use and occupants, which, in turn, creates opportunities for knowledge- and information-...
A Data Sanitization Method for Privacy Preserving Data Re-publication
NCM '08: Proceedings of the 2008 Fourth International Conference on Networked Computing and Advanced Information Management - Volume 02When a table containing personal information is published, sensitive information should not be revealed. Although k-anonymity and l-diversity models are popular approaches to protect privacy, they are limited to one time data publishing. After a dataset ...
Comments