Abstract
Open data initiatives and FAIR data principles have encouraged the publication of large volumes of data, encoding knowledge relevant for the advance of science and technology. However, to mine knowledge, it is usually required the processing of data collected from sources regulated by diverse access and privacy policies. We address the problem of enforcing data privacy and access regulations (EDPR) and propose PURE, a framework able to solve this problem during query processing. PURE relies on the local as view approach for defining the rules that represent the access control policies imposed over a federation of RDF knowledge graphs. Moreover, PURE maps the problem of checking if a query meets the privacy regulations to the problem of query rewriting (QRP) using views; it resorts to state-of-the-art QRP solutions for determining if a query violates or not the defined policies. We have evaluated the efficiency of PURE over the Berlin SPARQL Benchmark (BSBM). Observed results suggest that PURE is able to scale up to complex scenarios where a large number of rules represents diverse types of policies.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Hirshberg foundation for pancreatic cancer research. Prognosis. http://pancreatic.org/pancreatic-cancer/about-the-pancreas/prognosis/. Accessed 29 Mar 2019
Amini, M., Jalili, R.: Multi-level authorisation model and framework for distributed semantic-aware environments. IET Inf. Secur. 4(4), 301–321 (2010)
Arvelo, Y., Bonet, B., Vidal, M.: Compilation of query-rewriting problems into tractable fragments of propositional logic. In: Proceedings, The Twenty-First National Conference on Artificial Intelligence and the Eighteenth Innovative Applications of Artificial Intelligence Conference, Boston, Massachusetts, USA, 16–20 July 2006, pp. 225–230 (2006)
Blai Bonet: MCDSAT. https://github.com/bonetblai/mcdsat. Accessed 15 Sept 2019
Châabane, S., Jaziri, W., Gargouri, F.: A proposal for a geographic ontology merging methodology. In: 2009 International Conference on the Current Trends in Information Technology (CTIT), pp. 1–6. IEEE (2009)
Costabello, L., Villata, S., Gandon, F.: Context-aware access control for RDF graph stores. In: ECAI-20th European Conference on Artificial Intelligence (2012)
Dai, W., Wang, S., Xiong, H., Jiang, X.: Privacy preserving federated big data analysis. In: Srinivasan, S. (ed.) Guide to Big Data Applications. SBD, vol. 26, pp. 49–82. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-53817-4_3
Dewri, R., Ong, T., Thurimella, R.: Linking health records for federated query processing. Proc. Priv. Enhanc. Technol. 2016(3), 4–23 (2016)
Drake, T.A., et al.: A system for sharing routine surgical pathology specimens across institutions: the shared pathology informatics network. Hum. Pathol. 38(8), 1212–1225 (2007)
Endris, K.M., Almhithawi, Z., Lytra, I., Vidal, M.-E., Auer, S.: BOUNCER: privacy-aware query processing over federations of RDF datasets. In: Hartmann, S., Ma, H., Hameurlain, A., Pernul, G., Wagner, R.R. (eds.) DEXA 2018, Part I. LNCS, vol. 11029, pp. 69–84. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-98809-2_5
Gkoulalas-Divanis, A., Loukides, G., Sun, J.: Publishing data from electronic health records while preserving privacy: a survey of algorithms. J. Biomed. Inform. 50, 4–19 (2014)
Gomes, C.P., Kautz, H.A., Sabharwal, A., Selman, B.: Satisfiability solvers (2008)
Halevy, A.Y.: Answering queries using views: a survey. VLDB J. 10(4), 270–294 (2001)
Kamateri, E., Kalampokis, E., Tambouris, E., Tarabanis, K.: The linked medical data access control framework. J. Biomed. Inform. 50, 213–225 (2014)
Kenner, B.J.: Early detection of pancreatic cancer: the role of depression and anxiety as a precursor for disease. Pancreas 47(4), 363 (2018)
Khan, Y., et al.: SAFE: SPARQL federation over RDF data cubes with access control. J. Biomed. Semant. 8(1), 5 (2017)
Kirrane, S., Villata, S., d’Aquin, M.: Privacy, security and policies: a review of problems and solutions with semantic web technologies. Semant. Web 9(2), 153–161 (2018)
Kohane, I.S., Churchill, S.E., Murphy, S.N.: A translational engine at the national scale: informatics for integrating biology and the bedside. JAMIA 19(2), 181–185 (2012)
Lehmann, J., et al.: DBpedia - a large-scale, multilingual knowledge base extracted from Wikipedia. Semant. Web 6(2), 167–195 (2015)
Levy, A.Y., Rajaraman, A., Ordille, J.J.: Querying heterogeneous information sources using source descriptions. In: Proceedings of 22nd International Conference on Very Large Data Bases (1996)
Malin, B., Karp, D., Scheuermann, R.H.: Technical and policy approaches to balancing patient privacy and data sharing in clinical and translational research. J. Investig. Med. 58(1), 11–18 (2010)
Pérez, J., Arenas, M., Gutiérrez, C.: Semantics and complexity of SPARQL. ACM Trans. Database Syst. 34(3), 16:1–16:45 (2009)
Saad, A.M., et al.: Suicidal death within a year of a cancer diagnosis: a population-based study. Cancer 125(6), 972–979 (2019)
Shenoy, K.M., Shet, K.C., Acharya, U.D.: Secured ontology matching using graph matching. In: Meghanathan, N., Nagamalai, D., Chaki, N. (eds.) Advances in Computing and Information Technology. Advances in Intelligent Systems and Computing, vol. 177, pp. 11–18. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-31552-7_2
Stephens, Z.D., et al.: Big data: astronomical or genomical? PLoS ONE 13(7), e1002195 (2015)
Turaga, K.K., Malafa, M.P., Jacobsen, P.B., Schell, M.J., Sarr, M.G.: Suicide in patients with pancreatic cancer. Cancer 117(3), 642–647 (2011)
Unbehauen, J., Frommhold, M., Martin, M.: Enforcing scalable authorization on SPARQL queries. In: SEMANTiCS (Posters, Demos, SuCCESS) (2016)
Acknowledgement
This work has been partially supported by the EU H2020 RIA funded project iASiS with grant agreement No. 727658.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Goncalves, M., Vidal, ME., Endris, K.M. (2019). PURE: A Privacy Aware Rule-Based Framework over Knowledge Graphs. In: Hartmann, S., Küng, J., Chakravarthy, S., Anderst-Kotsis, G., Tjoa, A., Khalil, I. (eds) Database and Expert Systems Applications. DEXA 2019. Lecture Notes in Computer Science(), vol 11706. Springer, Cham. https://doi.org/10.1007/978-3-030-27615-7_15
Download citation
DOI: https://doi.org/10.1007/978-3-030-27615-7_15
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-27614-0
Online ISBN: 978-3-030-27615-7
eBook Packages: Computer ScienceComputer Science (R0)