Skip to main content

PURE: A Privacy Aware Rule-Based Framework over Knowledge Graphs

  • Conference paper
  • First Online:
Database and Expert Systems Applications (DEXA 2019)

Abstract

Open data initiatives and FAIR data principles have encouraged the publication of large volumes of data, encoding knowledge relevant for the advance of science and technology. However, to mine knowledge, it is usually required the processing of data collected from sources regulated by diverse access and privacy policies. We address the problem of enforcing data privacy and access regulations (EDPR) and propose PURE, a framework able to solve this problem during query processing. PURE relies on the local as view approach for defining the rules that represent the access control policies imposed over a federation of RDF knowledge graphs. Moreover, PURE maps the problem of checking if a query meets the privacy regulations to the problem of query rewriting (QRP) using views; it resorts to state-of-the-art QRP solutions for determining if a query violates or not the defined policies. We have evaluated the efficiency of PURE over the Berlin SPARQL Benchmark (BSBM). Observed results suggest that PURE is able to scale up to complex scenarios where a large number of rules represents diverse types of policies.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Hirshberg foundation for pancreatic cancer research. Prognosis. http://pancreatic.org/pancreatic-cancer/about-the-pancreas/prognosis/. Accessed 29 Mar 2019

  2. Amini, M., Jalili, R.: Multi-level authorisation model and framework for distributed semantic-aware environments. IET Inf. Secur. 4(4), 301–321 (2010)

    Article  Google Scholar 

  3. Arvelo, Y., Bonet, B., Vidal, M.: Compilation of query-rewriting problems into tractable fragments of propositional logic. In: Proceedings, The Twenty-First National Conference on Artificial Intelligence and the Eighteenth Innovative Applications of Artificial Intelligence Conference, Boston, Massachusetts, USA, 16–20 July 2006, pp. 225–230 (2006)

    Google Scholar 

  4. Blai Bonet: MCDSAT. https://github.com/bonetblai/mcdsat. Accessed 15 Sept 2019

  5. Châabane, S., Jaziri, W., Gargouri, F.: A proposal for a geographic ontology merging methodology. In: 2009 International Conference on the Current Trends in Information Technology (CTIT), pp. 1–6. IEEE (2009)

    Google Scholar 

  6. Costabello, L., Villata, S., Gandon, F.: Context-aware access control for RDF graph stores. In: ECAI-20th European Conference on Artificial Intelligence (2012)

    Google Scholar 

  7. Dai, W., Wang, S., Xiong, H., Jiang, X.: Privacy preserving federated big data analysis. In: Srinivasan, S. (ed.) Guide to Big Data Applications. SBD, vol. 26, pp. 49–82. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-53817-4_3

    Chapter  Google Scholar 

  8. Dewri, R., Ong, T., Thurimella, R.: Linking health records for federated query processing. Proc. Priv. Enhanc. Technol. 2016(3), 4–23 (2016)

    Article  Google Scholar 

  9. Drake, T.A., et al.: A system for sharing routine surgical pathology specimens across institutions: the shared pathology informatics network. Hum. Pathol. 38(8), 1212–1225 (2007)

    Article  Google Scholar 

  10. Endris, K.M., Almhithawi, Z., Lytra, I., Vidal, M.-E., Auer, S.: BOUNCER: privacy-aware query processing over federations of RDF datasets. In: Hartmann, S., Ma, H., Hameurlain, A., Pernul, G., Wagner, R.R. (eds.) DEXA 2018, Part I. LNCS, vol. 11029, pp. 69–84. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-98809-2_5

    Chapter  Google Scholar 

  11. Gkoulalas-Divanis, A., Loukides, G., Sun, J.: Publishing data from electronic health records while preserving privacy: a survey of algorithms. J. Biomed. Inform. 50, 4–19 (2014)

    Article  Google Scholar 

  12. Gomes, C.P., Kautz, H.A., Sabharwal, A., Selman, B.: Satisfiability solvers (2008)

    Google Scholar 

  13. Halevy, A.Y.: Answering queries using views: a survey. VLDB J. 10(4), 270–294 (2001)

    Article  Google Scholar 

  14. Kamateri, E., Kalampokis, E., Tambouris, E., Tarabanis, K.: The linked medical data access control framework. J. Biomed. Inform. 50, 213–225 (2014)

    Article  Google Scholar 

  15. Kenner, B.J.: Early detection of pancreatic cancer: the role of depression and anxiety as a precursor for disease. Pancreas 47(4), 363 (2018)

    Google Scholar 

  16. Khan, Y., et al.: SAFE: SPARQL federation over RDF data cubes with access control. J. Biomed. Semant. 8(1), 5 (2017)

    Article  Google Scholar 

  17. Kirrane, S., Villata, S., d’Aquin, M.: Privacy, security and policies: a review of problems and solutions with semantic web technologies. Semant. Web 9(2), 153–161 (2018)

    Article  Google Scholar 

  18. Kohane, I.S., Churchill, S.E., Murphy, S.N.: A translational engine at the national scale: informatics for integrating biology and the bedside. JAMIA 19(2), 181–185 (2012)

    Google Scholar 

  19. Lehmann, J., et al.: DBpedia - a large-scale, multilingual knowledge base extracted from Wikipedia. Semant. Web 6(2), 167–195 (2015)

    Google Scholar 

  20. Levy, A.Y., Rajaraman, A., Ordille, J.J.: Querying heterogeneous information sources using source descriptions. In: Proceedings of 22nd International Conference on Very Large Data Bases (1996)

    Google Scholar 

  21. Malin, B., Karp, D., Scheuermann, R.H.: Technical and policy approaches to balancing patient privacy and data sharing in clinical and translational research. J. Investig. Med. 58(1), 11–18 (2010)

    Article  Google Scholar 

  22. Pérez, J., Arenas, M., Gutiérrez, C.: Semantics and complexity of SPARQL. ACM Trans. Database Syst. 34(3), 16:1–16:45 (2009)

    Article  Google Scholar 

  23. Saad, A.M., et al.: Suicidal death within a year of a cancer diagnosis: a population-based study. Cancer 125(6), 972–979 (2019)

    Article  Google Scholar 

  24. Shenoy, K.M., Shet, K.C., Acharya, U.D.: Secured ontology matching using graph matching. In: Meghanathan, N., Nagamalai, D., Chaki, N. (eds.) Advances in Computing and Information Technology. Advances in Intelligent Systems and Computing, vol. 177, pp. 11–18. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-31552-7_2

    Chapter  Google Scholar 

  25. Stephens, Z.D., et al.: Big data: astronomical or genomical? PLoS ONE 13(7), e1002195 (2015)

    Article  Google Scholar 

  26. Turaga, K.K., Malafa, M.P., Jacobsen, P.B., Schell, M.J., Sarr, M.G.: Suicide in patients with pancreatic cancer. Cancer 117(3), 642–647 (2011)

    Article  Google Scholar 

  27. Unbehauen, J., Frommhold, M., Martin, M.: Enforcing scalable authorization on SPARQL queries. In: SEMANTiCS (Posters, Demos, SuCCESS) (2016)

    Google Scholar 

Download references

Acknowledgement

This work has been partially supported by the EU H2020 RIA funded project iASiS with grant agreement No. 727658.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Marlene Goncalves .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Goncalves, M., Vidal, ME., Endris, K.M. (2019). PURE: A Privacy Aware Rule-Based Framework over Knowledge Graphs. In: Hartmann, S., Küng, J., Chakravarthy, S., Anderst-Kotsis, G., Tjoa, A., Khalil, I. (eds) Database and Expert Systems Applications. DEXA 2019. Lecture Notes in Computer Science(), vol 11706. Springer, Cham. https://doi.org/10.1007/978-3-030-27615-7_15

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-27615-7_15

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-27614-0

  • Online ISBN: 978-3-030-27615-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics