Skip to main content

Generating Executable Mappings from RDF Data Cube Data Structure Definitions

  • Conference paper
  • First Online:
On the Move to Meaningful Internet Systems. OTM 2018 Conferences (OTM 2018)

Abstract

Data processing is increasingly the subject of various internal and external regulations, such as GDPR which has recently come into effect. Instead of assuming that such processes avail of data sources (such as files and relational databases), we approach the problem in a more abstract manner and view these processes as taking datasets as input. These datasets are then created by pulling data from various data sources. Taking a W3C Recommendation for prescribing the structure of and for describing datasets, we investigate an extension of that vocabulary for the generation of executable R2RML mappings. This results in a top-down approach where one prescribes the dataset to be used by a data process and where to find the data, and where that prescription is subsequently used to retrieve the data for the creation of the dataset “just in time”. We argue that this approach to the generation of an R2RML mapping from a dataset description is the first step towards policy-aware mappings, where the generation takes into account regulations to generate mappings that are compliant. In this paper, we describe how one can obtain an R2RML mapping from a data structure definition in a declarative manner using SPARQL CONSTRUCT queries, and demonstrate it using a running example. Some of the more technical aspects are also described.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    While RDF provides us the data model, data is usually integrated using a common model captured in a so-called ontology. Ontologies – being commonly defined as “a [formal] explicit specification of a [shared] conceptualization” [10] – are also developed for a particular purpose, but the ontologies (or vocabularies) we observe on the Linked Data Web are often lightweight and meant for information exchange. Applications that consume such Linked Data are not (necessarily) known beforehand and are often published with very accessible licenses such as Creative Commons.

  2. 2.

    Even if we were not to use RDF Data Cube and R2RML, similar steps would be necessary for capturing the schema or structure of the dataset, and the creation and execution of a mapping to populate that schema.

  3. 3.

    We thank the anonymous reviewer for spotting this issue.

  4. 4.

    https://github.com/chrdebru/r2rml.

  5. 5.

    https://drill.apache.org/.

References

  1. Alexiev, V.: RDF by example: rdfpuml for true RDF diagrams, rdf2rml for R2RML generation. In: Semantic Web in Libraries 2016 (2016). http://vladimiralexiev.github.io/pres/20161128-rdfpuml-rdf2rml/

  2. Bizer, C.: D2R MAP - a database to RDF mapping language. In: King, I., Máray, T. (eds.) Proceedings of the Twelfth International World Wide Web Conference - Posters, WWW 2003, Budapest, Hungary, 20–24 May 2003 (2003)

    Google Scholar 

  3. Bizer, C., Seaborne, A.: D2RQ - treating non-RDF databases as virtual RDF graphs. In: ISWC 2004 (Posters), November 2004. http://sites.wiwiss.fu-berlin.de/suhl/bizer/pub/Bizer-D2RQ-ISWC2004-Poster.pdf

  4. Bonazzi, R., Hussami, L., Pigneur, Y.: Compliance management is becoming a major issue in IS design, pp. 391–398. Physica-Verlag HD, Heidelberg (2010). https://doi.org/10.1007/978-3-7908-2148-2_45

    Chapter  Google Scholar 

  5. Crotti Junior, A., Debruyne, C., Brennan, R., O’Sullivan, D.: An evaluation of uplift mapping languages. IJWIS 13(4), 405–424 (2017)

    Article  Google Scholar 

  6. Das, S., Cyganiak, R., Sundara, S.: R2RML: RDB to RDF mapping language. W3C recommendation, W3C, September 2012. http://www.w3.org/TR/2012/REC-r2rml-20120927/

  7. Debruyne, C., O’Sullivan, D.: R2RML-F: towards sharing and executing domain logic in R2RML mappings. In: Auer, S., Berners-Lee, T., Bizer, C., Heath, T. (eds.) Proceedings of the Workshop on Linked Data on the Web, LDOW 2016, co-located with 25th International World Wide Web Conference (WWW 2016). CEUR Workshop Proceedings, vol. 1593. CEUR-WS.org (2016). http://ceur-ws.org/Vol-1593/article-13.pdf

  8. Dimou, A., Vander Sande, M., Colpaert, P., Verborgh, R., Mannens, E., Van de Walle, R.: RML: a generic language for integrated RDF mappings of heterogeneous data. In: Bizer, C., Auer, S., Berners-Lee, T., Heath, T. (eds.) Proceedings of the Workshop on Linked Data on the Web, LDOW 2014, co-located with the 23rd International World Wide Web Conference (WWW 2014). CEUR Workshop Proceedings, vol. 1184. CEUR-WS.org (2014). http://ceur-ws.org/Vol-1184/ldow2014_paper_01.pdf

  9. Fatema, K., Hadziselimovic, E., Pandit, H.J., Debruyne, C., Lewis, D., O’Sullivan, D.: Compliance through informed consent: semantic based consent permission and data management model. In: Brewster, C., Cheatham, M., d’Aquin, M., Decker, S., Kirrane, S. (eds.) Proceedings of the 5th Workshop on Society, Privacy and the Semantic Web - Policy and Technology (PrivOn 2017), co-located with 16th International Semantic Web Conference (ISWC 2017), Vienna, Austria, 22 October 2017. CEUR Workshop Proceedings, vol. 1951. CEUR-WS.org (2017). http://ceur-ws.org/Vol-1951/PrivOn2017_paper_5.pdf

  10. Gruber, T.R.: Toward principles for the design of ontologies used for knowledge sharing? Int. J. Hum.-Comput. Stud. 43(5–6), 907–928 (1995). https://doi.org/10.1006/ijhc.1995.1081

    Article  Google Scholar 

  11. Michel, F., Djimenou, L., Faron-Zucker, C., Montagnat, J.: Translation of relational and non-relational databases into RDF with xR2RML. In: Monfort, V., Krempels, K.-H., Majchrzak, T.A., Turk, Z. (eds.) WEBIST 2015 - Proceedings of the 11th International Conference on Web Information Systems and Technologies, pp. 443–454. SciTePress (2015)

    Google Scholar 

  12. Kalampokis, E., et al.: Exploiting linked data cubes with opencube toolkit. In: Horridge, M., Rospocher, M., van Ossenbruggen, J. (eds.) Proceedings of the ISWC 2014 Posters & Demonstrations Track a Track within the 13th International Semantic Web Conference, ISWC 2014, Riva del Garda, Italy, 21 October 2014. CEUR Workshop Proceedings, vol. 1272, pp. 137–140. CEUR- WS.org (2014). http://ceur-ws.org/Vol-1272/paper109.pdf

  13. Lehmann, J., et al.: DBpedia - a large-scale, multilingual knowledge base extracted from Wikipedia. Semant. Web 6(2), 167–195 (2015). https://doi.org/10.3233/SW-140134

    Article  Google Scholar 

  14. Meroo-Peuela, A., Hoekstra, R., Guret, C., Schlobach, S.: Detecting and reporting extensional concept drift in statistical linked data. In: Capadisli, S., Cotton, F., Cyganiak, R., Haller, A., Hamilton, A., Troncy, R. (eds.) Proceedings of the 1st International Workshop on Semantic Statistics (SemStats), No. 1549 in CEUR Workshop Proceedings, Aachen (2013). http://ceur-ws.org/Vol-1549/#article-10

  15. Open Geospatial Consortium: GeoSPARQL - a geographic query language for RDF data. OGC (2012). http://www.opengeospatial.org/standards/geosparql

  16. Pandit, H.J., Lewis, D.: Modelling provenance for GDPR compliance using linked open data vocabularies. In: Brewster, C., Cheatham, M., d’Aquin, M., Decker, S., Kirrane, S. (eds.) Proceedings of the 5th Workshop on Society, Privacy and the Semantic Web - Policy and Technology (PrivOn 2017), co-located with 16th International Semantic Web Conference (ISWC 2017), Vienna, Austria, 22 October 2017. CEUR Workshop Proceedings, vol. 1951. CEUR-WS.org (2017). http://ceur-ws.org/Vol-1951/PrivOn2017paper6.pdf

  17. Pollock, R., Tennison, J., Kellogg, G., Herman, I.: Metadata vocabulary for tabular data. W3C recommendation, W3C, December 2015. https://www.w3.org/TR/2015/REC-tabular-metadata-20151217/

  18. Raimond, Y., Schreiber, G.: RDF 1.1 primer. W3C note, W3C, June 2014. http://www.w3.org/TR/2014/NOTE-rdf11-primer-20140624/

  19. Reynolds, D., Cyganiak, R.: The RDF data cube vocabulary. W3C recommendation, W3C, January 2014. http://www.w3.org/TR/2014/REC-vocab-data-cube-20140116/

  20. Rijgersberg, H., Wigham, M., Top, J.L.: How semantics can improve engineering processes: a case of units of measure and quantities. Adv. Eng. Inf. 25(2), 276–287 (2011). https://doi.org/10.1016/j.aei.2010.07.008

    Article  Google Scholar 

  21. Sahoo, S., McGuinness, D., Lebo, T.: PROV-o: the PROV ontology. W3C recommendation, W3C, April 2013. http://www.w3.org/TR/2013/REC-prov-o- 20130430/

  22. Salas, P.E.R., Mota, F.M.D., Breitman, K.K., Casanova, M.A., Martin, M., Auer, S.: Publishing statistical data on the web. Int. J. Semant. Comput. 6(4), 373–388 (2012). https://doi.org/10.1142/S1793351X12400119

    Article  Google Scholar 

  23. Skjaeveland, M.G., Giese, M., Hovland, D., Lian, E.H., Waaler, A.: Engineering ontology-based access to real-world data sources. J. Web Semant. 33, 112–140 (2015). https://doi.org/10.1016/j.websem.2015.03.002

    Article  Google Scholar 

  24. Tandy, J., Herman, I., Kellogg, G.: Generating RDF from tabular data on the web. W3C recommendation, W3C, December 2015. https://www.w3.org/TR/2015/REC-csv2rdf-20151217/

  25. Wigham, M., Rijgersberg, H., de Vos, M., Top, J.: Semantic support for tables using RDF record table. Int. J. Adv. Intell. Syst. 8(1–2), 128–144 (2015)

    Google Scholar 

  26. Zhao, J., Hartig, O.: Towards interoperable provenance publication on the linked data web. In: Bizer, C., Heath, T., Berners-Lee, T., Hausenblas, M. (eds.) WWW 2012 Workshop on Linked Data on the Web, Lyon, France, 16 April 2012. CEUR Workshop Proceedings, vol. 937. CEUR-WS.org (2012). http://ceur-ws.org/Vol-937/ldow2012-paper-03.pdf

Download references

Acknowledgements

The ADAPT Centre for Digital Content Technology is funded under the SFI Research Centres Programme (Grant 13/RC/2106) and is co-funded under the European Regional Development Fund.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Christophe Debruyne .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Debruyne, C., Lewis, D., O’Sullivan, D. (2018). Generating Executable Mappings from RDF Data Cube Data Structure Definitions. In: Panetto, H., Debruyne, C., Proper, H., Ardagna, C., Roman, D., Meersman, R. (eds) On the Move to Meaningful Internet Systems. OTM 2018 Conferences. OTM 2018. Lecture Notes in Computer Science(), vol 11230. Springer, Cham. https://doi.org/10.1007/978-3-030-02671-4_21

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-02671-4_21

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-02670-7

  • Online ISBN: 978-3-030-02671-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics