Skip to main content

Lifting Tabular Data to RDF: A Survey

  • Conference paper
  • First Online:
Book cover Metadata and Semantic Research (MTSR 2020)

Abstract

Tabular data formats (e.g. CSV and spreadsheets) combine ease of use, versatility and compatibility with information management systems. Despite their numerous advantages, these formats typically rely on column headers and out-of-band agreement to convey semantics. There is clearly a large gap with respect to the Semantic Web, which uses RDF as a graph-based data model, while relying on ontologies for well-defined semantics. Several systems have been developed to close this gap, supporting the conversion of tabular data to RDF. This study is a survey of these systems, which have been analyzed and compared. We identified commonalities and differences among them, discussed different approaches and derived useful insights on the task.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    https://rml.io/specs/rml/.

  2. 2.

    https://labs.sparna.fr/skos-play/convert.

  3. 3.

    https://jena.apache.org/documentation/fuseki2/.

  4. 4.

    https://art.uniroma2.it/sheet2rdf/documentation/heuristics.jsf.

References

  1. Berners-Lee, T.: Linked Data. In: Design Issues (2006). https://www.w3.org/DesignIssues/LinkedData.html

  2. Celli, F., Anibaldi, S., Folch, M., Jaques, Y., Keizer, J.: OpenAGRIS: using bibliographical data for linking into the agricultural knowledge web. In: Agricultural Ontology Services (AOS), Bangkok, Thailand (2011)

    Google Scholar 

  3. Apache Any23 - CSV Extractor. https://any23.apache.org/dev-csv-extractor.html

  4. Grafter - Linked Data Machine Tools. https://grafter.org/

  5. Roman, D., et al.: DataGraft: one-stop-shop for open data management. Semant. Web 9(4), 393–411 (2018)

    Article  Google Scholar 

  6. Jupp, S. et al.: Populous: a tool for building OWL ontologies from templates. BMC Bioinformatics 13(1) (2012)

    Google Scholar 

  7. Egaña, M., Rector, A., Stevens, R., Antezana, E.: Applying ontology design patterns in bio-ontologies. In: Gangemi, A., Euzenat, J. (eds.) EKAW 2008. LNCS, vol. 5268. Springer, Heidelberg (2008). https://doi.org/10.1007/978-3-540-87696-0_4

  8. Gangemi, A., Presutti, V.: Ontology design patterns. In: Handbook on ontologies. Springer Berlin Heidelberg (2009), pp.221–243

    Google Scholar 

  9. Han, L., Finin, T., Parr, C., Sachs, J., Joshi, A.: RDF123: from spreadsheets to RDF. In: Sheth, A., et al. (eds.) ISWC 2008. LNCS, vol. 5318, pp. 451–466. Springer, Heidelberg (2008). https://doi.org/10.1007/978-3-540-88564-1_29

    Chapter  Google Scholar 

  10. Heyvaert, P., et al.: RMLEditor: a graph-based mapping editor for linked data mappings. In: Sack, H., Blomqvist, E., d’Aquin, M., Ghidini, C., Ponzetto, S.P., Lange, C. (eds.) ESWC 2016. LNCS, vol. 9678, pp. 709–723. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-34129-3_43

    Chapter  Google Scholar 

  11. Vandenbussche, P.-Y., Atemezing, G.A., Poveda-Villalón, M., Vatant, B.: Linked Open Vocabularies (LOV): a gateway to reusable semantic vocabularies on the Web. Semantic Web 8(3), 437–452 (2017)

    Article  Google Scholar 

  12. A DSL-based converter for spreadsheets to RDF. https://github.com/marcelotto/spread2rdf

  13. TabLinker.: https://github.com/Data2Semantics/TabLinker

  14. W3C: The RDF Data Cube Vocabulary. In: World Wide Web Consortium (W3C). Available at: https://www.w3.org/TR/vocab-data-cube/. Accessed 14 Jan 2014

  15. Ciccarese, P., Soiland-Reyes, S., Clark, T.: Web annotation as a first-class object. IEEE Internet Comput. 17(6), 71–75 (2013)

    Article  Google Scholar 

  16. Tarql. https://github.com/cygri/tarql

  17. Vertere-RDF. https://github.com/knudmoeller/Vertere-RDF

  18. Langegger, A., Wöß, W.: XLWrap – querying and integrating arbitrary spreadsheets with SPARQL. In: Bernstein, A., et al. (eds.) ISWC 2009. LNCS, vol. 5823, pp. 359–374. Springer, Heidelberg (2009). https://doi.org/10.1007/978-3-642-04930-9_23

    Chapter  Google Scholar 

  19. Sparqlify. https://aksw.org/Projects/Sparqlify.html

  20. O’Connor, M.J., Halaschek-Wiener, C., Musen, M.A.: Mapping master: a flexible approach for mapping spreadsheets to OWL. In: Patel-Schneider, P.F., et al. (eds.) ISWC 2010. LNCS, vol. 6497, pp. 194–208. Springer, Heidelberg (2010). https://doi.org/10.1007/978-3-642-17749-1_13

    Chapter  Google Scholar 

  21. In: PoolParty Semantic Suite - Semantic Technology Platform. https://www.poolparty.biz/

  22. ART Group: Sheet2RDF. https://art.uniroma2.it/sheet2rdf/

  23. Fiorelli, M., Lorenzetti, T., Pazienza, M.T., Stellato, A., Turbati, A.: Sheet2RDF: a flexible and dynamic spreadsheet import&lifting framework for RDF. In: Ali, M., Kwon, Y.S., Lee, C.-H., Kim, J., Kim, Y. (eds.) IEA/AIE 2015. LNCS (LNAI), vol. 9101, pp. 131–140. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-19066-2_13

    Chapter  Google Scholar 

  24. Stellato, A., et al.: VocBench 3: a collaborative semantic web editor for ontologies. Thesauri and Lexicons. Semantic Web 11(5), 855–881 (2020)

    Article  Google Scholar 

  25. World Wide Web Consortium (W3C): SKOS Simple Knowledge Organization System eXtension for Labels (SKOS-XL). In: World Wide Web Consortium (W3C). https://www.w3.org/TR/skos-reference/skos-xl.html. Accessed 18 Aug 2009

  26. Fiorelli, M., Pazienza, M.T., Stellato, A., Turbati, A.: CODA: computer-aided ontology development architecture. IBM J. Res. Dev. 58(2/3), 14:1–14:12 (2014)

    Google Scholar 

  27. Pazienza, M.T., Stellato, A., Turbati, A.: PEARL: ProjEction of annotations rule language, a language for projecting (UIMA) annotations over RDF knowledge bases. In: Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12), Istanbul, Turkey (2012)

    Google Scholar 

  28. TopBraid Composer. https://www.topquadrant.com/products/topbraid-composer/

  29. Ontotext: Ontotext GraphDB. https://graphdb.ontotext.com/

  30. Knoblock, C.A., et al.: Semi-automatically mapping structured sources into the semantic web. In: Simperl, E., Cimiano, P., Polleres, A., Corcho, O., Presutti, V. (eds.) ESWC 2012. LNCS, vol. 7295, pp. 375–390. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-30284-8_32

    Chapter  Google Scholar 

  31. Das, S., Sundara, S., Cyganiak, R.: R2RML: RDB to RDF mapping language. In: World Wide Web Consortium - Web Standards. https://www.w3.org/TR/r2rml/. Accessed 27 Sept 2012

  32. Ermilov, I., Auer, S., Stadler, C.: CSV2RDF: User-Driven CSV to RDF Mass Conversion Framework. In: Proceedings of the ISEM 2013, 04–06 September 2013, Graz, Austria (2013)

    Google Scholar 

  33. Krötzsch, M., Vrandečić, D., Völkel, M., Haller, H., Studer, R.: Semantic Wikipedia. Web Semantics Sci. Serv. Agents World Wide Web 5(4), 251–261 (2007)

    Article  Google Scholar 

  34. Martin, M., Abicht, K., Stadler, C., Auer, S., Ngomo, A.-C.N., Soru, T.: CubeViz - exploration and visualization of statistical linked data. In: Proceedings of the 24th International Conference on World Wide Web, WWW 2015 (2015)

    Google Scholar 

  35. Stadler, C., Martin, M., Auer, S.: Exploring the web of spatial data with facete. In: Companion proceedings of 23rd International World Wide Web Conference (WWW), pp. 175–178 (2014)

    Google Scholar 

  36. Lebo, T., Williams, G.T.: Converting governmental datasets into linked data. In: Proceedings of the 6th International Conference on Semantic Systems, New York, NY, USA, pp. 38:1–38:3 (2010)

    Google Scholar 

  37. Scharffe, F. et al.: Enabling linked data publication with the Datalift platform. In: AAAI Workshop on Semantic Cities (2012)

    Google Scholar 

  38. Klímek, J., Škoda, P., Nečaský, M.: LinkedPipes ETL: evolved linked data preparation. In: Sack, H., Rizzo, G., Steinmetz, N., Mladenić, D., Auer, S., Lange, C. (eds.) ESWC 2016. LNCS, vol. 9989, pp. 95–100. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-47602-5_20

    Chapter  Google Scholar 

  39. Knap, T., et al.: UnifiedViews: an ETL tool for RDF data management. Semantic Web 9(5), 661–676 (2018)

    Article  Google Scholar 

  40. World Wide Web Consortium (W3C): A Direct Mapping of Relational Data to RDF. In: World Wide Web Consortium (W3C). https://www.w3.org/TR/rdb-direct-mapping/. Accessed 27 Sept 2012

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Manuel Fiorelli .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Fiorelli, M., Stellato, A. (2021). Lifting Tabular Data to RDF: A Survey. In: Garoufallou, E., Ovalle-Perandones, MA. (eds) Metadata and Semantic Research. MTSR 2020. Communications in Computer and Information Science, vol 1355. Springer, Cham. https://doi.org/10.1007/978-3-030-71903-6_9

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-71903-6_9

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-71902-9

  • Online ISBN: 978-3-030-71903-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics