Abstract
Tabular data formats (e.g. CSV and spreadsheets) combine ease of use, versatility and compatibility with information management systems. Despite their numerous advantages, these formats typically rely on column headers and out-of-band agreement to convey semantics. There is clearly a large gap with respect to the Semantic Web, which uses RDF as a graph-based data model, while relying on ontologies for well-defined semantics. Several systems have been developed to close this gap, supporting the conversion of tabular data to RDF. This study is a survey of these systems, which have been analyzed and compared. We identified commonalities and differences among them, discussed different approaches and derived useful insights on the task.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Berners-Lee, T.: Linked Data. In: Design Issues (2006). https://www.w3.org/DesignIssues/LinkedData.html
Celli, F., Anibaldi, S., Folch, M., Jaques, Y., Keizer, J.: OpenAGRIS: using bibliographical data for linking into the agricultural knowledge web. In: Agricultural Ontology Services (AOS), Bangkok, Thailand (2011)
Apache Any23 - CSV Extractor. https://any23.apache.org/dev-csv-extractor.html
Grafter - Linked Data Machine Tools. https://grafter.org/
Roman, D., et al.: DataGraft: one-stop-shop for open data management. Semant. Web 9(4), 393–411 (2018)
Jupp, S. et al.: Populous: a tool for building OWL ontologies from templates. BMC Bioinformatics 13(1) (2012)
Egaña, M., Rector, A., Stevens, R., Antezana, E.: Applying ontology design patterns in bio-ontologies. In: Gangemi, A., Euzenat, J. (eds.) EKAW 2008. LNCS, vol. 5268. Springer, Heidelberg (2008). https://doi.org/10.1007/978-3-540-87696-0_4
Gangemi, A., Presutti, V.: Ontology design patterns. In: Handbook on ontologies. Springer Berlin Heidelberg (2009), pp.221–243
Han, L., Finin, T., Parr, C., Sachs, J., Joshi, A.: RDF123: from spreadsheets to RDF. In: Sheth, A., et al. (eds.) ISWC 2008. LNCS, vol. 5318, pp. 451–466. Springer, Heidelberg (2008). https://doi.org/10.1007/978-3-540-88564-1_29
Heyvaert, P., et al.: RMLEditor: a graph-based mapping editor for linked data mappings. In: Sack, H., Blomqvist, E., d’Aquin, M., Ghidini, C., Ponzetto, S.P., Lange, C. (eds.) ESWC 2016. LNCS, vol. 9678, pp. 709–723. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-34129-3_43
Vandenbussche, P.-Y., Atemezing, G.A., Poveda-Villalón, M., Vatant, B.: Linked Open Vocabularies (LOV): a gateway to reusable semantic vocabularies on the Web. Semantic Web 8(3), 437–452 (2017)
A DSL-based converter for spreadsheets to RDF. https://github.com/marcelotto/spread2rdf
TabLinker.: https://github.com/Data2Semantics/TabLinker
W3C: The RDF Data Cube Vocabulary. In: World Wide Web Consortium (W3C). Available at: https://www.w3.org/TR/vocab-data-cube/. Accessed 14 Jan 2014
Ciccarese, P., Soiland-Reyes, S., Clark, T.: Web annotation as a first-class object. IEEE Internet Comput. 17(6), 71–75 (2013)
Vertere-RDF. https://github.com/knudmoeller/Vertere-RDF
Langegger, A., Wöß, W.: XLWrap – querying and integrating arbitrary spreadsheets with SPARQL. In: Bernstein, A., et al. (eds.) ISWC 2009. LNCS, vol. 5823, pp. 359–374. Springer, Heidelberg (2009). https://doi.org/10.1007/978-3-642-04930-9_23
Sparqlify. https://aksw.org/Projects/Sparqlify.html
O’Connor, M.J., Halaschek-Wiener, C., Musen, M.A.: Mapping master: a flexible approach for mapping spreadsheets to OWL. In: Patel-Schneider, P.F., et al. (eds.) ISWC 2010. LNCS, vol. 6497, pp. 194–208. Springer, Heidelberg (2010). https://doi.org/10.1007/978-3-642-17749-1_13
In: PoolParty Semantic Suite - Semantic Technology Platform. https://www.poolparty.biz/
ART Group: Sheet2RDF. https://art.uniroma2.it/sheet2rdf/
Fiorelli, M., Lorenzetti, T., Pazienza, M.T., Stellato, A., Turbati, A.: Sheet2RDF: a flexible and dynamic spreadsheet import&lifting framework for RDF. In: Ali, M., Kwon, Y.S., Lee, C.-H., Kim, J., Kim, Y. (eds.) IEA/AIE 2015. LNCS (LNAI), vol. 9101, pp. 131–140. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-19066-2_13
Stellato, A., et al.: VocBench 3: a collaborative semantic web editor for ontologies. Thesauri and Lexicons. Semantic Web 11(5), 855–881 (2020)
World Wide Web Consortium (W3C): SKOS Simple Knowledge Organization System eXtension for Labels (SKOS-XL). In: World Wide Web Consortium (W3C). https://www.w3.org/TR/skos-reference/skos-xl.html. Accessed 18 Aug 2009
Fiorelli, M., Pazienza, M.T., Stellato, A., Turbati, A.: CODA: computer-aided ontology development architecture. IBM J. Res. Dev. 58(2/3), 14:1–14:12 (2014)
Pazienza, M.T., Stellato, A., Turbati, A.: PEARL: ProjEction of annotations rule language, a language for projecting (UIMA) annotations over RDF knowledge bases. In: Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12), Istanbul, Turkey (2012)
TopBraid Composer. https://www.topquadrant.com/products/topbraid-composer/
Ontotext: Ontotext GraphDB. https://graphdb.ontotext.com/
Knoblock, C.A., et al.: Semi-automatically mapping structured sources into the semantic web. In: Simperl, E., Cimiano, P., Polleres, A., Corcho, O., Presutti, V. (eds.) ESWC 2012. LNCS, vol. 7295, pp. 375–390. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-30284-8_32
Das, S., Sundara, S., Cyganiak, R.: R2RML: RDB to RDF mapping language. In: World Wide Web Consortium - Web Standards. https://www.w3.org/TR/r2rml/. Accessed 27 Sept 2012
Ermilov, I., Auer, S., Stadler, C.: CSV2RDF: User-Driven CSV to RDF Mass Conversion Framework. In: Proceedings of the ISEM 2013, 04–06 September 2013, Graz, Austria (2013)
Krötzsch, M., Vrandečić, D., Völkel, M., Haller, H., Studer, R.: Semantic Wikipedia. Web Semantics Sci. Serv. Agents World Wide Web 5(4), 251–261 (2007)
Martin, M., Abicht, K., Stadler, C., Auer, S., Ngomo, A.-C.N., Soru, T.: CubeViz - exploration and visualization of statistical linked data. In: Proceedings of the 24th International Conference on World Wide Web, WWW 2015 (2015)
Stadler, C., Martin, M., Auer, S.: Exploring the web of spatial data with facete. In: Companion proceedings of 23rd International World Wide Web Conference (WWW), pp. 175–178 (2014)
Lebo, T., Williams, G.T.: Converting governmental datasets into linked data. In: Proceedings of the 6th International Conference on Semantic Systems, New York, NY, USA, pp. 38:1–38:3 (2010)
Scharffe, F. et al.: Enabling linked data publication with the Datalift platform. In: AAAI Workshop on Semantic Cities (2012)
Klímek, J., Škoda, P., Nečaský, M.: LinkedPipes ETL: evolved linked data preparation. In: Sack, H., Rizzo, G., Steinmetz, N., Mladenić, D., Auer, S., Lange, C. (eds.) ESWC 2016. LNCS, vol. 9989, pp. 95–100. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-47602-5_20
Knap, T., et al.: UnifiedViews: an ETL tool for RDF data management. Semantic Web 9(5), 661–676 (2018)
World Wide Web Consortium (W3C): A Direct Mapping of Relational Data to RDF. In: World Wide Web Consortium (W3C). https://www.w3.org/TR/rdb-direct-mapping/. Accessed 27 Sept 2012
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Fiorelli, M., Stellato, A. (2021). Lifting Tabular Data to RDF: A Survey. In: Garoufallou, E., Ovalle-Perandones, MA. (eds) Metadata and Semantic Research. MTSR 2020. Communications in Computer and Information Science, vol 1355. Springer, Cham. https://doi.org/10.1007/978-3-030-71903-6_9
Download citation
DOI: https://doi.org/10.1007/978-3-030-71903-6_9
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-71902-9
Online ISBN: 978-3-030-71903-6
eBook Packages: Computer ScienceComputer Science (R0)