DOI: 10.1145/3151759.3151809
short-paper

LinkedPipes ETL in use: practical publication and consumption of linked data

Published: 4 December 2017

ABSTRACT

Companies and institutions now realize the potential of Linked Open Data (LOD) and are starting to publish their own data as LOD. However, publishing LOD is still a challenging task. One of the main reasons is a lack of user-friendly tooling that properly supports the whole LOD publishing process. The process typically consists of source data extraction, transformation to RDF, alignment with commonly used vocabularies, linking to other datasets, computing metadata, publishing on the web as a dump, loading into a triplestore, and recording the dataset in a data catalog such as CKAN. In this paper we present LinkedPipes ETL, a tool for ETL-like LOD publishing that focuses mainly on supporting such LOD publishing workflows in a user-friendly way. In addition, the tool eases consumption of existing LOD sources by addressing some of the practical issues associated with consuming them. Finally, the tool itself uses Linked Data technologies to represent the ETL processes. We describe LinkedPipes ETL and its main distinguishing features in the context of the use cases in which the tool has already been deployed: an institution of public administration, a municipality, a university, a software company, and an open data initiative.


          Published in

          ACM Other conferences
          iiWAS '17: Proceedings of the 19th International Conference on Information Integration and Web-based Applications & Services
          December 2017
          609 pages
          ISBN:9781450352994
          DOI:10.1145/3151759

          Copyright © 2017 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

          Publisher

          Association for Computing Machinery

          New York, NY, United States



