Skip to main content

Extraction and Semantic Annotation of Workshop Proceedings in HTML Using RML

  • Conference paper
  • First Online:
Semantic Web Evaluation Challenge (SemWebEval 2014)

Abstract

Despite the significant number of existing tools, incorporating data into the Linked Open Data cloud remains complicated; hence discouraging data owners to publish their data as Linked Data. Unlocking the semantics of published data, even if they are not provided by the data owners, can contribute to surpass the barriers posed by the low availability of Linked Data and come closer to the realisation of the envisaged Semantic Web. rml, a generic mapping language based on an extension over , the standard for mapping relational databases into rdf, offers a uniform way of defining the mapping rules for data in heterogeneous formats. In this paper, we present how we adjusted our prototype rml  Processor, taking advantage of rml’s scalability, to extract and map data of workshop proceedings published in html to the rdf data model for the Semantic Publishing Challenge needs.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    http://2014.eswc-conferences.org/semantic-publishing-challenge

  2. 2.

    https://github.com/mmlab/RMLProcessor

  3. 3.

    http://www.w3.org/TR/xhtml11/

  4. 4.

    http://triplr.org/

  5. 5.

    http://rml.io

  6. 6.

    http://www.w3.org/TR/selectors/

  7. 7.

    http://jquery.com

  8. 8.

    https://github.com/mmlab/RMLProcessor

  9. 9.

    http://jodd.org/doc/csselly/

  10. 10.

    http://www.isi.edu/integration/karma/

  11. 11.

    http://openrefine.org/

References

  1. Coetzee, P., Heath, T., Motta, E.: Sparqplug: generating linked data from legacy HTML, SPARQL and the DOM (2008)

    Google Scholar 

  2. Connolly, D.: Gleaning resource descriptions from dialects of languages (GRDDL). W3C recommendation, September 2007

    Google Scholar 

  3. Dimou, A., Vander Sande, M., Colpaert, P., Verborgh, R., Mannens, E., Van de Walle, R.: RML: a generic language for integrated RDF mappings of heterogeneous data. In: Workshop on Linked Data on the Web (2013)

    Google Scholar 

  4. Dimou, A., Vander Sande, M., De Nies, T., Verborgh, R., Mannens, E., Van de Walle, R.: RDF mapping rules refinements according to data consumers feedback. In: 2nd International World Wide Web Conference, Poster Track Proceedings (2014)

    Google Scholar 

  5. Droop, M., et al.: Translating XPath queries into SPARQL queries. In: Meersman, R., Tari, Z. (eds.) OTM 2007 Workshops, Part I. LNCS, vol. 4805, pp. 9–10. Springer, Heidelberg (2007)

    Google Scholar 

Download references

Acknowledgments

The described research activities were funded by Ghent University, the Institute for the Promotion of Innovation by Science and Technology in Flanders (IWT), the Fund for Scientific Research Flanders (FWO Flanders), and the European Union.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Anastasia Dimou .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Dimou, A. et al. (2014). Extraction and Semantic Annotation of Workshop Proceedings in HTML Using RML. In: Presutti, V., et al. Semantic Web Evaluation Challenge. SemWebEval 2014. Communications in Computer and Information Science, vol 475. Springer, Cham. https://doi.org/10.1007/978-3-319-12024-9_15

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-12024-9_15

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-12023-2

  • Online ISBN: 978-3-319-12024-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics