Abstract
The appearance of Linked Open Data (LOD) was an important milestone for reaching a Web of Data. More and more RDF data sets get published to be consumed and integrated into a variety of applications. Pointing out one application, Linked Data can be used to enrich web pages with semantic annotations. This gives readers the chance to recall Semantic Web’s knowledge about text passages. RDFa provides a well-defined base, as it extends HTML tags in web pages to a form that contains RDF data. Nevertheless, asking web authors to manually annotate their web pages with semantic annotations is illusive. We present Epiphany, a service that annotates Linked Data to web pages automatically by creating RDFa enhanced versions of the input HTML pages. In Epiphany, Linked Data can be any RDF dataset or mashup (e.g., DBpedia, BBC programs, etc.). Based on ontology-based information extraction and the dataset, Epiphany generates an RDF graph about a web page’s content. Based on this RDF graph, RDFa annotations are generated and integrated in an RDFa enhanced version of the web page. Authors can use Epiphany to get RDFa enhanced versions of their articles that link to Linked Data models. Readers may use Epiphany to receive RDFa enhanced versions of web pages while surfing. We analysed results of Epiphany with Linked Data from BBC about music biographies and show a similar quality compared to results of Open Calais. Epiphany provides annotations from a couple of Linked Data sets.
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Bizer, C., Heath, T., Berners-Lee, T.: Linked Data – the story so far. Int. Journal on Semantic Web and Information Systems, IJSWIS (2009)
W3C: RDFa in XHTML: syntax and processing rules for embedding rdf through attributes. W3C working draft, W3C (2010)
Khare, R.: Microformats: The next (small) thing on the semantic web? IEEE Internet Computing 10(1), 68–75 (2006)
Burel, G., Cano, A.E., Lanfranchi, V.: Ozone browser: Augmenting the web with semantic overlays. In: Proceedings of the 5th Workshop on Scripting and Development for the Semantic Web SFSW 2009. CEUR Workshop Proceedings, vol. 449 (2009)
Corlosquet, S., Delbru, R., Clark, T., Polleres, A., Decker, S.: Produce and Consume Linked Data with Drupal! In: Bernstein, A., Karger, D.R., Heath, T., Feigenbaum, L., Maynard, D., Motta, E., Thirunarayan, K. (eds.) ISWC 2009. LNCS, vol. 5823, pp. 763–778. Springer, Heidelberg (2009)
Google: Help us make the web better: An update on Rich Snippets (2009), http://googlewebmastercentral.blogspot.com/2009/10/help-us-make-web-better-update-on-rich.html
Yahoo! Inc.: SearchMonkey Guide - A Manual for SearchMonkey Developers and Publishers (2008), http://developer.yahoo.com/searchmonkey/smguide
Bizer, C., Cyganiak, R., Heath, T.: How to publish linked data on the web. Web page (2007), http://www4.wiwiss.fu-berlin.de/bizer/pub/LinkedDataTutorial
Handschuh, S., Staab, S., Ciravegna, F.: S-CREAM - Semi-automatic CREAtion of Metadata. In: Gómez-Pérez, A., Benjamins, V.R. (eds.) EKAW 2002. LNCS (LNAI), vol. 2473, pp. 358–372. Springer, Heidelberg (2002)
Adrian, B.: Incorporating ontological background knowledge into information extraction. In: Maynard, D. (ed.) ISWC 2009 Doctoral Consortium (2009)
Huynh, D., Mazzocchi, S., Karger, D.: Piggy bank: Experience the semantic web inside your web browser. Web Semantics 5(1), 16–27 (2007)
W3C: Gleaning resource descriptions from dialects of languages (GRDDL). W3C rec., W3C (2007)
Pilgrim, M.: Greasemonkey Hacks: Tips & Tools for Remixing the Web with Firefox (Hacks). O’Reilly Media, Inc., Sebastopol (2005)
Tori, A.: Zemanta Service (2008)
Alexander, K., Cyganiak, R., Hausenblas, M., Zhao, J.: voiD Guide - Using the Vocabulary of Interlinked Datasets (2009), http://rdfs.org/ns/void-guide
Dublin Core Metadata Initiative: DCMI Metadata Terms (2006), http://dublincore.org/documents/dcmi-terms
Adrian, B., Hees, J., van Elst, L., Dengel, A.: iDocument: using ontologies for extracting and annotating information from unstructured text. In: Mertsching, B., Hund, M., Aziz, Z. (eds.) KI 2009. LNCS (LNAI), vol. 5803, pp. 249–256. Springer, Heidelberg (2009)
Adrian, B., Dengel, A.: Believing finite-state cascades in knowledge-based information extraction. In: KI. LNCS (LNAI). Springer, Heidelberg (2008)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Adrian, B., Hees, J., Herman, I., Sintek, M., Dengel, A. (2010). Epiphany: Adaptable RDFa Generation Linking the Web of Documents to the Web of Data. In: Cimiano, P., Pinto, H.S. (eds) Knowledge Engineering and Management by the Masses. EKAW 2010. Lecture Notes in Computer Science(), vol 6317. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-16438-5_13
Download citation
DOI: https://doi.org/10.1007/978-3-642-16438-5_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-16437-8
Online ISBN: 978-3-642-16438-5
eBook Packages: Computer ScienceComputer Science (R0)