Skip to main content
Log in

Extraktion, Mapping und Verlinkung von Daten im Web

Phasen im Lebenszyklus von Linked Data

  • Schwerpunktbeitrag
  • Published:
Datenbank-Spektrum Aims and scope Submit manuscript

Zusammenfassung

In diesem Artikel geben wir einen Überblick über verschiedene Herausforderungen des Managements von Linked Data im Web. Mit der DBpedia Wissensextraktion aus Wikipedia, dem skalierbaren Linking von Wissensbasen und dem Mapping relationaler Daten nach RDF stellen wir drei Ansätze vor, die zentrale Phasen des Lebenszyklus von Daten im Web ausmachen.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Abb. 1
Abb. 2
Abb. 3
Abb. 4
Abb. 5
Abb. 6
Abb. 7
Abb. 8
Abb. 9
Listing 1
Listing 2
Listing 3

Notes

  1. http://www.w3.org/2001/sw/rdb2rdf/.

  2. Siehe http://www.alexa.com/topsites.

  3. http://en.wikipedia.org/wiki/Special:Statistics.

  4. Für wichtige Abfragen erstellen Wikipedia-Nutzer eigene Listen, aber das deckt nur populäre Abfragen ab und solche Listen müssen manuell verwaltet werden.

  5. Siehe http://mappings.dbpedia.org.

  6. http://www.openstreetmap.org/.

  7. http://musicbrainz.org/.

  8. http://www.w3.org/TR/r2rml/.

  9. http://virtuoso.openlinksw.com/whitepapers/relational%20rdf%20views%20mapping.html.

  10. http://sparqlify.org.

  11. http://linkedgeodata.org.

  12. http://openstreetmap.org

  13. http://planet.openstreetmap.org.

  14. http://wiki.openstreetmap.org/wiki/Osmosis.

  15. http://postgis.net/.

  16. http://www.postgresql.org/.

  17. http://wiki.postgresql.org/wiki/Cross_Columns_Stats.

Literatur

  1. Angles R, Gutierrez C (2008) The expressive power of SPARQL. In: Sheth AP et al. (Hrsg) Proc 7th international semantic web conference. Lecture notes in computer science, Bd 5318. Springer, Berlin, S 114–129

    Google Scholar 

  2. Auer S, Bizer C, Kobilarov G, Lehmann J, Cyganiak R, Ives Z (2007) DBpedia: a nucleus for a web of open data. In: Aberer K et al. (Hrsg) Proc 6th international semantic web conference. Lecture notes in computer science, Bd 4825. Springer, Berlin, S 722–735

    Google Scholar 

  3. Auer S, Dietzold S, Lehmann J, Hellmann S, Aumueller D (2009) Triplify: light-weight linked data publication from relational databases. In: Quemada J et al. (Hrsg) Proc 18th international conference on world wide web. ACM, New York, S 621–630

    Chapter  Google Scholar 

  4. Auer S, Lehmann J (2007) What have innsbruck and Leipzig in common? Extracting semantics from wiki content. In: Proc 4th European semantic web conference, S 503–517

    Google Scholar 

  5. Bizer C, Cyganiak R (2006) D2r server—publishing relational databases on the semantic web. Poster at the 5th international semantic web conference. http://www4.wiwiss.fu-berlin.de/bizer/pub/Bizer-Cyganiak-D2R-Server-ISWC2006.pdf (15.01.2013)

  6. Chong EI, Das S, Eadon G, Srinivasan J (2005) An efficient SQL-based rdf querying scheme. In: Böhm K et al. (Hrsg) Proc 31st international conference on very large data bases. ACM, New York, S 1216–1227

    Google Scholar 

  7. Fensel D, van Harmelen F, Andersson B, (2008) Towards LarKC: a platform for web-scale reasoning. In: Proc 2nd IEEE international conference on semantic computing. IEEE Comp Soc, Los Alamitos, S 524–529

    Google Scholar 

  8. Goel K, Guha RV, Hansson O (2009) Introducing rich snippets. Google webmaster central blog. http://googlewebmastercentral.blogspot.com/2009/05/introducing-rich-snippets.html. (15.01.2013)

  9. Hahn R, Bizer C, Sahnwaldt C, Herta C, Robinson S, Bürgle M, Düwiger H, Scheel U (2010) Faceted wikipedia search. In: Abramowicz W, Tolksdorf R (Hrsg) Proc 13th international conference business information systems. Lecture notes in business information processing, Bd 47. Springer, Berlin, S 1–11

    Chapter  Google Scholar 

  10. Heim P, Ertl T, Ziegler J (2010) Facet graphs: complex semantic querying made easy. In: Aroyo L et al. (Hrsg) Proc 7th extended semantic web conference. Lecture notes in computer science, Bd 6088. Springer, Berlin, S 288–302

    Google Scholar 

  11. Heim P, Hellmann S, Lehmann J, Lohmann S, Stegemann T (2009) RelFinder: revealing relationships in RDF knowledge bases. In: Chua TS et al. (Hrsg) Proc 3rd international conference on semantic and media technologies. Lecture notes in computer science, Bd 5887. Springer, Berlin, S 182–187

    Google Scholar 

  12. Hellmann S, Lehmann J, Auer S (2012) Linked-data aware uri schemes for referencing text fragments. In: ten Teije A et al. (Hrsg) Proc 18th international conference on knowledge engineering and knowledge management. Lecture notes in computer science, Bd 7603. Springer, Berlin

    Google Scholar 

  13. Isele R, Bizer C (2011) Learning linkage rules using genetic programming. In: Shvaiko P et al. (Hrsg) Proc 6th international ontology matching workshop, CEUR workshop proceedings, Bd 814, S 13–24, CEUR-WS.org,

    Google Scholar 

  14. Isele R, Jentzsch A, Bizer C (2011) Efficient multidimensional blocking for link discovery without losing recall. In: Marian A, Vassalos V (Hrsg) Proc 14th international workshop on the web and databases

    Google Scholar 

  15. Kobilarov G, Scott T, Raimond Y, Oliver S, Sizemore C, Smethurst M, Bizer C, Lee R (2009) Media meets semantic web—how the bbc uses dbpedia and linked data to make connections. In: Aroyo L et al. (Hrsg) Proc 6th European semantic web conference. Lecture notes in computer science, Bd 5554. Springer, Berlin, S 723–737

    Google Scholar 

  16. Lehmann J (2009) DL-learner: learning concepts in description logics. J Mach Learn Res 10:2639–2642

    MathSciNet  MATH  Google Scholar 

  17. Lehmann J, Bizer C, Kobilarov G, Auer S, Becker C, Cyganiak R, Hellmann S (2009) DBpedia—a crystallization point for the web of data. J Web Semant 7(3):154–165

    Article  Google Scholar 

  18. Lehmann J, Bühmann L (2010) Ore—a tool for repairing and enriching knowledge bases. In: Patel-Schneider PF et al. (Hrsg) Proc 9th international semantic web conference. Lecture notes in computer science, Bd 6496. Springer, Berlin

    Google Scholar 

  19. Leser U, Naumann F (2007) Informationsintegration – Architekturen und Methoden zur Integration verteilter und heterogener Datenquellen. dpunkt.verlag, Heidelberg

    MATH  Google Scholar 

  20. Mika P (2009) Year of the monkey: lessons from the first year of searchmonkey. In: GI jahrestagung. LNI, Bd 154, S 387, GI

    Google Scholar 

  21. Morsey M, Lehmann J, Auer S, Stadler C, Hellmann S (2012) DBpedia and the live extraction of structured data from wikipedia. In: Program: Electronic library and information systems, Bd 46, S 27

    Google Scholar 

  22. Ngonga Ngomo AC Learning conformation rules for linked data integration. In: Franconi E et al (Hrsg) Proc 7th international workshop on ontology matching

  23. Ngonga Ngomo AC (2012) Link discovery with guaranteed reduction ratio in affine spaces with minkowski measures. In: Proc 11th international semantic web conference

    Google Scholar 

  24. Ngonga Ngomo AC (2012) On link discovery using a hybrid approach. J Data Semant 1:203–217

    Article  Google Scholar 

  25. Ngonga Ngomo AC, Auer S (2011) Limes—a time-efficient approach for large-scale link discovery on the web of data. In: Walsh T (Hrsg) Proc 22nd international joint conference on artificial intelligence. IJCAI/AAAI

    Google Scholar 

  26. Ngonga Ngomo AC, Heino N, Lyko K, Speck R, Kaltenböck M (2011) Scms—semantifying content management systems. In: Aroyo L et al. (Hrsg) Proc 10th international semantic web conference. Lecture notes in computer science, Bd 7031. Springer, Berlin

    Google Scholar 

  27. Ngonga Ngomo AC, Lyko K (2012) Eagle: efficient active learning of link specifications using genetic programming. In: Cudré-Mauroux P et al. (Hrsg) Proc 9th extended semantic web conference. Lecture notes in computer science, Bd 7649. Springer, Berlin

    Google Scholar 

  28. Nikolov A, D’Aquin M, Motta E (2012) Unsupervised learning of data linking configuration. In: Simperl E et al. (Hrsg) Proc 9th extended semantic web conference. Lecture notes in computer science, Bd 7295. Springer, Berlin, S 119–133

    Google Scholar 

  29. Rahm E, Thor A, Aumueller D, Do HH, Golovin N, Kirsten T (2005) Ifuice—information fusion utilizing instance correspondences and peer mappings. In: Doan A et al. (Hrsg) Proc 8th international workshop on the web and databases, S 7–12

    Google Scholar 

  30. Schwarte A, Haase P, Hose K, Schenkel R, Schmidt M (2010) Fedx: optimization techniques for federated query processing on linked data. In: Patel-Schneider PF et al. (Hrsg) Proc 10th international semantic web conference. Lecture notes in computer science, Bd 6496. Springer, Berlin, S 601–616

    Google Scholar 

  31. Sequeda JF, Miranker DP Ultrawrap: SPARQL Execution on Relational Data. Poster at the 10th international semantic web conference (2011). https://files.ifi.uzh.ch/ddis/iswc_archive/iswc/ab/2011pre/iswc2011.semanticweb.org/fileadmin/iswc/Papers/PostersDemos/iswc11pd_submission_94.pdf. (15.01.2013)

  32. Sertkaya B, OntocomP A protégé plugin for completing OWL ontologies. In: Proc 6th European semantic web conference

  33. Shekarpour S, Auer S, Ngonga Ngomo AC (2013) Question answering on interlinked data. In: Proc 22nd international conference on world wide web

    Google Scholar 

  34. Soru T, Ngonga Ngomo AC (2012) Active learning of domain-specic distances for link discovery. In: Hideaki T et al. (Hrsg) Proc 2nd joint international semantic technology conference. Lecture notes in computer science, Bd 7774. Springer, Berlin

    Google Scholar 

  35. Sundara S, Atre M, Kolovski V, Das S, Wu Z, Chong EI, Srinivasan J (2010) Visualizing large-scale rdf data using subsets, summaries, and sampling in oracle. In: Li F et al. (Hrsg) Proc 26th international conference on data engineering. IEEE Press, New York, S 1048–1059

    Google Scholar 

  36. Unbehauen J, Stadler C, Auer S (2012) Accessing relational data on the web with sparqlmap. In: Hideaki T et al. (Hrsg) Proc 2nd joint international semantic technology conference. Lecture notes in computer science, Bd 7774. Springer, Berlin

    Google Scholar 

  37. Auer S, Bühmann L, Lehmann J, Hausenblas M, Tramp S, van Nuffelen B, Mendes P, Dirschl C, Isele R, Williams H, Erling O (2012) Managing the life-cycle of linked data with the lod2 stack. In: Proc. 11th International Semantic Web Conference

    Google Scholar 

Download references

Danksagung

Wir danken den Mitgliedern der Arbeitsgruppe AKSW und den Projektpartnern der EU RP7 Projekte LOD2 (GA no. 257943), GeoKnow (GA no. 318159) und BIG (GA no. 318062), welche die in diesem Artikel vorgestellten Arbeiten unterstützt haben.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sören Auer.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Auer, S., Lehmann, J., Ngonga Ngomo, AC. et al. Extraktion, Mapping und Verlinkung von Daten im Web. Datenbank Spektrum 13, 77–87 (2013). https://doi.org/10.1007/s13222-013-0124-z

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s13222-013-0124-z

Schlüsselwörter

Navigation