Skip to main content

Metadata Aggregation via Linked Data: Results of the Europeana Common Culture Project

  • Conference paper
  • First Online:
Metadata and Semantic Research (MTSR 2020)

Abstract

Digital cultural heritage resources are widely available on the web through the digital libraries of heritage institutions. To address the difficulties of discoverability in cultural heritage, the common practice is metadata aggregation, where centralized efforts like Europeana facilitate discoverability by collecting the resources’ metadata. We present the results of the linked data aggregation task conducted within the Europeana Common Culture project, which attempted an innovative approach to aggregation based on linked data made available by cultural heritage institutions. This task ran for one year with participation of twelve organizations, involving the three member roles of the Europeana network: data providers, intermediary aggregators, and the central aggregation hub, Europeana. We report on the challenges that were faced by data providers, the standards and specifications applied, and the resulting aggregated metadata.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

Notes

  1. 1.

    For more information about the role of Schema.org for structured data on the web, please consult: https://schema.org/docs/about.html.

  2. 2.

    For information about the Europeana Common Culture project, consult https://pro.europeana.eu/project/europeana-common-culture.

  3. 3.

    In fact, the related LD work here has not changed since our earlier papers.

  4. 4.

    The maximum depth is a parameter of crawlers that defines when they should stop following newly found links, based on distance from the resource where the crawl started.

  5. 5.

    For details consult the section ‘EDM XML Schema and EDM validation in Oxygen’ at https://pro.europeana.eu/page/edm-documentation.

  6. 6.

    The GitHub repository of the software resulting from this task: https://github.com/netwerk-digitaal-erfgoed/lod-aggregator.

  7. 7.

    The data providers of the LD aggregation task were the following: erfgoedplus.be (Belgium), National Library of Finland, German Digital Library, National Documentation Centre (Greece), DigiPhil (Hungary), CulturaItalia (Italy), National Library of Latvia, National Library of Portugal, National Library of the Netherlands, Nationaal Museum van Wereldculturen (The Netherlands), UMA information technology gmbh (Austria), Swedish National Heritage Board.

  8. 8.

    The LD published by the Nationaal Museum van Wereldculturen is not 100% valid according to the Europeana requirements, conversion into valid EDM was achieved by using the Mapping Server of our LD toolset.

  9. 9.

    The LD published by the Swedish National Heritage Board is publish KSAMSÖK ontology. For the purposes of the pilot, it served as a test for the capabilities of the Mapper Service component to function with other different ontologies. A mapping from KSAMSÖK to EDM was available and it was reimplemented with SPARQL construct queries for execution by the Mapper Service.

References

  1. Lagoze, C., Van de Sompel, H., Nelson, M.L., Warner, S.: The open archives initiative protocol for metadata harvesting, version 2.0. Open Archives Initiative (2002)

    Google Scholar 

  2. Jones, E., Seikel, M. (eds): Linked Data for Cultural Heritage. Facet Publishing, London (2016). ISBN: 9781783301621

    Google Scholar 

  3. Szekely, P., et al.: Connecting the Smithsonian american art museum to the linked data cloud. In: Cimiano, P., Corcho, O., Presutti, V., Hollink, L., Rudolph, S. (eds.) ESWC. LNCS, vol. 7882, pp. 593–607. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-38288-8_40

    Chapter  Google Scholar 

  4. Dragoni, M., Tonelli, S., Moretti, G.: A knowledge management architecture for digital cultural heritage. J. Comput. Cult. Herit. (JOCCH) – Spec. Issue Digit. Infrastruct. Cult. Herit. Part 2 10(3) (2017). https://doi.org/10.1145/3012289

  5. McRoberts, M. (eds): A Guide to the Research & Education Space for Contributors and Developers. BBC (2016). https://bbcarchdev.github.io/inside-acropolis/. Accessed 29 May 2019

  6. Vander Sande, M., Verborgh, R., Hochstenbach, P., Van de Sompel, H.: Towards sustainable publishing and querying of distributed linked data archives. J. Doc. 74(1) (2018). https://doi.org/10.1108/JD-03-2017-0040

  7. Freire, N., Robson, G., Howard, J.B., Manguinhas, H., Isaac, A.: Metadata aggregation: assessing the application of IIIF and sitemaps within cultural heritage. In: Kamps, J., Tsakonas, G., Manolopoulos, Y., Iliadis, L., Karydis, I. (eds.) TPDL. LNCS, vol. 10450, pp. 220–232. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-67008-9_18

    Chapter  Google Scholar 

  8. Freire, N., Meijers, E., Voorburg, R., Cornelissen, R., Isaac, A., de Valk, S.: Aggregation of linked data: a case study in the cultural heritage domain. Information 10(8), 252 (2019). https://doi.org/10.3390/info10080252

  9. Meijers, E., de Valk, S., Helmus, W.: A distributed network of heritage information. White paper 2018 (2018). https://zenodo.org/record/1185401. Accessed 15 June 2019

  10. Google Inc., Yahoo Inc., Microsoft Corporation and Yandex; “About Schema.org” n.d. https://schema.org/docs/about.html. Accessed 29 May 2019

  11. Wallis, R., Isaac, A., Charles, V., Manguinhas, H.: Recommendations for the application of Schema.org to aggregated cultural heritage metadata to increase relevance and visibility to search engines: the case of Europeana. Code4Lib J. (36) (2017). ISSN 1940-5758

    Google Scholar 

  12. Freire, N., Charles, V., Isaac, A.: Evaluation of Schema.org for aggregation of cultural heritage metadata. In: Gangemi, A., et al. (eds.) ESWC. LNCS, vol. 10843, pp. 225–239. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-93417-4_15

    Chapter  Google Scholar 

  13. Europeana Foundation: Definition of the Europeana Data Model v5.2.8 (2017). https://pro.europeana.eu/edm-documentation

  14. Freire, N.: Specifying a linked data dataset for Europeana and aggregators (Version 0.2). Zenodo (2020). https://doi.org/10.5281/zenodo.3817314

  15. Freire, N.: Guidelines for providing and handling Schema.org metadata in compliance with Europeana (Version 0.1). Zenodo (2020). https://doi.org/10.5281/zenodo.3817236

  16. Sauermann, L., Cyganiak, R.: Cool URIs for the Semantic Web. W3C Interest Group Note (2008). https://www.w3.org/TR/cooluris/

  17. Albertoni, R., Browning, D., Cox, S., Beltran, A.G., Perego, A., Winstanley, P.: Data catalog vocabulary (DCAT) - version 2. W3C Recommendation (2020). https://www.w3.org/TR/vocab-dcat-2/

  18. Alexander, K., Cyganiak, R., Hausenblas, M., Zhao, J.: Describing linked datasets with the VoID vocabulary. W3C Interest Group Note (2011). https://www.w3.org/TR/void/

  19. Knublauch, H., Kontokostas, D. (eds.): Shapes Constraint Language (SHACL). W3C Recommendation. W3C (2017). https://www.w3.org/TR/shacl/

  20. Isele, R., Umbrich, J., Bizer, C., Harth, A.: LDSpider: an open-source crawling framework for the web of linked data. In: Proceedings of 9th International Semantic Web Conference (ISWC 2010) Posters and Demos (2010)

    Google Scholar 

  21. Prud'hommeaux, E., Seaborne, A.: SPARQL query language for RDF. W3C Recommendation. W3C (2008). https://www.w3.org/TR/rdf-sparql-query/

Download references

Acknowledgments

We would like to acknowledge the support given by the participants of the pilot: Helena Patrício, Tetiana Kysla, Minna Rönkä, Osma Suominen, Haris Georgiadis, Agathi Papanoti, and Palko Gabor, as well as Erwin Verbruggen for his coordination. We also would like to acknowledge Adina Ciocoiu from the Europeana Foundation for the analysis of data samples collected during the case study.

This work was partially supported by Portuguese national funds through Fundação para a Ciência e a Tecnologia (FCT) with reference UIDB/50021/2020, the European Commission under contract 30-CE-0885387/00-80, and grant agreement INEA/CEF/ICT/A2018/1633581 and by the NDE-program funded by the Dutch Ministry of Education, Culture and Science.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Nuno Freire .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Freire, N., Meijers, E., de Valk, S., Raemy, J.A., Isaac, A. (2021). Metadata Aggregation via Linked Data: Results of the Europeana Common Culture Project. In: Garoufallou, E., Ovalle-Perandones, MA. (eds) Metadata and Semantic Research. MTSR 2020. Communications in Computer and Information Science, vol 1355. Springer, Cham. https://doi.org/10.1007/978-3-030-71903-6_35

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-71903-6_35

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-71902-9

  • Online ISBN: 978-3-030-71903-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics