Skip to main content

Enriching Archival Linked Data Descriptions with Information from Wikidata and DBpedia

  • Conference paper
  • First Online:
Linking Theory and Practice of Digital Libraries (TPDL 2024)

Abstract

Various sectors within the heritage domain have developed linked data models to describe their cultural artefacts comprehensively. Within the archival domain, ArchOnto, a data model rooted in CIDOC CRM, uses linked data to open archival information to new uses through the prism of linked data. This paper seeks to investigate the potential to use information in archival records in a larger context. It aims to leverage classes and properties sourced from repositories deemed informal due to their crowd-sourcing nature and the possibility of inconsistencies or lack of precision in the data but rich in content, such as the cases of Wikidata and DBpedia. The anticipated outcome is attaining a more comprehensive and expressive archival description, fostering enhanced understanding and assimilation of archival information among domain specialists and lay users. To achieve this, we first analyse existing archive records currently described under the ISAD(G) standard to discern the typologies of entities involved. Subsequently, we map these entities within the ArchOnto ontology and establish correspondences with alternative models. We observed that entities associated with people, places, and events benefited the most from integrating properties sourced from Wikidata and DBpedia. This integration enhanced their comprehensibility and enriched them at a semantic level.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

Notes

  1. 1.

    Ontology available at: https://purl.org/episa/archonto.

  2. 2.

    More information about this transition on the EPISA Project website – https://episa.inesctec.pt/.

  3. 3.

    Ontology available at: https://purl.org/episa/archonto.

  4. 4.

    Website DBpedia - Last consulted on 12/04/2024. Available at: https://www.dbpedia.org/resources/ontology/.

  5. 5.

    Due to space limitations, we only present some examples. The complete identification is available in the Tribunal do Santo Ofício entities file in the dataset [8].

  6. 6.

    Due to the loss of detail in the image, the full image of ArchOnto representation is available in the Inquisição de Coimbra – ArchOnto file in the dataset [8].

  7. 7.

    Due to the loss of detail in the full images, the complete picture of these representations is available in the dataset [8].

  8. 8.

    Due to space limitations, all tables are available in Properties extension file [8].

  9. 9.

    More properties can be consulted in the complete representation of this individual found in the Person Extension file available in the dataset [8].

References

  1. Abián, D., Guerra, F., Martínez-Romanos, J., Trillo-Lado, R.: Wikidata and DBpedia: a comparative study. In: Szymański, J., Velegrakis, Y. (eds.) IKC 2017. LNCS, vol. 10546, pp. 142–154. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-74497-1_14

    Chapter  Google Scholar 

  2. Auer, S., Bizer, C., Kobilarov, G., Lehmann, J., Cyganiak, R., Ives, Z.: DBpedia: a nucleus for a web of open data. In: Aberer, K., et al. (eds.) ASWC/ISWC -2007. LNCS, vol. 4825, pp. 722–735. Springer, Heidelberg (2007). https://doi.org/10.1007/978-3-540-76298-0_52

    Chapter  Google Scholar 

  3. Erxleben, F., Günther, M., Krötzsch, M., Mendez, J., Vrandečić, D.: Introducing Wikidata to the linked data web. In: Mika, P., et al. (eds.) ISWC 2014. LNCS, vol. 8796, pp. 50–65. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-11964-9_4

    Chapter  Google Scholar 

  4. Hellmann, S., Stadler, C., Lehmann, J., Auer, S.: DBpedia live extraction. In: Meersman, R., Dillon, T., Herrero, P. (eds.) OTM 2009. LNCS, vol. 5871, pp. 1209–1223. Springer, Heidelberg (2009). https://doi.org/10.1007/978-3-642-05151-7_33

    Chapter  Google Scholar 

  5. Hiremath, B.K., Kenchakkanavar, A.Y.: An alteration of the web 1.0, web 2.0 and web 3.0: a comparative study. Imperial J. Interdisc. Res. 2(4), 705–710 (2016)

    Google Scholar 

  6. ICOM/CIDOC CRM Special Interest Group: Definition of the CIDOC conceptual reference model, 7.1.3 edition. ICOM (2024). https://www.cidoc-crm.org/sites/default/files/cidoc_crm_version_7.1.3.pdf

  7. Koch, I., Teixeira Lopes, C., Ribeiro, C.: Moving from ISAD(G) to a CIDOC CRM-based linked data model in the Portuguese archives. J. Comput. Cult. Herit. 16(4) (2023). https://doi.org/10.1145/3605910

  8. Koch, I.: Tribunal do Santo Ofício in ArchOnto – extension of archival records through Wikidata and DBpedia properties. Dataset, INESCTEC (2024). https://doi.org/10.25747/SRYA-8115

  9. Lehmann, J., et al.: DBpedia - a large-scale, multilingual knowledge base extracted from Wikipedia. Seman. Web 6(2), 167–195 (2015). https://doi.org/10.3233/SW-140134

    Article  Google Scholar 

  10. Piuri, V., Balas, V.E., Borah, S., Syed Ahmad, S.S. (eds.): Intelligent and Interactive Computing. LNNS, vol. 67. Springer, Singapore (2019). https://doi.org/10.1007/978-981-13-6031-2

    Book  Google Scholar 

Download references

Acknowledgements

Thanks to the Ontology Engineering Group for allowing us to develop this work, especially to María Poveda Villalón and Mariano Rico for their support during the collaboration period and during Inês Koch’s stay in Madrid. Inês Koch is financed by National Funds through the Portuguese funding agency, FCT – Fundação para a Ciência e a Tecnologia, within the research grant 2020.08755.BD - DOI: 10.54499/2020.08755.BD.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Inês Koch .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Koch, I., Ribeiro, C., Poveda-Villalón, M., Rico, M., Teixeira Lopes, C. (2024). Enriching Archival Linked Data Descriptions with Information from Wikidata and DBpedia. In: Antonacopoulos, A., et al. Linking Theory and Practice of Digital Libraries. TPDL 2024. Lecture Notes in Computer Science, vol 15177. Springer, Cham. https://doi.org/10.1007/978-3-031-72437-4_23

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-72437-4_23

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-72436-7

  • Online ISBN: 978-3-031-72437-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics