Skip to main content
Log in

Improving interoperability using vocabulary linked data

  • Published:
International Journal on Digital Libraries Aims and scope Submit manuscript

Abstract

The concept of Linked Data has been an emerging theme within the computing and digital heritage areas in recent years. The growth and scale of Linked Data has underlined the need for greater commonality in concept referencing, to avoid local redefinition and duplication of reference resources. Achieving domain-wide agreement on common vocabularies would be an unreasonable expectation; however, datasets often already have local vocabulary resources defined, and so the prospects for large-scale interoperability can be substantially improved by creating alignment links from these local vocabularies out to common external reference resources. The ARIADNE project is undertaking large-scale integration of archaeology dataset metadata records, to create a cross-searchable research repository resource. Key to enabling this cross search will be the ‘subject’ metadata originating from multiple data providers, containing terms from multiple multilingual controlled vocabularies. This paper discusses various aspects of vocabulary mapping. Experience from the previous SENESCHAL project in the publication of controlled vocabularies as Linked Open Data is discussed, emphasizing the importance of unique URI identifiers for vocabulary concepts. There is a need to align legacy indexing data to the uniquely defined concepts and examples are discussed of SENESCHAL data alignment work. A case study for the ARIADNE project presents work on mapping between vocabularies, based on the Getty Art and Architecture Thesaurus as a central hub and employing an interactive vocabulary mapping tool developed for the project, which generates SKOS mapping relationships in JSON and other formats. The potential use of such vocabulary mappings to assist cross search over archaeological datasets from different countries is illustrated in a pilot experiment. The results demonstrate the enhanced opportunities for interoperability and cross searching that the approach offers.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

Notes

  1. Semantic Enrichment Enabling Sustainability of Archaeological Links (SENESCHAL) [4] was funded by the UK Arts and Humanities Research Council and coordinated by the Hypermedia Research Group at the University of South Wales (formerly University of Glamorgan). Advanced Research Infrastructure for Archaeological Dataset Networking in Europe (ARIADNE) [5] is an ongoing EC FP7 Project.

  2. Following completion of the project operational management of HeritageData.org and governance of the vocabularies was transferred to EH, RCAHMS, RCAHMW collectively, under the rubric of the FISH Terminology Working Group [19].

  3. It should be noted that the criteria of the “5 star” scheme as described in Fig. 1 do not measure data quality OR quantity, the scheme only grades LOD in terms of data formats used, licensing conditions and the presence of (an unspecified quantity of) external links.

References

  1. EUROPEANA project. http://www.europeana.eu/

  2. Golub, K., Tudhope, D., Zeng, M., Žumer, M.: Terminology registries for knowledge organization systems: functionality, use, and attributes. J. Assoc. Inf. Sci. Technol. 65(9), 1901–1916 (2014)

    Article  Google Scholar 

  3. Zeng, M., Chan, L.: Trends and issues in establishing interoperability among knowledge organization systems. J. Am. Soc. Inf. Sci. Technol. 55(5), 377–395 (2004)

    Article  Google Scholar 

  4. SENESCHAL project: semantic enrichment enabling sustainability of archaeological links. University of South Wales, Hypermedia Research Group. http://hypermedia.research.southwales.ac.uk/kos/seneschal/

  5. ARIADNE FP7 project: advanced research infrastructure for archaeological dataset networking in Europe. http://www.ariadne-infrastructure.eu/

  6. STAR project: semantic technologies for archaeological resources. University of South Wales: Hypermedia Research Group. http://hypermedia.research.southwales.ac.uk/kos/star/

  7. STELLAR project: semantic technologies enhancing links and linked data for archaeological resources. University of South Wales: Hypermedia Research Group. http://hypermedia.research.southwales.ac.uk/kos/STELLAR/

  8. ISO standard 21127:2014—the CIDOC conceptual reference model (CRM). http://www.cidoc-crm.org/

  9. Tudhope, D., May, K., Binding, C., Vlachidis, A.: Connecting archaeological data and grey literature via semantic cross search. Internet archaeology 30 (2011). doi:10.11141/ia.30.5

  10. Binding, C., Tudhope, D.: Terminology web services. Knowl. Organ. 37(4), 287–298 (2010)

    Google Scholar 

  11. Resource description framework (RDF). W3C. http://www.w3.org/RDF/

  12. Archaeology Data Service: Linked Data, http://data.archaeologydataservice.ac.uk/

  13. Binding, C., Charno, M., Jeffrey, S., May, K., Tudhope, D.: Template based semantic integration: from legacy archaeological datasets to linked data. Int. J. Semant. Web Inf. Syst. 11(1) (2015) (in press)

  14. Richards, J.D., Hardman, C.S.: Stepping back from the trench edge: an archaeological perspective on the development of standards for recording and publication. In: Greengrass, M. & Hughes, L. (eds.) The Virtual Representation of the Past. Ashgate, pp. 101–112 (2008). http://eprints.whiterose.ac.uk/7795/

  15. Bizer, C., Heath, T., Berners-Lee, T.: Linked data—the story so far. Int. J. Semant. Web Inf. Syst. 5(3), 1–22 (2009). doi:10.4018/jswis.2009081901

    Article  Google Scholar 

  16. HeritageData.org—documentation of web services and widget controls produced as part of the SENESCHAL project. http://www.heritagedata.org/blog/

  17. Simple Knowledge Organization System (SKOS). W3C. http://www.w3.org/2004/02/skos/

  18. SPARQL 1.1 query language. W3C (2013). [http://www.w3.org/TR/sparql11-query/]

  19. Forum on Information Standards in Heritage (FISH) Terminology Working Group. http://www.heritagedata.org/blog/about-heritage-data/fish/

  20. Berners-Lee, T.: Linked data—design issues (5 star linked data deployment scheme). http://www.w3.org/DesignIssues/LinkedData.html

  21. ISO 25964-2:2013 Information and documentation—thesauri and interoperability with other vocabularies—part 2: interoperability with other vocabularies (2013). http://www.iso.org/iso/home/store/catalogue_tc/catalogue_detail.htm?csnumber=53658

  22. Ontology alignment evaluation initiative. http://oaei.ontologymatching.org

  23. Cuenca Grau, B., Dragisic, Z., Eckert, K., Euzenat, J., Ferrara, A., Granada, R., Ivanova, V., Jiménez-Ruiz, E., Kempf, A.O., Lambrix, P., Nikolov, A., Paulheim, H., Ritze, D., Scharffe, F., Shvaiko, P., Trojahn, C., Zamazal, O.: Results of the ontology alignment evaluation initiative 2013, pp. 29–31 (2013). http://disi.unitn.it/~p2p/OM-2013/oaei13_paper0.pdf

  24. Alexander, K., Cyganiak, R., Hausenblad, M., Zhao, J.: Describing linked datasets with the voID vocabulary. W3C interest group note (2011). http://www.w3.org/TR/void/

  25. Liang, A., Sini, M.: Mapping AGROVOC and the Chinese agricultural thesaurus: definitions, tools, procedures. New Rev. Hypermedia Multimed. 12(1), 51–62 (2006)

    Article  Google Scholar 

  26. References to tools and papers about link generation techniques. http://esw.w3.org/TaskForces/CommunityProjects/LinkingOpenData/EquivalenceMining

  27. OpenRefine data cleansing and transformation tool. http://openrefine.org/

  28. LODRefine (an extension of OpenRefine). https://github.com/sparkica/LODRefine

  29. SAIM instance matching application. http://saim.aksw.org/

  30. LIMES link discovery framework for metric spaces. http://aksw.org/Projects/LIMES.html

  31. Silk link discovery framework. http://wifo5-03.informatik.uni-mannheim.de/bizer/silk/

  32. ARIADNE, D12.2 infrastructure design—annex II—ACDM catalogue model, pp. 47–56 (2015). http://www.ariadne-infrastructure.eu/Resources/D12.2-Infrastructure-Design

  33. Data Catalog Vocabulary (DCAT) http://www.w3.org/TR/vocab/dcat/

  34. Getty vocabularies as linked open data. http://www.getty.edu/research/tools/vocabularies/lod/

  35. SKOS mapping relationships. http://www.w3.org/TR/skos-reference/L4138

Download references

Acknowledgments

The SENESCHAL project was supported by the UK Arts and Humanities Research Council [grant number AH/K002112/1]. The ARIADNE project is funded by the European Commission’s 7th Framework Programme (FP7-INFRASTRUCTURES-2012-1-313193). An early version of this work was presented at the 13th European Networked Knowledge Organization Systems (NKOS) Workshop in association with the Digital Libraries 2014 conference. Thanks are due to the Archaeology Data Service for their work with the vocabulary mapping tool reported in this paper. Thanks are also due to ARIADNE and SENESCHAL project partners and the participants of the SENESCHAL workshops.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ceri Binding.

Appendices

Appendix 1

Extracts of the concept mappings used for the exploratory case study described in Sect. 4 (expressed in TURTLE RDF format):

figure a
figure b

Appendix 2

Example concept mappings produced by ADS in the vocabulary matching exercise as described in Sect. 5 (Vocabulary Matching Tool output in JSON format):

figure c

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Binding, C., Tudhope, D. Improving interoperability using vocabulary linked data. Int J Digit Libr 17, 5–21 (2016). https://doi.org/10.1007/s00799-015-0166-y

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00799-015-0166-y

Keywords

Navigation