Abstract
The concept of Linked Data has been an emerging theme within the computing and digital heritage areas in recent years. The growth and scale of Linked Data has underlined the need for greater commonality in concept referencing, to avoid local redefinition and duplication of reference resources. Achieving domain-wide agreement on common vocabularies would be an unreasonable expectation; however, datasets often already have local vocabulary resources defined, and so the prospects for large-scale interoperability can be substantially improved by creating alignment links from these local vocabularies out to common external reference resources. The ARIADNE project is undertaking large-scale integration of archaeology dataset metadata records, to create a cross-searchable research repository resource. Key to enabling this cross search will be the ‘subject’ metadata originating from multiple data providers, containing terms from multiple multilingual controlled vocabularies. This paper discusses various aspects of vocabulary mapping. Experience from the previous SENESCHAL project in the publication of controlled vocabularies as Linked Open Data is discussed, emphasizing the importance of unique URI identifiers for vocabulary concepts. There is a need to align legacy indexing data to the uniquely defined concepts and examples are discussed of SENESCHAL data alignment work. A case study for the ARIADNE project presents work on mapping between vocabularies, based on the Getty Art and Architecture Thesaurus as a central hub and employing an interactive vocabulary mapping tool developed for the project, which generates SKOS mapping relationships in JSON and other formats. The potential use of such vocabulary mappings to assist cross search over archaeological datasets from different countries is illustrated in a pilot experiment. The results demonstrate the enhanced opportunities for interoperability and cross searching that the approach offers.
Similar content being viewed by others
Notes
Semantic Enrichment Enabling Sustainability of Archaeological Links (SENESCHAL) [4] was funded by the UK Arts and Humanities Research Council and coordinated by the Hypermedia Research Group at the University of South Wales (formerly University of Glamorgan). Advanced Research Infrastructure for Archaeological Dataset Networking in Europe (ARIADNE) [5] is an ongoing EC FP7 Project.
Following completion of the project operational management of HeritageData.org and governance of the vocabularies was transferred to EH, RCAHMS, RCAHMW collectively, under the rubric of the FISH Terminology Working Group [19].
It should be noted that the criteria of the “5 star” scheme as described in Fig. 1 do not measure data quality OR quantity, the scheme only grades LOD in terms of data formats used, licensing conditions and the presence of (an unspecified quantity of) external links.
References
EUROPEANA project. http://www.europeana.eu/
Golub, K., Tudhope, D., Zeng, M., Žumer, M.: Terminology registries for knowledge organization systems: functionality, use, and attributes. J. Assoc. Inf. Sci. Technol. 65(9), 1901–1916 (2014)
Zeng, M., Chan, L.: Trends and issues in establishing interoperability among knowledge organization systems. J. Am. Soc. Inf. Sci. Technol. 55(5), 377–395 (2004)
SENESCHAL project: semantic enrichment enabling sustainability of archaeological links. University of South Wales, Hypermedia Research Group. http://hypermedia.research.southwales.ac.uk/kos/seneschal/
ARIADNE FP7 project: advanced research infrastructure for archaeological dataset networking in Europe. http://www.ariadne-infrastructure.eu/
STAR project: semantic technologies for archaeological resources. University of South Wales: Hypermedia Research Group. http://hypermedia.research.southwales.ac.uk/kos/star/
STELLAR project: semantic technologies enhancing links and linked data for archaeological resources. University of South Wales: Hypermedia Research Group. http://hypermedia.research.southwales.ac.uk/kos/STELLAR/
ISO standard 21127:2014—the CIDOC conceptual reference model (CRM). http://www.cidoc-crm.org/
Tudhope, D., May, K., Binding, C., Vlachidis, A.: Connecting archaeological data and grey literature via semantic cross search. Internet archaeology 30 (2011). doi:10.11141/ia.30.5
Binding, C., Tudhope, D.: Terminology web services. Knowl. Organ. 37(4), 287–298 (2010)
Resource description framework (RDF). W3C. http://www.w3.org/RDF/
Archaeology Data Service: Linked Data, http://data.archaeologydataservice.ac.uk/
Binding, C., Charno, M., Jeffrey, S., May, K., Tudhope, D.: Template based semantic integration: from legacy archaeological datasets to linked data. Int. J. Semant. Web Inf. Syst. 11(1) (2015) (in press)
Richards, J.D., Hardman, C.S.: Stepping back from the trench edge: an archaeological perspective on the development of standards for recording and publication. In: Greengrass, M. & Hughes, L. (eds.) The Virtual Representation of the Past. Ashgate, pp. 101–112 (2008). http://eprints.whiterose.ac.uk/7795/
Bizer, C., Heath, T., Berners-Lee, T.: Linked data—the story so far. Int. J. Semant. Web Inf. Syst. 5(3), 1–22 (2009). doi:10.4018/jswis.2009081901
HeritageData.org—documentation of web services and widget controls produced as part of the SENESCHAL project. http://www.heritagedata.org/blog/
Simple Knowledge Organization System (SKOS). W3C. http://www.w3.org/2004/02/skos/
SPARQL 1.1 query language. W3C (2013). [http://www.w3.org/TR/sparql11-query/]
Forum on Information Standards in Heritage (FISH) Terminology Working Group. http://www.heritagedata.org/blog/about-heritage-data/fish/
Berners-Lee, T.: Linked data—design issues (5 star linked data deployment scheme). http://www.w3.org/DesignIssues/LinkedData.html
ISO 25964-2:2013 Information and documentation—thesauri and interoperability with other vocabularies—part 2: interoperability with other vocabularies (2013). http://www.iso.org/iso/home/store/catalogue_tc/catalogue_detail.htm?csnumber=53658
Ontology alignment evaluation initiative. http://oaei.ontologymatching.org
Cuenca Grau, B., Dragisic, Z., Eckert, K., Euzenat, J., Ferrara, A., Granada, R., Ivanova, V., Jiménez-Ruiz, E., Kempf, A.O., Lambrix, P., Nikolov, A., Paulheim, H., Ritze, D., Scharffe, F., Shvaiko, P., Trojahn, C., Zamazal, O.: Results of the ontology alignment evaluation initiative 2013, pp. 29–31 (2013). http://disi.unitn.it/~p2p/OM-2013/oaei13_paper0.pdf
Alexander, K., Cyganiak, R., Hausenblad, M., Zhao, J.: Describing linked datasets with the voID vocabulary. W3C interest group note (2011). http://www.w3.org/TR/void/
Liang, A., Sini, M.: Mapping AGROVOC and the Chinese agricultural thesaurus: definitions, tools, procedures. New Rev. Hypermedia Multimed. 12(1), 51–62 (2006)
References to tools and papers about link generation techniques. http://esw.w3.org/TaskForces/CommunityProjects/LinkingOpenData/EquivalenceMining
OpenRefine data cleansing and transformation tool. http://openrefine.org/
LODRefine (an extension of OpenRefine). https://github.com/sparkica/LODRefine
SAIM instance matching application. http://saim.aksw.org/
LIMES link discovery framework for metric spaces. http://aksw.org/Projects/LIMES.html
Silk link discovery framework. http://wifo5-03.informatik.uni-mannheim.de/bizer/silk/
ARIADNE, D12.2 infrastructure design—annex II—ACDM catalogue model, pp. 47–56 (2015). http://www.ariadne-infrastructure.eu/Resources/D12.2-Infrastructure-Design
Data Catalog Vocabulary (DCAT) http://www.w3.org/TR/vocab/dcat/
Getty vocabularies as linked open data. http://www.getty.edu/research/tools/vocabularies/lod/
SKOS mapping relationships. http://www.w3.org/TR/skos-reference/L4138
Acknowledgments
The SENESCHAL project was supported by the UK Arts and Humanities Research Council [grant number AH/K002112/1]. The ARIADNE project is funded by the European Commission’s 7th Framework Programme (FP7-INFRASTRUCTURES-2012-1-313193). An early version of this work was presented at the 13th European Networked Knowledge Organization Systems (NKOS) Workshop in association with the Digital Libraries 2014 conference. Thanks are due to the Archaeology Data Service for their work with the vocabulary mapping tool reported in this paper. Thanks are also due to ARIADNE and SENESCHAL project partners and the participants of the SENESCHAL workshops.
Author information
Authors and Affiliations
Corresponding author
Appendices
Rights and permissions
About this article
Cite this article
Binding, C., Tudhope, D. Improving interoperability using vocabulary linked data. Int J Digit Libr 17, 5–21 (2016). https://doi.org/10.1007/s00799-015-0166-y
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00799-015-0166-y