Elsevier

Future Generation Computer Systems

Volume 55, February 2016, Pages 129-146
Future Generation Computer Systems

Digital library interoperability at high level of abstraction

https://doi.org/10.1016/j.future.2015.09.020Get rights and content

Highlights

  • Foundational models for digital libraries.

  • Interoperability among digital library systems and possible approaches.

  • Unifying semantic model and ontology for high-level interoperability among digital library systems.

  • In depth analysis of the user, content, functionality and quality domains in digital libraries.

  • Concrete use case on annotation of illuminated manuscripts to illustrate how to apply the proposed methodology.

Abstract

Digital Library (DL) are the main conduits for accessing our cultural heritage and they have to address the requirements and needs of very diverse memory institutions, namely Libraries, Archives and Museums (LAM). Therefore, the interoperability among the Digital Library System (DLS) which manage the digital resources of these institutions is a key concern in the field.

DLS are rooted in two foundational models of what a digital library is and how it should work, namely the DELOS Reference Model and the Streams, Structures, Spaces, Scenarios, Societies (5S) model. Unfortunately these two models are not exploited enough to improve interoperability among systems.

To this end, we express these foundational models by means of ontologies which exploit the methods and technologies of Semantic Web and Linked Data. Moreover, we link the proposed ontologies for the foundational models to those currently used for publishing cultural heritage data in order to maximize interoperability.

We design an ontology which allows us to model and map the high level concepts of both the 5S model and the DELOS Reference Model. We provide detailed ontologies for all the domains of such models, namely the user, content, functionality, quality, policy and architectural component domains in order to make available a working tool for making DLS interoperate together at a high level of abstraction. Finally, we provide a concrete use case about digital annotation of illuminated manuscripts to show how to apply the proposed ontologies and illustrate the achieved interoperability between the 5S and DELOS Reference models.

Introduction

Digital Library (DL) have been steadily progressing since the early 1990s and they now determine how citizens and organizations study, learn, access and interact with their cultural heritage  [1], [2], [3], [4], [5], [6], [7], [8]. Despite their name, DL are not only the digital counter-part of traditional libraries but they are also concerned with other kinds of cultural heritage institutions, such as archives and museums, that is institutions typically referred to as Libraries, Archives and Museums (LAM). In the context of LAM, unifying a variety of organizational settings and providing more integrated access to their contents are aspects of utmost importance. Although the type of materials may differ and professional practices vary, LAM share an overlapping set of functions and fulfilling them in “collaboration rather than isolation creates a win–win for users and institutions”  [9].

These compelling integration and collaboration needs have propelled the evolution of Digital Library System (DLS)   [10] as systems that permit us to design and implement the overlapping set of functions of LAM.

In the 1990s, DLS were monolithic systems, each one built for a specific kind of information resource–e.g. text, images, or videos–and with very specialized functionalities developed ​ad hoc for those contents and their reference users. This approach caused a flourishing of systems where the very same functionalities, e.g. user management or repositories, were developed and re-developed from scratch many times, causing them to be different and often incompatible one with each other. From the mid 2000s, DLS evolved towards service-oriented architectures, where components can be plugged into each other to provide the desired end-user functionalities, yet requiring careful and ad hoc configuration. This paradigm shift allowed DLS to become more and more user-centered systems, where the original content management functionality was partnered with new communication and cooperation functionalities such as user annotation  [11], with the ultimate goal of acting as “a common vehicle by which everyone can access, discuss, evaluate and enhance information of all forms”  [12]. As a consequence, DLS started to embody the above vision for LAM since they were no longer isolated systems but, on the contrary, they needed to cooperate together in order to improve the user experience in accessing information and to seamlessly integrate information resources of different cultural heritage institutions.

This evolution has been favored by the development of two foundational models of what DL are, namely the Streams, Structures, Spaces, Scenarios, Societies (5S) model  [13] and the DELOS Reference Model   [14], which made it clear what kind of entities should be involved in a DL, what their functionalities should be and how DLS components should behave, and fostered the design and development of operational DLS complying with them.

However, these two models are quite abstract and, still providing a unifying vision of what a DL is, they allow for very different choices when it comes to develop actual DLS. This has led to the growth of “ecosystems” where services and components may be able, at best, to interoperate together within the boundaries of DLS that have been inspired by just one of the two models for DL. However, there are no running examples of two DLS, one implementing the 5S model and the other the DELOS Reference Model, which are able to interoperate. Therefore, interoperability still represents one of the biggest challenges in the DL field  [10], [15].

In this work, we address a still open issue in the DL realm: to make DL foundational models interoperable in order to derive all the other kinds of interoperability, in particular, interoperability between operational DLS. The main contributions of the paper are:

  • a detailed analysis of the 5S and DELOS Reference models pointing out common aspects and main differences;

  • the definition of a common ontology which encompasses and links the concepts of the DELOS Reference Model and the 5S models, covering all the domains of such models: the user, content, functionality, quality, policy and architectural component;

  • a concrete use case about digital annotation of illuminated manuscripts to show how to apply the proposed ontologies and illustrate the achieved interoperability between the 5S and DELOS Reference models.

The paper is organized as follows: Section  2 illustrates the rationale of the paper; Section  3 reports on some pertinent related works; Section  4 introduces the relevant aspects of the 5S Model and of the DELOS Reference Model; Section  5 presents the semantic mapping between the 5S Model and the DELOS Reference Model associating the high level concepts of one model to those of the other; Section  6 to 11 respectively show the correspondences between the notions and domains of user, content, functionality, quality, policy and architectural component in the two models; Section  12 presents a relevant case of application of the proposed approach where users who interact with two DLS, that manage illuminated manuscripts, are interested in annotating their contents to perform activities of their interest. Section  13 sums up the results presented in the paper.

Section snippets

Rationale

The current mainstream approach to bridge the interoperability gap between DLS and to provide comprehensive solutions able to embrace the full spectrum of LAM is to exploit semantic Web technologies and linked (open) data  [16], [17]. This allows for describing entities and information resources in a common way which enables their exchange, as for example happens in the case of library linked data  [18].

This approach is both “external” and “bottom-up”. It is “external” since it assumes that

Related work

As anticipated in the previous sections, most of the work that has been conducted to address the problem of interoperability in DL is related to the representation and exchange of data managed by a DLS, typically via Linked Open Data (LOD)   [17], i.e. it is mostly concerned with the data logic, as represented in Fig. 1. This explains the need to develop many ontologies and metadata schemes for this purpose, and the number of ontologies and metadata schemes is continuously growing along with

Foundational models for digital libraries

The 5S  [8], [13], [32] is a formal model which draws upon the broad digital library literature to produce a comprehensive base of support. It was developed largely bottom up, starting with key definitions and elucidation of digital library concepts from a minimalist approach.

The DELOS Reference Model  [14] is a high-level conceptual framework that aims at capturing significant entities and their relationships within the digital library universe with the goal of developing a more robust model

Semantic mapping between the 5S model and the DELOS reference model

In Fig. 4 we present the Resource Description Framework (RDF) graph of the unifying data model relating the DELOS Reference Model to the 5S model by means of a mapping between their most relevant high-level concepts. The presented RDF graph is composed by classes represented as circles and properties represented as directed edges between the classes. In this section we report the main concepts composing the two models and their mapping one into the other, whereas in the following sections we

The user domain

Users represent all the entities external to a DL which interact with it and are of foremost importance for a DL.

The DELOS Reference Model has a dedicated domain for users which comprehends both “humans and inanimate entities, such as software programs or physical instruments”  [14]; in the 5S model, users are modeled within the “Societies” abstraction which includes “humans as well as hardware and software components, which either use or support digital library services”  [13].

The RDF mapping

The content domain

The content domain represents all the entities that a DL has to manage to satisfy users’ information needs. In Fig. 6 we can see the RDF graph representing the content domain of the DELOS Reference Model and of the 5S model. In DELOS the central class is Information Object which is a resource and represents a unit of information within a DL that includes text documents, images, videos, sound documents and data sets. This class is related to the external class cidoc: Information Object of the

The functionality domain

The functionality domain represents one of the richest domains in the DL universe and it captures all the processing that can occur on resources and activities in a DL. In Fig. 7 we report the RDF graph of the functionality domain for the DELOS Reference Model and the 5S model.

The DELOS Reference Model shapes this domain around the Function class defined as a particular act or operation that can be performed on a Resource (a FunctionactOn a Resource) by an Actor (an Actorperforms a Function).

The quality domain

Quality is a fundamental aspect in DL  [32], [36], [37], [40], [41], which is often related to and affected by the interoperability and integration among DL systems  [15], [42], [43].

The quality domain in the DELOS Reference Model takes into account the general definition of quality provided by the International Organization for Standardization (ISO) which defines quality as “the degree to which a set of inherent characteristics fulfills requirements”  [44], where requirements are needs or

The policy domain

The policy domain represents a set of conditions, rules or regulations governing a DL and it is a domain “very broad and dynamic by nature”  [14]. The domain is modeled by DELOS model, whereas it is not explicitly handled by the 5S. Thus, for this domain it is not possible to provide a straightforward mapping between the two models, but we can use the common ontology we defined to enrich the 5S with some concepts of DELOS and at the same time use the policy domain as a bridge to further connect

The architectural component domain

The architectural component domain regards concepts and relationships characterizing a DLS and a DLMS in the DELOS reference model. In general, this domain aims at modeling the components of the software systems employed in a DL universe and their interactions. In the DELOS reference model an architectural component is connected to a functionality meaning that a specific function, e.g. searching, is realized by a specific architectural component, i.e. search engine, in a given DLS. On the other

Use case

Let us consider the case of two different DLS managing cultural heritage digital resources and providing annotation services to their users.

The first system, called DLS1, is the DelosDLMS  [49], [50], which implements the DELOS Reference Model, offers annotation functionalities through the Flexible Annotation Semantic Tool (FAST) service and exploits them to improve search and retrieval of annotated resources  [51], [52], [53]. The second system, DLS2, is an implementation of the 5S model.

Conclusions

In this paper we address, for the first time, the need for interoperability among DLS at a high level of abstraction and we show how this is achieved by a semantically-enabled representation of foundational DL models. The ultimate goal is to promote and facilitate a better convergence and integration in the context of Libraries, Archives and Museums (LAM) by lowering the barriers between them.

We show: (i) how the two state-of-the-art foundational models, namely the DELOS Reference Model and the

Maristella Agosti is a Full Professor in computer science, with a main focus on databases, digital libraries and information retrieval, at the Department of Information Engineering of the University of Padua, Italy. She is one of the founding members and the coordinator of the Information Management Systems (IMS) research group of the department. She coordinates and has coordinated a number of national and international research projects, and she has been the organizer of national and

References (56)

  • M.A. Gonçalves et al.

    What is a good digital library? A quality model for digital libraries

    Inf. Process. Manage.

    (2007)
  • B.L. Moreira et al.

    Automatic evaluation of digital libraries with 5SQual

    J. Informetr.

    (2009)
  • E.A. Fox et al.

    Users, user interfaces, and objects: Envision, a digital library

    J. Am. Soc. Inf. Sci. (JASIS)

    (1993)
  • E.A. Fox et al.

    Digital Libraries

    Commun. ACM (CACM)

    (1995)
  • G. Marchionini et al.

    The roles of digital libraries in teaching and learning

    Commun. ACM (CACM)

    (1995)
  • M. Lesk

    Practical Digital Libraries. Books, Bytes & Bucks

    (1997)
  • C.L. Borgman

    What are digital libraries? Competing visions

    Inf. Process. Manage.

    (1999)
  • C.L. Borgman
  • I.H. Witten et al.

    How to Build a Digital Library

    (2003)
  • E.A. Fox et al.

    Theoretical Foundations for Digital Libraries: The 5S (Societies, Scenarios, Spaces, Structures, Streams) Approach

    (2012)
  • D.M. Zorich, G. Waibel, R. Erway, Beyond the Silos of the LAMs. Collaboration Among Libraries, Archives and Museums,...
  • M. Agosti

    Digital libraries

  • M. Agosti et al.

    A formal model of annotations of digital content

    ACM Trans. Inf. Syst. (TOIS)

    (2008)
  • Y. Ioannidis et al.

    Digital library information-technology infrastructures

    Int. J. Digit. Libr.

    (2005)
  • M.A. Gonçalves et al.

    Streams, structures, spaces, scenarios, societies (5S): A formal model for digital libraries

    ACM Trans. Inf. Syst. (TOIS)

    (2004)
  • L. Candela, D. Castelli, N. Ferro, Y. Ioannidis, G. Koutrika, C. Meghini, P. Pagano, S. Ross, D. Soergel, M. Agosti, M....
  • N. Ferro

    Quality and interoperability: The quest for the optimal balance

  • T. Heath et al.

    Linked Data: Evolving the Web into a Global Data Space

    (2011)
  • E. Hyvönen

    Publishing and Using Cultural Heritage Linked Data on the Semantic Web

    (2012)
  • T. Baker, E. Bermès, K. Coyle, G. Dunsire, A. Isaac, P. Murray, M. Panzer, J. Schneider, R. Singer, E. Summers, W....
  • ISO 15836:2009, Information and documentation—the dublin core metadata element set, Recommendation ISO 15836:2009,...
  • Society of American Archivists, Encoded archival description: Tag library, ver. 2002, Society of American Archivists,...
  • E. Coburn, R. Light, G. McKenna, R. Stein, A. Vitzthum, LIDO—lightweight information describing objects—version 1.0,...
  • C. Lagoze, H. Van De Sompel, P. Johnston, M. Nelson, R. Sanderson, S. Warner, ORE specification—abstract data...
  • ISO 21127, Information and documentation—a reference ontology for the interchange of cultural heritage information,...
  • M. Doerr, S. Gradmann, S. Hennicke, A. Isaac, C. Meghini, H. Van de Sompel, The Europeana data model (EDM), in: IFLA...
  • Europeana, Europeana data model primer, October 2011....
  • Europeana, Definition of the Europeana data model elements—version 5.2.3, 24/02/2012, February 2012....
  • Cited by (21)

    • Information Resource, Interface, and Tasks as User Interaction Components for Digital Library Evaluation

      2019, Information Processing and Management
      Citation Excerpt :

      The quality model is helpful to develop quantitative assessment program for digital libraries; for example, they designed a 5SQual toolkit that can perform automatically quantitative evaluation of DLs (Shen, Gonçalves, & Fox, 2013). In addition, Agosti, Ferro, and Silvello (2016) also designed an ontology that models and maps the high level concepts of the 5S model and the DELOS Reference Model. These theoretical models of DLs describe the main components of DLs and present associated measures for DL evaluation regarding each component.

    • Establishing Core Concepts for Information-Powered Collaborations

      2018, Future Generation Computer Systems
      Citation Excerpt :

      Recent developments demonstrated the feasibility of aggregating cross-cutting resources to offer VREs as a Service in order to maximise the adoption and productivity in multidisciplinary contexts [35]. Similarly, Virtual Laboratories (VLs), Science Gateways (SGs), Virtual Organisations (VOs) and Digital Libraries (DLs) provide the necessary tools and interoperability to enable interactions and foster seamless access, usage and sharing of resources across diverse stakeholders [36,37]. There is a substantial interest in the scientific community in VREs (VLs, SGs, VOs and DLs) that yields a flourishing scientific literature and many initiatives and research projects.

    • Semantic interoperability of cultural heritage: a systematic review

      2023, Revista Ibero-Americana de Ciencia da Informacao
    • Quality Assessment of Library Linked Data: a Case Study

      2021, Communications in Computer and Information Science
    View all citing articles on Scopus

    Maristella Agosti is a Full Professor in computer science, with a main focus on databases, digital libraries and information retrieval, at the Department of Information Engineering of the University of Padua, Italy. She is one of the founding members and the coordinator of the Information Management Systems (IMS) research group of the department. She coordinates and has coordinated a number of national and international research projects, and she has been the organizer of national and international conferences. From 2011: Leader of the unit of the Department in the European project on CULTivating Understanding and Research through Adaptivity—CULTURA. From 2010: active research member of the European network of excellence PROMISE—Participative Research labOratory for Multimedia and Multilingual Information Systems Evaluation. Chair from 2009 to 2012 of the Steering Committee of the International Conference on Theory and Practice of Digital Libraries (TPDL). Member of the Editorial Board of the International Journal on Digital Libraries (Springer-Verlag). Member of the Board of the Italian association for digital humanities and digital culture. Her actual interests are evaluation of digital libraries and archives, digital library systems, digital humanities, evaluation infrastructures, retrieval of information in cultural heritage, information access through search engines and digital libraries, user’s interaction with digital cultural heritage collections.

    Nicola Ferro is an Associate Professor at the Department of Information Engineering of the University of Padua. His research interests include information retrieval, its experimental evaluation, multilingual information access and digital libraries. He is the coordinator of the CLEF evaluation initiative, which involves more than 200 research groups world-wide in large-scale IR evaluation activities. He has published more than 200 papers on information retrieval and digital libraries. More about his research can be found at http://www.dei.unipd.it/~ferro/.

    Gianmaria Silvello is a Post-doctoral Researcher at the University of Padua. He was awarded a master’s degree in Computer Engineering, University of Padua in 2006, a post-graduate master’s degree on “Design, Management and Conservation of Public and Private Digital Archives” in 2007 and a Ph.D. in Information Engineering from the University of Padua in 2011. He has published more than 60 scientific papers on IR evaluation; digital library; data models; LOD publishing and citation. More about his research can be found at http://www.dei.unipd.it/~silvello/.

    View full text