Original Research
The BMS-LM ontology for biomedical data reporting throughout the lifecycle of a research study: From data model to ontology

https://doi.org/10.1016/j.jbi.2022.104007Get rights and content
Under an Elsevier user license
open archive

Highlights

  • Current status of data management and reporting in the biomedical research field.

  • Key concepts for structuring research data management.

  • A brief state-of-the-art in semantic interoperability for managing heterogeneous Knowledge Organization Systems (KOS)

  • Reusable methods for building an interoperable ontology applicable in different specific context.

  • A case study for the migration from a data model to an ontology.

  • The BMS-LM ontology and examples of its use for preclinical research.

Abstract

Biomedical research data reuse and sharing is essential for fostering research progress. To this aim, data producers need to master data management and reporting through standard and rich metadata, as encouraged by open data initiatives such as the FAIR (Findable, Accessible, Interoperable, Reusable) guidelines. This helps data re-users to understand and reuse the shared data with confidence. Therefore, dedicated frameworks are required. The provenance reporting throughout a biomedical study lifecycle has been proposed as a way to increase confidence in data while reusing it. The Biomedical Study - Lifecycle Management (BMS-LM) data model has implemented provenance and lifecycle traceability for several multimodal-imaging techniques but this is not enough for data understanding while reusing it. Actually, in the large scope of biomedical research, a multitude of metadata sources, also called Knowledge Organization Systems (KOSs), are available for data annotation. In addition, data producers uses local terminologies or KOSs, containing vernacular terms for data reporting. The result is a set of heterogeneous KOSs (local and published) with different formats and levels of granularity. To manage the inherent heterogeneity, semantic interoperability is encouraged by the Research Data Management (RDM) community. Ontologies, and more specifically top ontologies such as BFO and DOLCE, make explicit the metadata semantics and enhance semantic interoperability. Based on the BMS-LM data model and the BFO top ontology, the BioMedical Study - Lifecycle Management (BMS-LM) core ontology is proposed together with an associated framework for semantic interoperability between heterogeneous KOSs. It is made of four ontological levels: top/core/domain/local and aims to build bridges between local and published KOSs. In this paper, the conversion of the BMS-LM data model to a core ontology is detailed. The implementation of its semantic interoperability in a specific domain context is explained and illustrated with examples from small animal preclinical research.

Keywords

Provenance
Local terminologies
Data sharing
Research Data Management
Data annotation
Heterogeneous data

Abbreviations

ADQIV
Annotation property, Data property, Quality, data Item, and Value specification
BMS-LM
BioMedical Study – Lifecycle Management
DICOM
Digital Imaging and Communications in Medicine
DMP
Data Management Plan
FAIR
Findable Accessible Interoperable Reusable
KOS
Knowledge Organization System
NCBO Bioportal
National Center for Biomedical Ontology Bioportal (a repository for ontologies)
OBO
Open Biological and Biomedical Ontology
OWL
Ontology Web Language (W3C standard)
PROV-DM
PROVenance Data Model
PET-CT
Positron Emission Tomography – Computed Tomography
RDF
Resource Description Framework
RDM
Research Data Management
SKOS
Simple Knowledge Organization System (W3C standard)
UML
Unified Modeling Language Ontologies
BFO
Basic Formal Ontology
DCM
DICOM Controlled Terminology
DOLCE
Descriptive Ontology for Linguistic and Cognitive Engineering
FBbi
Biological Imaging Methods Ontology
IAO
Information Artifact Ontology
IOBC
Interlinking Ontology for Biological Concepts
MESH
Medical Subject Heading
MS
Mass Spectrometry Ontology
NCIT
National Cancer Institute Thesaurus
OBI
Ontology of Biomedical Investigations
PATIT
PAT placental Investigative Technique
PROV-O
PROVenance Ontology
QIBO
Quantitative Imaging Biomarker Ontology
RO
Relation Ontology

Cited by (0)

1

Co-last authors.