Abstract
Research has become more data-intensive over the last few decades. Sharing research data is often a challenge, especially for interdisciplinary collaborative projects. One primary goal of a research infrastructure for data management should be to enable efficient data discovery and integration of heterogeneous data. In order to enable such interoperability, a lot of effort has been undertaken by scientists to develop standards and characterize their domain knowledge in the form of taxonomies and formal ontologies. However, these knowledge models are often disconnected and distributed. The work presented here provides a promising approach for integrating and harmonizing terminological resources to serve as a backbone for a platform. The component developed, called the GFBio Terminology Service, acts as a semantic platform for access, development and reasoning over internally and externally maintained terminological resources within the biological and environmental domain. We highlight the utility of the Terminology Service by practical use cases of semantically enhanced components. We show how the Terminology Service enables applications to add meaning to their data by giving access to the knowledge that can be derived from the terminologies and data annotated by them.



Similar content being viewed by others
Notes
The European Bioinformatics Institute (www.ebi.ac.uk)
Data Publisher for Earth & Environmental Science (www.pangaea.de)
Natural History Museum (www.naturkundemuseum.berlin)
German Collection of Microorganisms and Cell Cultures (www.dsmz.de)
The Bavarian Natural History Collections (www.snsb.mwn.de)
The complete list of involved archives and data centers is available on the GFBio website.
The NCBI taxonomy is a curated classification and nomenclature for all of the organisms in the public sequence databases
Web Ontology Language, www.w3.org/OWL
JavaScript Object Notation (www.w3.org/TR/html-json-forms/)
Extensible Markup Language (www.w3.org/XML/)
Comma Separated Values (tools.ietf.org/html/rfc4180)
JSON for Linking Data (www.w3.org/TR/json-ld/)
References
WoRMS Editorial Board (2016) World Register of Marine Specie. http://www.marinespecies.org. Accessed 2016-04-25
Adamusiak T, Burdett T, Kurbatova N, Joeri van der Velde K, Abeygunawardena N, Antonakaki D, Kapushesky M, Parkinson H, Swertz MA (2011) Ontocat – Simple Ontology Search and Integration in Java, R and Rest/Javascript. BMC Bioinformatics 12(1):1–12
Atkins D, Droegemeier K, Feldman S, Garcia-Molina H, Klein M, Messerschmitt D, Messina P, Ostriker J, Wright M (2003) Revolutionizing Science and Engineering Through Cyberinfrastructure: Report of the Blue-Ribbon Advisory Panel on Cyberinfrastructure. National Science Foundation, Washington, DC
Authmann C, Beilschmidt C, Drönner J, Mattig M, Seeger B (2015) VAT: A System for Visualizing, Analyzing and Transforming Spatial Data in Science. Datenbank-Spektrum 15(3):175–184
Baader F, Lutz C, Suntisrivaraporn B (2006) CEL – A Polynomial-Time Reasoner for Life Science Ontologies. Springer, Berlin, Heidelberg
Berendsohn W, Döring M, Geoffroy M, Glück K, Güntsch A, Hahn A, Kusber WH, Li J, Röpert D, Specht F (2003) The berlin model: a concept-based taxonomic information model. In: MoReTax – Handling Factual Information Linked to Taxonomic Concepts in Biology. BfN, Schriftenreihe Vegetationskunde, vol 39
Ciardelli P, Kelbert P, Kohlbecker A, Hoffmann N, Güntsch A, Berendsohn WG (2009) The EDIT Cyberplatform for Taxonomy and the Taxonomic Workflow: Selected Components. In 39. Jahrestagung der Gesellschaft für Informatik e.V. (GI). GI, Lübeck, Germany, pp 625–638
Deegan (née Clark) JI, Dimmer EC, Mungall CJ (2010) Formalization of Taxon-Based Constraints to Detect Inconsistencies in Annotation and Ontology Development. BMC Bioinformatics 11(1):1–10
Côté RG, Jones P, Apweiler R, Hermjakob H (2006) The Ontology Lookup Service, a Lightweight Cross-Platform Tool for Controlled Vocabulary Queries. BMC Bioinformatics 7(1):1–7
Diepenbroek M, Glöckner FO, Grobe P, Güntsch A, Huber R, König-Ries B, Kostadinov I, Nieschulze J, Seeger B, Tolksdorf R, Triebel D (2014) Towards an Integrated Biodiversity and Ecological Research Data Management and Archiving Platform: The German Federation for the Curation of Biological Data (gfbio). 44. Jahrestagung der Gesellschaft für Informatik. GI, Stuttgart, Germany
Euzenat J, Shvaiko P (2013) Ontology Matching, 2nd edn. Springer-Verlag, Heidelberg (DE)
Federhen S (2012) The NCBI Taxonomy Database. Nucleic Acids Res 40(Database issue):D136–D143
Franz N (2011) Biological Taxonomy and Ontology Development: Scope and Limitations. Biodivers Informatics. doi:10.17161/bi.v7i1.3927
Gerlach R, Blaa D, Chamanara J, Hohmuth M, Navabpour N, Thiel S, König-Ries B (2015) Bexis 2 – A Platform for Managing Heterogeneous Biodiversity Data and Projects. TDWG Annual Conference
Hevner AR, March ST, Park J, Ram S (2004) Design Science in Information Systems Research. Mis Q 28(1):75–105
Hey T, Tansley S, Tolle KM et al (2009) The Fourth Paradigm: Data-Intensive Scientific Discovery vol. 1. Microsoft research, Redmond, WA
Hoehndorf R, Dumontier M, Oellrich A, Rebholz-Schuhmann D, Schofield PN, Gkoutos GV (2011) Interoperability Between Biomedical Ontologies Through Relation Expansion, Upper-Level Ontologies and Automatic Reasoning. PLOS ONE 6(7):1–9
Hoehndorf R, Slater L, Schofield PN, Gkoutos GV (2015) Aber-Owl: A Framework for Ontology-Based Data Access in Biology. BMC Bioinformatics 16(1):1–9
Holetschek J, Dröge G, Güntsch A, Berendsohn WG (2012) The ABCD of Primary Biodiversity Data Access. Plant Biosyst 146(4):771–779
Isaac A, Haslhofer B (2013) Europeana Linked Open Data – data.europeana.eu. Semant Web 4(3):291–297
de Jong Y, Kouwenberg J, Boumans L et al (2015) Pesi – A Taxonomic Backbone for Europe. Biodivers Data J 3:e5848
Köhler S, Bauer S, Mungall CJ, Carletti G, Smith CL, Schofield P, Gkoutos GV, Robinson PN (2011) Improving Ontologies by Automatic Reasoning and Evaluation of Logical Definitions. BMC Bioinformatics 12(1):1–8
Kuśnierczyk W (2008) Taxonomy-Based Partitioning of the Gene Ontology. J Biomed Inform 41(2):282–292
Leibniz Institute DSMZ (2016) Prokaryotic Nomenclature Up-To-Date. http://www.dsmz.de/bacterial-diversity/prokaryotic-nomenclature-up-to-date
Löffler F, Sateli B, Witte R, König-Ries B (2014) Towards semantic recommendation of biodiversity datasets based on linked open data. In: Proceedings of the 26th GI-Workshop Grundlagen von Datenbanken, vol. 1313. Bozen-Bolzano, Italy, pp 65–70
Meyer ET, Schroeder R (2015) Knowledge Machines: Digital Transformations of the Sciences and Humanities. MIT Press, Cambridge, MA
Noy NF, Shah NH, Whetzel PL, Dai B, Dorf M, Griffith N, Jonquet C, Rubin DL, Storey MD, Chute CG, Musen MA (2009) Bioportal: Ontologies and Integrated Data Resources at the Click of a Mouse. Nucleic Acids Res 37(Web-Server-Issue):170–173
Roskov Y, Abucay L, Orrell T, Nicolson D, Flann C, Bailly N, Kirk P, Bourgoin T, DeWalt R, Decock W, De~Wever A (eds) (2016) Species 2000 & ITIS Catalogue of Life, 25th March 2016. www.catalogueoflife.org/col (Species 2000: Naturalis, Leiden, the Netherlands. ISSN 2405-8858)
Schulz S, Stenzhorn H, Boeker M (2008) The ontology of biological taxa. In: Proceedings 16th International Conference on Intelligent Systems for Molecular Biology (ISMB) Toronto, Canada, July 19–23, 2008. pp 313–321
Smith B, Ashburner M, Rosse C, Bard J, Bug W, Ceusters W, Goldberg LJ, Eilbeck K, Ireland A, Mungall CJ, Leontis N, Rocca-Serra P, Ruttenberg A, Sansone SA, Scheuermann RH, Shah N, Whetzel PL, Lewis S (2007) The Obo Foundry: Coordinated Evolution of Ontologies to Support Biomedical Data Integration. Nat Biotechnol 25(11):1251–1255
Suominen O, Pessala S, Tuominen J, Lappalainen M, Nykyri S, Ylikotila H, Frosterus M, Hyvönen E (2014) Deploying national ontology services: From onki to finto. In: Proceedings of the Industry Track at the International Semantic Web Conference 2014. CEUR Workshop Proceedings
Thau D, Ludäscher B (2007) Reasoning About Taxonomies in First-Order Logic. Ecol Inform 2(3):195–209 (Meta-information systems and ontologies. A Special Feature from the 5th International Conference on Ecological Informatics ISEI5, Santa Barbara, CA, Dec. 4–7, 2006 Novel Concepts of Ecological Data Management S.I.)
Triebel D, Hagedorn G, Jablonski S, Rambold G (eds) (1999) Diversity Workbench – A virtual research environment for building and accessing biodiversity and environmental data. http://www.diversityworkbench.net
Tuominen J, Laurenne N, Hyvönen E (2011) Biological names and taxonomies on the semantic web – managing the change in scientific conception. In: The Semanic Web: Research and Applications – 8th Extended Semantic Web Conference, ESWC 2011, Heraklion, Crete, Greece, Proceedings, Part II. pp 255–269
Viljanen K, Tuominen J, Hyvönen E (2009) Ontology libraries for production use: The finnish ontology library service onki. In: Proceedings of the 6th European Semantic Web Conference
Viljanen K, Tuominen J, Mäkelä E, Hyvönen E (2012) Normalized access to ontology repositories. In: Proceedings of the Sixth International Conference on Semantic Computing (IEEE ICSC 2012). IEEE Press, Washington, DC
Xiang Z, Mungall C, Ruttenberg A, He Y (2011) Ontobee: A linked data server and browser for ontology terms. In: Proceedings of the 2nd International Conference on Biomedical Ontology Buffalo, NY, USA, July 26–30, 2011
Author information
Authors and Affiliations
Corresponding authors
Rights and permissions
About this article
Cite this article
Karam, N., Müller-Birn, C., Gleisberg, M. et al. A Terminology Service Supporting Semantic Annotation, Integration, Discovery and Analysis of Interdisciplinary Research Data. Datenbank Spektrum 16, 195–205 (2016). https://doi.org/10.1007/s13222-016-0231-8
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s13222-016-0231-8