Skip to main content

Investigating Schema Definitions Using RDFS and OWL 2 for RDF Databases in Life Sciences

  • Conference paper
  • First Online:
Book cover Semantic Technology (JIST 2019)

Abstract

With the development of measuring instruments, life science databases are becoming larger and more heterogeneous. As a step towards providing integrated databases, many life science databases have been published as Linked Open Data (LOD). To utilize such databases efficiently, it is desirable that the schema, such as class–class relations, can be acquired in advance from SPARQL Protocol and RDF Query Language (SPARQL) endpoints. However, a SPARQL query to obtain the schema from a SPARQL endpoint often fails because it is time consuming and places an excessive load on the server. On the other hand, many datasets include the definitions using standard vocabularies, such as RDF Schema 1.1 and OWL 2. If the database schema is properly described and provided using RDF Schema 1.1 or OWL 2, it is no longer necessary to obtain it by exhaustively crawling the SPARQL endpoints. Therefore, we investigated the extent of the schema definitions in life sciences databases, focusing on seven specific patterns related to properties using RDF Schema 1.1 or OWL 2. We found that for some datasets, the patterns of domain and range definitions using RDF Schema 1.1, are relatively well defined for properties. However, there are few patterns using OWL 2 as schema definitions for properties. Additionally, we validated RDF datasets by restricting the patterns of domain and range definitions of RDF Schema 1.1. Subsequently, we found that RDF datasets follow these restrictions.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Yamaguchi, A., Toh, H.: Implementing LOD surfer as a search system for the annotation of multiple protein sequence alignment. In: Ichise, R., Lecue, F., Kawamura, T., Zhao, D., Muggleton, S., Kozaki, K. (eds.) JIST 2018. LNCS, vol. 11341, pp. 418–426. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-04284-4_29

    Chapter  Google Scholar 

  2. RDF Schema 1.1. https://www.w3.org/TR/rdf-schema/

  3. OWL 2 Web Ontology Language Document Overview. https://www.w3.org/TR/owl2-overview/

  4. Yamamoto, Y., Yamaguchi, A., Splendiani, A.: YummyData: providing high-quality open life science data. Database 2018(1), bay022 (2018)

    Google Scholar 

  5. Polikoff, I.: From OWL to SHACL in an automated way. https://www.topquadrant.com/2018/05/01/from-owl-to-shacl-in-an-automated-way/

  6. Life Science Database Archive. https://dbarchive.biosciencedbc.jp/

  7. eagle-i. https://www.eagle-i.net/

Download references

Acknowledgments

This work was supported by JSPS KAKENHI grant numbers 17K00434 and by the National Bioscience Database Center of the Japan Science and Technology Agency. We also acknowledge the participants of the NBDC/DBCLS BioHackathon 2019, who participated in a discussion on a validation method.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Atsuko Yamaguchi .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Yamaguchi, A., Kushida, T., Yamamoto, Y., Kozaki, K. (2020). Investigating Schema Definitions Using RDFS and OWL 2 for RDF Databases in Life Sciences. In: Wang, X., Lisi, F., Xiao, G., Botoeva, E. (eds) Semantic Technology. JIST 2019. Communications in Computer and Information Science, vol 1157. Springer, Singapore. https://doi.org/10.1007/978-981-15-3412-6_14

Download citation

  • DOI: https://doi.org/10.1007/978-981-15-3412-6_14

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-15-3411-9

  • Online ISBN: 978-981-15-3412-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics