Skip to main content

Semantic Web Oriented Approaches for Smaller Communities in Publishing Findable Datasets

  • Conference paper
  • First Online:
Book cover Metadata and Semantic Research (MTSR 2020)

Abstract

Publishing findable datasets is a crucial step in data interoperability and reusability. Initiatives like Google data search and semantic web standards like Data Catalog Vocabulary (DCAT) and Schema.org provide mechanisms to expose datasets on the web and make them findable. Apart from these standards, it is also essential to optionally explain the datasets, both its structure and applications. Metadata application profiles are a suitable mechanism to ensure interoperability and improve use-cases for datasets. Standards and attempts, including the Profiles (PROF) and VoID vocabularies, as well as frameworks like Dublin core application profiles (DCAP), provide a better understanding of developing and publishing metadata application profiles. The major challenge for domain experts, especially smaller communities intending to publish findable data on the web, is the complexities in understanding and conforming to such standards. Mostly, these features are provided by complex data repository systems, which is not always a sustainable choice for various small groups and communities looking for self-publishing their datasets. This paper attempts to utilize these standards in self-publishing findable datasets through customizing minimal static web publishing tools and demonstrating the possibilities to encourage smaller communities to adopt cost-effective and simple dataset publishing. The authors express this idea though this work-in-progress paper with the notion that such simple tools will help small communities to publish findable datasets and thus, gain more reach and acceptance for their data. From the perspective of the semantic web, such tools will improve the number of linkable datasets as well as promote the fundamental concepts of the decentralized web.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    https://schema.org/.

  2. 2.

    https://www.dublincore.org/specifications/dublin-core/dcmi-terms/.

  3. 3.

    https://datasetsearch.research.google.com/.

  4. 4.

    https://dev.socrata.com/.

  5. 5.

    https://developers.google.com/search/docs/data-types/dataset.

  6. 6.

    https://schema.org/Dataset.

  7. 7.

    https://gohugo.io/.

  8. 8.

    https://gohugo.io/templates/.

  9. 9.

    https://golang.org/pkg/text/template/.

  10. 10.

    https://golang.org/pkg/html/template/.

  11. 11.

    https://gettypubs.github.io/quire.

  12. 12.

    https://getdkan.org/.

  13. 13.

    https://jkan.io/.

  14. 14.

    https://ckan.org/.

  15. 15.

    https://invenio-software.org/.

References

  1. Baca, M.: Introduction to metadata (2016). http://www.getty.edu/publications/intrometadata

  2. Browning, D., Perego, A., Albertoni, R., Cox, S., Beltran, A.G., Winstanley, P.: Data catalog vocabulary (DCAT) - version 2. W3C Recommendation, W3C (2020). https://www.w3.org/TR/2020/REC-vocab-dcat-2-20200204/

  3. Car, N.: The profiles vocabulary. W3C Note, W3C (2019). https://www.w3.org/TR/2019/NOTE-dx-prof-20191218/

  4. Cousijn, H., et al.: A data citation roadmap for scientific publishers. Sci. Data 5(1), 180259 (2018). https://doi.org/10.1038/sdata.2018.259

    Article  Google Scholar 

  5. Crosas, M.: Joint declaration of data citation principles - FINAL (2013). https://www.force11.org/datacitationprinciples

  6. Heery, R., Patel, M.: Application profiles: mixing and matching metadata schemas. Ariadne 25 (2000). http://www.ariadne.ac.uk/issue/25/app-profiles/

  7. Hillmann, D.: Metadata standards and applications. Metadata Management Associates LLC (2006). http://managemetadata.com/

  8. Jacobsen, A., et al.: FAIR principles: interpretations and implementation considerations. Data Intelligence 2(1–2), 10–29 (2019). https://doi.org/10.1162/dint_r_00024

    Article  Google Scholar 

  9. Alexander, K., Cyganiak, R., Hausenblas, M., Zhao, J.: Describing linked datasets with the VoID vocabulary. https://www.w3.org/TR/void/

  10. Noy, N.: Making it easier to discover datasets (2018). https://blog.google/products/search/making-it-easier-discover-datasets/, library Catalog: www.blog.google

  11. Noy, N., Brickley, D.: Facilitating the discovery of public datasets (2017). http://ai.googleblog.com/2017/01/facilitating-discovery-of-public.html, library Catalog: ai.googleblog.com

  12. Sansone, S.A., et al.: FAIRsharing as a community approach to standards, repositories and policies. Nature Biotechnol. 37(4), 358–367 (2019). https://doi.org/10.1038/s41587-019-0080-8

    Article  Google Scholar 

  13. Stevens, I., Mukarram, A.K., Hörtenhuber, M., Meehan, T.F., Rung, J., Daub, C.O.: Ten simple rules for annotating sequencing experiments. PLoS Comput. Biol. 16(10), e1008260 (2020). https://doi.org/10.1371/journal.pcbi.1008260

    Article  Google Scholar 

  14. Tennison, J.: CSV on the web: a primer. W3C Note, W3C (2016). https://www.w3.org/TR/2016/NOTE-tabular-data-primer-20160225/

  15. Thalhath, N., Nagamori, M., Sakaguchi, T.: MetaProfiles - a mechanism to express metadata schema, privacy, rights and provenance for data interoperability. In: Ishita, E., Pang, N.L.S., Zhou, L. (eds.) ICADL 2020. LNCS, vol. 12504, pp. 364–370. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-64452-9_34

    Chapter  Google Scholar 

  16. Walsh, P., Pollock, R.: Data package. https://specs.frictionlessdata.io/data-package

  17. Wilkinson, M.D., et al.: The FAIR guiding principles for scientific data management and stewardship. Sci. Data 3(1), 160018 (2016). https://doi.org/10.1038/sdata.2016.18. https://www.nature.com/articles/sdata201618

Download references

Acknowledgements

This work was supported by JSPS KAKENHI Grant Numbers JP18K11984 (to Nagamori) and 19K206740A (to Kasaragod).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Deepa Kasaragod .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Thalhath, N., Nagamori, M., Sakaguchi, T., Kasaragod, D., Sugimoto, S. (2021). Semantic Web Oriented Approaches for Smaller Communities in Publishing Findable Datasets. In: Garoufallou, E., Ovalle-Perandones, MA. (eds) Metadata and Semantic Research. MTSR 2020. Communications in Computer and Information Science, vol 1355. Springer, Cham. https://doi.org/10.1007/978-3-030-71903-6_23

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-71903-6_23

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-71902-9

  • Online ISBN: 978-3-030-71903-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics