Abstract
Publishing findable datasets is a crucial step in data interoperability and reusability. Initiatives like Google data search and semantic web standards like Data Catalog Vocabulary (DCAT) and Schema.org provide mechanisms to expose datasets on the web and make them findable. Apart from these standards, it is also essential to optionally explain the datasets, both its structure and applications. Metadata application profiles are a suitable mechanism to ensure interoperability and improve use-cases for datasets. Standards and attempts, including the Profiles (PROF) and VoID vocabularies, as well as frameworks like Dublin core application profiles (DCAP), provide a better understanding of developing and publishing metadata application profiles. The major challenge for domain experts, especially smaller communities intending to publish findable data on the web, is the complexities in understanding and conforming to such standards. Mostly, these features are provided by complex data repository systems, which is not always a sustainable choice for various small groups and communities looking for self-publishing their datasets. This paper attempts to utilize these standards in self-publishing findable datasets through customizing minimal static web publishing tools and demonstrating the possibilities to encourage smaller communities to adopt cost-effective and simple dataset publishing. The authors express this idea though this work-in-progress paper with the notion that such simple tools will help small communities to publish findable datasets and thus, gain more reach and acceptance for their data. From the perspective of the semantic web, such tools will improve the number of linkable datasets as well as promote the fundamental concepts of the decentralized web.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
- 2.
- 3.
- 4.
- 5.
- 6.
- 7.
- 8.
- 9.
- 10.
- 11.
- 12.
- 13.
- 14.
- 15.
References
Baca, M.: Introduction to metadata (2016). http://www.getty.edu/publications/intrometadata
Browning, D., Perego, A., Albertoni, R., Cox, S., Beltran, A.G., Winstanley, P.: Data catalog vocabulary (DCAT) - version 2. W3C Recommendation, W3C (2020). https://www.w3.org/TR/2020/REC-vocab-dcat-2-20200204/
Car, N.: The profiles vocabulary. W3C Note, W3C (2019). https://www.w3.org/TR/2019/NOTE-dx-prof-20191218/
Cousijn, H., et al.: A data citation roadmap for scientific publishers. Sci. Data 5(1), 180259 (2018). https://doi.org/10.1038/sdata.2018.259
Crosas, M.: Joint declaration of data citation principles - FINAL (2013). https://www.force11.org/datacitationprinciples
Heery, R., Patel, M.: Application profiles: mixing and matching metadata schemas. Ariadne 25 (2000). http://www.ariadne.ac.uk/issue/25/app-profiles/
Hillmann, D.: Metadata standards and applications. Metadata Management Associates LLC (2006). http://managemetadata.com/
Jacobsen, A., et al.: FAIR principles: interpretations and implementation considerations. Data Intelligence 2(1–2), 10–29 (2019). https://doi.org/10.1162/dint_r_00024
Alexander, K., Cyganiak, R., Hausenblas, M., Zhao, J.: Describing linked datasets with the VoID vocabulary. https://www.w3.org/TR/void/
Noy, N.: Making it easier to discover datasets (2018). https://blog.google/products/search/making-it-easier-discover-datasets/, library Catalog: www.blog.google
Noy, N., Brickley, D.: Facilitating the discovery of public datasets (2017). http://ai.googleblog.com/2017/01/facilitating-discovery-of-public.html, library Catalog: ai.googleblog.com
Sansone, S.A., et al.: FAIRsharing as a community approach to standards, repositories and policies. Nature Biotechnol. 37(4), 358–367 (2019). https://doi.org/10.1038/s41587-019-0080-8
Stevens, I., Mukarram, A.K., Hörtenhuber, M., Meehan, T.F., Rung, J., Daub, C.O.: Ten simple rules for annotating sequencing experiments. PLoS Comput. Biol. 16(10), e1008260 (2020). https://doi.org/10.1371/journal.pcbi.1008260
Tennison, J.: CSV on the web: a primer. W3C Note, W3C (2016). https://www.w3.org/TR/2016/NOTE-tabular-data-primer-20160225/
Thalhath, N., Nagamori, M., Sakaguchi, T.: MetaProfiles - a mechanism to express metadata schema, privacy, rights and provenance for data interoperability. In: Ishita, E., Pang, N.L.S., Zhou, L. (eds.) ICADL 2020. LNCS, vol. 12504, pp. 364–370. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-64452-9_34
Walsh, P., Pollock, R.: Data package. https://specs.frictionlessdata.io/data-package
Wilkinson, M.D., et al.: The FAIR guiding principles for scientific data management and stewardship. Sci. Data 3(1), 160018 (2016). https://doi.org/10.1038/sdata.2016.18. https://www.nature.com/articles/sdata201618
Acknowledgements
This work was supported by JSPS KAKENHI Grant Numbers JP18K11984 (to Nagamori) and 19K206740A (to Kasaragod).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Thalhath, N., Nagamori, M., Sakaguchi, T., Kasaragod, D., Sugimoto, S. (2021). Semantic Web Oriented Approaches for Smaller Communities in Publishing Findable Datasets. In: Garoufallou, E., Ovalle-Perandones, MA. (eds) Metadata and Semantic Research. MTSR 2020. Communications in Computer and Information Science, vol 1355. Springer, Cham. https://doi.org/10.1007/978-3-030-71903-6_23
Download citation
DOI: https://doi.org/10.1007/978-3-030-71903-6_23
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-71902-9
Online ISBN: 978-3-030-71903-6
eBook Packages: Computer ScienceComputer Science (R0)