Abstract
Reproducibility of research is critical for science. Computational biology research presents a significant challenge, given the need to track critical details, such as software version or genome draft iteration. Metadata research infrastructures, while greatly improved, often assume a level of programming skills in their user community, or rely on expert curators to ensure that key information is not lost. This paper introduces MEDFORD, a new human-readable, easily-editable and templatable metadata language for scientists to collocate all the details relevant to their experiments. We provide an overview of the underlying design principles, language, and current and planned support infrastructure for parsing and translating MEDFORD into other metadata formats. MEDFORD 0.9 has been specifically designed for the coral research community, with initial metadata generated from RNA-Seq analyses of coral transcriptomes and coral photo collections. Notably, the format is generally applicable and useful for many types of scientific metadata generated by non-computer science experts.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Ball, A., Greenberg, J., Jeffery, K., Koskela, R.: RDA metadata standards directory working group (2016)
Bosch, T.C.G., McFall-Ngai, M.J.: Metaorganisms as the new frontier. Zoology 114(4), 185ā190 (2011)
Chandler, C.L., et al.: BCO-DMO: stewardship of marine research data from proposal to preservation. American Geophysical Union 2016:OD24B-2457 (2016)
Donner, S.D., Rickbeil, G.J.M., Heron, S.F.: A new, high-resolution global mass coral bleaching database. PLoS One 12(4), e0175490 (2017)
Fegraus, E.H., Andelman, S., Jones, M.B., Schildhauer, M.: Maximizing the value of ecological data with structured metadata: an introduction to ecological metadata language (EML) and principles for metadata creation. Bull. Ecol. Soc. Am. 86(3), 158ā168, e0175490 (2005)
Hughes, T.P., et al.: Coral reefs in the anthropocene. Nature 546, 82ā90, e0175490 (2017)
Lassila, O., Swick, R.R., et al.: Resource description framework (RDF) model and syntax specification (1998)
Leipzig, J., NĆ¼st, D., et al.: The role of metadata in reproducible computational research. CoRR, abs/2006.08589 (2020)
Liew, Y.J., Aranda, M., Voolstra, C.R.: Reefgenomics. Org - a repository for marine genomics data. Database 12, baw152 (2016)
Littman, J., Madden, L., Vargas, B.: The BagIt file packaging format (v0. 97) draft-kunze-bagit-07. txt (2012)
Madin, J.S., et al.: A trait-based approach to advance coral reef science. Trends Ecol. Evol. 31(6), 419ā428, e0175490 (2016)
Qin, J., Ball, A., Greenberg, J.: Functional and architectural requirements for metadata: supporting discovery and management of scientific data. In: International Conference on Dublin Core and Metadata Applications, pp. 62ā71 (2012)
Vardigan, M.: The DDI matures: 1997 to the present. IASSIST Quart. 37(1ā4), 45ā45, e0175490 (2014)
Weibel, S.L., Koch, T.: The Dublin core metadata initiative. D-lib Magaz. 6(12), 1082ā9873, e0175490 (2000)
Wilkinson, M.D., Dumontier, M., et al.: The FAIR guiding principles for scientific data management and stewardship. Sci. Data 3(1), 1ā9, e0175490 (2016)
Woodhead, A., et al.: Coral reef ecosystem services in the anthropocene. Funct. Ecol. 33(6), 1023ā1034, e0175490 (2019)
Yu, L., Li, T., Li, L., et al.: SAGER: a database of symbiodiniaceae and algal genomic resource. Database 07, baaa051 (2020)
Acknowledgment
The authors thank the anonymous reviewers for their valuable suggestions. This work is supported in part by funds from the National Science Foundation under grants NSF-OAC #1939263, #1939795 and #1940233.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
Ā© 2022 Springer Nature Switzerland AG
About this paper
Cite this paper
Shpilker, P. et al. (2022). MEtaData Format forĀ Open Reef Data (MEDFORD). In: Garoufallou, E., Ovalle-Perandones, MA., Vlachidis, A. (eds) Metadata and Semantic Research. MTSR 2021. Communications in Computer and Information Science, vol 1537. Springer, Cham. https://doi.org/10.1007/978-3-030-98876-0_18
Download citation
DOI: https://doi.org/10.1007/978-3-030-98876-0_18
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-98875-3
Online ISBN: 978-3-030-98876-0
eBook Packages: Computer ScienceComputer Science (R0)