Abstract
It is introduced a platform for quality control and monitoring of Cuban scientific publications named Sceiba. To this end, it needs to collect scientific publications comprehensively at the national level. Metadata quality is crucial for Sceiba interoperability and development. This paper exposes how metadata quality is assured and enhanced in Sceiba. The metadata aggregation pipeline is worked out to collect, transform, store and expose metadata on Persons, Organizations, Sources, and Scientific Publications. Raw data transformation into Sceiba’s internal metadata models includes cleaning, disambiguation, deduplication, entity linking, validation, standardization, and enrichment using a semi-automated approach aligned with the findability, accessibility, interoperability, and reusability principles. To meet the requirements of metadata quality in Sceiba, a three-layer structure for metadata is used, including 1) discovery metadata, which allows the discovery of relevant scientific publications by browsing or query, 2) contextual metadata, which allows a) rich information on persons, organizations and other aspects associated with publications, b) interoperation among common metadata formats used in Current Research Information Systems, journals systems or Institutional Repositories; 3) detailed metadata, which is specific to the domain of scientific publication evaluation. The example provided shows how the metadata quality is improved in the Identification System for Cuban Research Organizations, one of Sceiba´s component applications.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
Vlaamse Interuniversitaire Raad - Universitaire Ontwikkelingssamenwerking’ (VLIR-UOS), more information about the project can be found in https://www.vliruos.be/en/projects/project/22?pid=4202.
- 2.
Sceiba is a word that arises from the combination of the Latin ‘‘sci’’ and Ceiba, a leafy tree considered sacred by several Cuban traditions.
- 3.
- 4.
- 5.
- 6.
- 7.
- 8.
- 9.
References
Allen, R.: Metadata for social science datasets. In: Rich Search and Discovery for Research Datasets: Building the Next Generation of Scholarly Infrastructure, pp. 40–52. Sage (2020)
Alma’aitah, W.Z.A., Talib, A.Z., Osman, M.A.: Opportunities and challenges in enhancing access to metadata of cultural heritage collections: a survey. Artif. Intell. Rev. 53(5), 3621–3646 (2020)
Bryant, R., Clements, A., Castro, P., de Cantrell, J., Dortmund, A., Fransen, J., et. al.: Practices and patterns in research information management: findings from a global survey (2020). https://doi.org/10.25333/BGFG-D241
Fernandes, S., Pinto, M.J.: From the institutional repository to a CRIS system. Qual. Quant. Methods Libr. 7(3), 481–487 (2019)
Galvez, C., Moya-Anegón, F.: The unification of institutional addresses applying parametrized finite-state graphs (P-FSG). Scientometrics 69, 323–345 (2006). https://doi.org/10.1007/s11192-006-0156-3
Jeffery, K., Houssos, N., Jörg, B., Asserson, A.: Research Information management: the CERIF approach. Int. J. Metadata Semant. Ontol. 9, 5–14 (2014). https://doi.org/10.1504/IJMSO.2014.059142
Jörg, B., Jeffery, K., Dvorak, J., Houssos, N., Asserson, A., Grootel, G., et.al.: CERIF 1.3 Full Data Model (FDM): introduction and specification (2012)
Ma, J.: Managing metadata for digital projects. Libr. Collect. Acquis. Tech. Serv. 30, 17–23 (2006)
Schriml, L.M., Chuvochina, M., Davies, N., Eloe-Fadrosh, E.A., Finn, R.D., Hugenholtz, P., et al.: COVID-19 pandemic reveals the peril of ignoring metadata standards. Sci. Data 7(1), 188 (2020). https://doi.org/10.1038/s41597-020-0524-5
Tharani, K.: Much more than a mere technology: a systematic review of Wikidata in libraries. J. Acad. Librarianship 47(2), 102326 (2021). https://doi.org/10.1016/j.acalib.2021.102326
Wiley, C.: Metadata use in research data management. Bull. Assoc. Inf. Sci. Technol. 40(6), 38–40 (2014). https://doi.org/10.1002/bult.2014.1720400612
Wilkinson, M.D., Dumontier, M., Aalbersberg, I.J., Appleton, G., Axton, M., Baak, A., et al.: The FAIR guiding principles for scientific data management and stewardship. Sci. Data 3(1), 1–9 (2016). https://doi.org/10.1038/sdata.2016.18
Zuiderwijk, A., Jeffery, K., Janssen, M.: The potential of metadata for linked open data and its value for users and publishers. J. e-Democracy Open Gov. 4(2), 222–244 (2012). https://doi.org/10.29379/jedem.v4i2.138
Acknowledgements
The work of the Sceiba project was supported by the ‘Vlaamse Interuniversitaire Raad - Universitaire Ontwikkelingssamenwerking’ (VLIR-UOS), Belgium. The authors are team members of the Sceiba project. They like to thank Sadia Van Cauwenbergh (Hasselt University) and Raf Guns (Antwerp University) for their suggestions on the article and to the Sceiba team of the University of Pinar del Rio for their contribution to the development of the Sceiba platform.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 Springer Nature Switzerland AG
About this paper
Cite this paper
Arencibia, E., Martinez, R., Marti-Lahera, Y., Goovaerts, M. (2022). On Metadata Quality in Sceiba, a Platform for Quality Control and Monitoring of Cuban Scientific Publications. In: Garoufallou, E., Ovalle-Perandones, MA., Vlachidis, A. (eds) Metadata and Semantic Research. MTSR 2021. Communications in Computer and Information Science, vol 1537. Springer, Cham. https://doi.org/10.1007/978-3-030-98876-0_9
Download citation
DOI: https://doi.org/10.1007/978-3-030-98876-0_9
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-98875-3
Online ISBN: 978-3-030-98876-0
eBook Packages: Computer ScienceComputer Science (R0)