Abstract
Recording provenance is a key requirement for data-centric scholarship, allowing researchers to evaluate the integrity of source data sets and to reproduce, and thereby validate, results. Provenance has become even more critical in the web environment, in which data from distributed sources and of varying integrity can be combined and derived. Recent work by the W3C on the PROV model provides the foundation for semantically rich, interoperable, and web-compatible provenance metadata. We apply that model to complex but characteristic provenance examples of social science data, describe scenarios that make scholarly use of those provenance descriptions, and propose a method for encoding this provenance metadata within the widely used DDI metadata standard.
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Lagoze, C., Williams, J., Vilhuber, L. (2013). Encoding Provenance Metadata for Social Science Datasets. In: Garoufallou, E., Greenberg, J. (eds) Metadata and Semantics Research. MTSR 2013. Communications in Computer and Information Science, vol 390. Springer, Cham. https://doi.org/10.1007/978-3-319-03437-9_13
DOI: https://doi.org/10.1007/978-3-319-03437-9_13
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-03436-2
Online ISBN: 978-3-319-03437-9
eBook Packages: Computer Science, Computer Science (R0)