ABSTRACT
Realizing the vision of networked data collections and services requires large bodies of scientific data that can be used in new ways. Adapting the concept of epistemological potential, we illustrate an approach for assessing the value of data for reuse in new domains. Two criteria for this analytic potential - integrity and fit-for-purpose - are recognized aspects of data curation, however identifying potential domains of interest for reuse requires knowledge of practices and needs across disciplines. Evaluating analytic potential will become increasingly important for libraries and repositories to make informed decisions about recruitment and curation of data for interdisciplinary science.
- Hey, T., Tansley, S., Tolle, K. (2009). The 4th paradigm: Data-intensive scientific discovery. Microsoft Research, Redmond, WA. http://research.microsoft.com/enus/ collaboration/fourthparadigm/Google Scholar
- National Science Board. (2005). Long-lived digital data collections: Enabling research and education in the 21st century. http://www.nsf.gov/pubs/2005/nsb0540/Google Scholar
- Choudhury, G.S. & Hanisch, R. (2009, December). Data Conservancy: Building a sustainable system for interdisciplinary scientific data curation and preservation. Ensuring Long-Term Preservation and Adding Value to Scientific and Technical Data. Presentation given at PV 2009 conference, Madrid, Spain..Google Scholar
- NASA Socioeconomic Data and Applications Center (SEDAC) Long-Term Archive. (2005). Appraisal for accession to the SEDAC LTA. http://sedac.ciesin columbia.edu/lta/Appraisal.html.Google Scholar
- Borgman, C., Wallis J., & Enyedy N. (2007). Little science confronts the data deluge: Habitat ecology, embedded sensor networks, and digital libraries. International Journal on Digital Libraries, 7(1/2), 17--30. Google ScholarDigital Library
- Faniel, I.M., & Jacobsen, T.E. (2010). Reusing scientific data: How earthquake engineering researchers assess the reusability of colleagues' data. Computer Supported Cooperative Work, 19(3--4), 355--375. Google ScholarDigital Library
- Zimmerman, A. (2007). Not by metadata alone: The use of diverse forms of knowledge to locate data for reuse. International Journal on Digital Libraries, 7(1--2), 5--16. Google ScholarDigital Library
- Hjørland, B. (1997). Information seeking and subject representation: An activity-theoretical approach to information science. Westport, CT : Greenwood.Google Scholar
- Lord, P., MacDonald, A., Lyon, L., & Giaretta, D. (2004). From data deluge to data curation. Proceedings of the UK e-Science All Hands Meeting, Nottingham, September 2004.Google Scholar
- Cragin, M.H., Palmer, C.L., Carlson, J.R., & Witt, M. (2010). Data sharing, small science, and institutional repositories. Philosophical Transactions of the Royal Society A, 368(1926), 4023--4038.Google ScholarCross Ref
- Baker, K.S., & Yarmey, L. (2009). Data stewardship: Environmental data curation and a web-of-repositories. International Journal of Digital Curation, 4(2).Google Scholar
- Cragin, M.H., Palmer, C.L., & Chao, T.C. (2010). Relating data practices, types, and curation functions: An empirically derived framework. Proceedings of the ASIS&T annual meeting, Pittsburgh, PA, Oct. 22--27, 2010. Google ScholarDigital Library
- Bates, M.J. (1999). The invisible substrate of information science. Journal of the American Society for Information Science, 50(12), 1043--1050. Google ScholarDigital Library
- Hjørland, Birger. (1998). Theory and metatheory of information science: A new interpretation. Journal of Documentation 54: 606--62Google ScholarCross Ref
Index Terms
- Analytic potential of data: assessing reuse value
Recommendations
The challenges of digging data: a study of context in archaeological data reuse
JCDL '13: Proceedings of the 13th ACM/IEEE-CS joint conference on Digital librariesField archaeology only recently developed centralized systems for data curation, management, and reuse. Data documentation guidelines, standards, and ontologies have yet to see wide adoption in this discipline. Moreover, repository practices have ...
Assessing Legacy Collections for Scientific Data Rescue
Diversity, Divergence, DialogueAbstractWidespread investments in facilitating reuse and reproducibility of scientific research have spurred an increasing recognition of the potential value of data biding in unpublished records and legacy research materials, such as scientists’ papers, ...
Managing fixity and fluidity in data repositories
iConference '12: Proceedings of the 2012 iConferenceData repositories walk a fine line between the fixity and fluidity of the data they curate. Change is constant, but too much change affects the integrity of data. This paper examines data transformations in three repositories, serving the zoological, ...
Comments