ABSTRACT
While previous CSCW research on data sharing and reuse has focused on how researchers assess the trust and reliability of others' data, we know little about scientists' data use practices once the decision to reuse a dataset has been made. This qualitative study investigates how cancer-epidemiology post-docs work to understand the preexisting "Small Data" datasets they use. We report the ongoing and iterative nature of the information seeking inherent in using unfamiliar data, and the time-consuming and highly collaborative process post-docs used to understand the aspects of a dataset that mattered to their scientific questions. Understanding data use practices can help inform the design of both Small Data projects and large cyberinfrastructure projects in which multi-source data are collected and combined.
Index Terms
- Beyond trust and reliability: reusing data in collaborative cancer epidemiology research