Abstract
Legal and policy-oriented restrictions often hamper if not inhibit well-intended efforts to share sensitive or restricted data. The research reported on in this paper is a part of a larger initiative to develop a prototype system for automatically generating data sharing agreements that address privacy, legal concerns, and other restrictions. A content analysis was conducted, examining a sample of 26 data sharing agreements. The results include 6 high level categories, 15 mid-level attributes, and over 90 lower-level specific attributes, a portion of which can help to expeditiously support the automatic development of data sharing agreements. The paper presents background information, research questions and methods, results, and a discussion. The conclusion summarizes our results and identifies next steps.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Janssen, M., Charalabidis, Y., Zuiderwijk, A.: Benefits, adoption barriers and myths of open data and open government. Inf. Syst. Manage. 29(4), 258–268 (2012)
McGuire, A.L., Oliver, J.M., Slashinski, M.J., Graves, J.L., Wang, T., Kelly, P.A., Fisher, W., Lau, C.C., Goss, J., Okcu, M., Treadwell-Deering, D.: To share or not to share: a randomized trial of consent for data sharing in genome research. Genet. Med. Off. J. Am. Coll. Med. Genet. 13(11), 948–955 (2011)
Pencarrick Hertzman, C., Meagher, N., McGrail, K.M.: Privacy by design at population data BC: a case study describing the technical, administrative, and physical controls for privacy-sensitive secondary use of personal information for research in the public interest. J. Am. Med. Inform. Assoc. 20(1), 25–28 (2012)
Gleason, C.J., Hamdan, A.N.: Crossing the (watershed) divide: satellite data and the changing politics of international river basins. Geogr. J. 183(1), 2–15 (2017)
Metadata Research Center: A Licensing Model and Ecosystem for Data Sharing (2017). http://cci.drexel.edu/mrc/projects/a-licensing-model-and-ecosystem-for-data-sharing/
Northeast Big Data Innovation Hub (2017). http://nebigdatahub.org/
Datahub: What is datahub? (2016). https://datahub.csail.mit.edu/www/
Creative Commons: Licensing types (2017). https://creativecommons.org/share-your-work/licensing-types-examples/
The National Archives (n.d.): Open government license for public sector information. http://nationalarchives.gov.uk/doc/open-government-licence/version/3/. Accessed 1 July 2017
Open Data Institute (n.d.): What is open data? https://theodi.org/what-is-open-data. Accessed 1 July 2017
Dietrich, D., Gray, J., McNamara, T., Poikola, A., Pollock, P., Tait, J., Zijlstra, T., et al.: Open data handbook (2009). http://opendatahandbook.org. Accessed 15 June 2017
CC0. (n.d.). https://creativecommons.org/share-your-work/public-domain/cc0/. Accessed 1 July 2017
Greenbaum, D., Sboner, A., Mu, X., Gerstein, M.: Genomics and privacy: Implications of the new reality of closed data for the field. PLoS Comput. Biol. 7(12), e1002278 (2011). https://doi.org/10.1371/journal.pcbi.1002278
Segall, L.: Ashley Madison: Life after the hack. CNN Tech (2017). http://money.cnn.com/mostly-human/click-swipe-cheat/
Kwon, D.: Google’s DeepMind, UK’s NHS criticized for sharing data. The Scientist (2017). http://www.the-scientist.com/?articles.view/articleNo/49812/title/Google-s-DeepMind–UK-s-NHS-Criticized-for-Sharing-Data/
Kaplan, B.: How should health data be used? Camb. Q. Healthc. Ethics CQ Int. J. Healthc. Ethics Committees 25(2), 312 (2016)
Dryad Digital Repository (2017). http://datadryad.org/
Liu, X., Li, X., Motiwalla, L., Li, W., Zheng, H., Franklin, P.D.: Preserving patient privacy when sharing same-disease data. J. Data Inf. Qual. 7(4), 17 (2016). https://doi.org/10.1145/2956554
El Emam, K., et al.: De-identification methods for open health data: the case of the heritage health prize claims dataset. J. Med. Internet Res. 14(1), e33 (2012)
Raymond, E.: The cathedral and the bazaar. Philos. Technol. 12(3), 23 (1999). https://monoskop.org/File:Raymond_Eric_S_The_Cathedral_and_the_Bazaar_rev_ed.pdf
Atkins, D.E., National Science Foundation (U.S.): Blue-ribbon advisory panel on cyberinfrastructure. Revolutionizing Science and Engineering Through Cyberinfrastructure: Report of the National Science Foundation Blue-Ribbon Advisory Panel on Cyberinfrastructure (2003)
Hey, T., Trefethen, A.: The Data deluge: an e-science perspective. In: Berman, F., Fox, G., Hey, T. (eds.) Grid Computing: Making the Global Infrastructure a Reality, pp. 809–824. Wiley, Chichester (2003). https://doi.org/10.1002/0470867167.ch36
Hey, T., Tansley, S., Tolle, K.: The Fourth Paradigm: Data-Intensive Scientific Research. Microsoft Research, Redmond, WA (2009)
Greenberg, J., Grabus, S., Hudson, F., Kraska, T., Madden, S., Bastón, R.: The Northeast Big Data Hub: “Enabling Seamless Data Sharing in Industry and Academia” Workshop. The Northeast Big Data Innovation Hub, Philadelphia (2016). https://doi.org/10.17918/D8159V
Zhang, Y., Wildemuth, B.M.: Qualitative analysis of content, applications of social research methods to questions in information and library science. Google Scholar, pp. 308–319 (2005)
Crosas, M.: The DataTags system: sharing sensitive data with confidence. In: RDA 8th Plenary, Denver Colorado, 16 September 2016 (2016). https://scholar.harvard.edu/mercecrosas/presentations/datatags-system-sharing-sensitive-data-confidence
Barth, A., Datta, A., Mitchell, J.C., Nissenbaum, H.: Privacy and contextual integrity: framework and applications. In: 2006 IEEE Symposium on Security and Privacy, 15-pp. IEEE, May 2006
Blake, K., Collins, M.: Controlling chaos: management of electronic journal holdings in an academic library environment. Serials Rev. 36(4), 242–250 (2010)
Higgins, D., Berkley, C., Jones, M.B.: Managing heterogeneous ecological data using morpho. In: The 14th International Conference on Scientific and Statistical Database Management, pp. 69–76 (2002). https://doi.org/10.1109/SSDM.2002.1029707
Acknowledgements
We acknowledge the support of the National Science Foundation/IIS/BD Spokes/Award #1636788; and thank Sam Maddden (MIT), Carsten Binnig (TU Darmstadt), and Tim Kraska (Brown University), and other individuals who provided us with data sharing agreements.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Grabus, S., Greenberg, J. (2017). Toward a Metadata Framework for Sharing Sensitive and Closed Data: An Analysis of Data Sharing Agreement Attributes. In: Garoufallou, E., Virkus, S., Siatri, R., Koutsomiha, D. (eds) Metadata and Semantic Research. MTSR 2017. Communications in Computer and Information Science, vol 755. Springer, Cham. https://doi.org/10.1007/978-3-319-70863-8_29
Download citation
DOI: https://doi.org/10.1007/978-3-319-70863-8_29
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-70862-1
Online ISBN: 978-3-319-70863-8
eBook Packages: Computer ScienceComputer Science (R0)