Skip to main content

Toward a Metadata Framework for Sharing Sensitive and Closed Data: An Analysis of Data Sharing Agreement Attributes

  • Conference paper
  • First Online:
Book cover Metadata and Semantic Research (MTSR 2017)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 755))

Included in the following conference series:

Abstract

Legal and policy-oriented restrictions often hamper if not inhibit well-intended efforts to share sensitive or restricted data. The research reported on in this paper is a part of a larger initiative to develop a prototype system for automatically generating data sharing agreements that address privacy, legal concerns, and other restrictions. A content analysis was conducted, examining a sample of 26 data sharing agreements. The results include 6 high level categories, 15 mid-level attributes, and over 90 lower-level specific attributes, a portion of which can help to expeditiously support the automatic development of data sharing agreements. The paper presents background information, research questions and methods, results, and a discussion. The conclusion summarizes our results and identifies next steps.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Janssen, M., Charalabidis, Y., Zuiderwijk, A.: Benefits, adoption barriers and myths of open data and open government. Inf. Syst. Manage. 29(4), 258–268 (2012)

    Article  Google Scholar 

  2. McGuire, A.L., Oliver, J.M., Slashinski, M.J., Graves, J.L., Wang, T., Kelly, P.A., Fisher, W., Lau, C.C., Goss, J., Okcu, M., Treadwell-Deering, D.: To share or not to share: a randomized trial of consent for data sharing in genome research. Genet. Med. Off. J. Am. Coll. Med. Genet. 13(11), 948–955 (2011)

    Google Scholar 

  3. Pencarrick Hertzman, C., Meagher, N., McGrail, K.M.: Privacy by design at population data BC: a case study describing the technical, administrative, and physical controls for privacy-sensitive secondary use of personal information for research in the public interest. J. Am. Med. Inform. Assoc. 20(1), 25–28 (2012)

    Article  Google Scholar 

  4. Gleason, C.J., Hamdan, A.N.: Crossing the (watershed) divide: satellite data and the changing politics of international river basins. Geogr. J. 183(1), 2–15 (2017)

    Article  Google Scholar 

  5. Metadata Research Center: A Licensing Model and Ecosystem for Data Sharing (2017). http://cci.drexel.edu/mrc/projects/a-licensing-model-and-ecosystem-for-data-sharing/

  6. Northeast Big Data Innovation Hub (2017). http://nebigdatahub.org/

  7. Datahub: What is datahub? (2016). https://datahub.csail.mit.edu/www/

  8. Creative Commons: Licensing types (2017). https://creativecommons.org/share-your-work/licensing-types-examples/

  9. The National Archives (n.d.): Open government license for public sector information. http://nationalarchives.gov.uk/doc/open-government-licence/version/3/. Accessed 1 July 2017

  10. Open Data Institute (n.d.): What is open data? https://theodi.org/what-is-open-data. Accessed 1 July 2017

  11. Dietrich, D., Gray, J., McNamara, T., Poikola, A., Pollock, P., Tait, J., Zijlstra, T., et al.: Open data handbook (2009). http://opendatahandbook.org. Accessed 15 June 2017

  12. CC0. (n.d.). https://creativecommons.org/share-your-work/public-domain/cc0/. Accessed 1 July 2017

  13. Greenbaum, D., Sboner, A., Mu, X., Gerstein, M.: Genomics and privacy: Implications of the new reality of closed data for the field. PLoS Comput. Biol. 7(12), e1002278 (2011). https://doi.org/10.1371/journal.pcbi.1002278

    Article  Google Scholar 

  14. Segall, L.: Ashley Madison: Life after the hack. CNN Tech (2017). http://money.cnn.com/mostly-human/click-swipe-cheat/

  15. Kwon, D.: Google’s DeepMind, UK’s NHS criticized for sharing data. The Scientist (2017). http://www.the-scientist.com/?articles.view/articleNo/49812/title/Google-s-DeepMind–UK-s-NHS-Criticized-for-Sharing-Data/

  16. Kaplan, B.: How should health data be used? Camb. Q. Healthc. Ethics CQ Int. J. Healthc. Ethics Committees 25(2), 312 (2016)

    Article  Google Scholar 

  17. Dryad Digital Repository (2017). http://datadryad.org/

  18. Liu, X., Li, X., Motiwalla, L., Li, W., Zheng, H., Franklin, P.D.: Preserving patient privacy when sharing same-disease data. J. Data Inf. Qual. 7(4), 17 (2016). https://doi.org/10.1145/2956554

    Google Scholar 

  19. El Emam, K., et al.: De-identification methods for open health data: the case of the heritage health prize claims dataset. J. Med. Internet Res. 14(1), e33 (2012)

    Article  Google Scholar 

  20. Raymond, E.: The cathedral and the bazaar. Philos. Technol. 12(3), 23 (1999). https://monoskop.org/File:Raymond_Eric_S_The_Cathedral_and_the_Bazaar_rev_ed.pdf

    Google Scholar 

  21. Atkins, D.E., National Science Foundation (U.S.): Blue-ribbon advisory panel on cyberinfrastructure. Revolutionizing Science and Engineering Through Cyberinfrastructure: Report of the National Science Foundation Blue-Ribbon Advisory Panel on Cyberinfrastructure (2003)

    Google Scholar 

  22. Hey, T., Trefethen, A.: The Data deluge: an e-science perspective. In: Berman, F., Fox, G., Hey, T. (eds.) Grid Computing: Making the Global Infrastructure a Reality, pp. 809–824. Wiley, Chichester (2003). https://doi.org/10.1002/0470867167.ch36

    Chapter  Google Scholar 

  23. Hey, T., Tansley, S., Tolle, K.: The Fourth Paradigm: Data-Intensive Scientific Research. Microsoft Research, Redmond, WA (2009)

    Google Scholar 

  24. Greenberg, J., Grabus, S., Hudson, F., Kraska, T., Madden, S., Bastón, R.: The Northeast Big Data Hub: “Enabling Seamless Data Sharing in Industry and Academia” Workshop. The Northeast Big Data Innovation Hub, Philadelphia (2016). https://doi.org/10.17918/D8159V

    Google Scholar 

  25. Zhang, Y., Wildemuth, B.M.: Qualitative analysis of content, applications of social research methods to questions in information and library science. Google Scholar, pp. 308–319 (2005)

    Google Scholar 

  26. Crosas, M.: The DataTags system: sharing sensitive data with confidence. In: RDA 8th Plenary, Denver Colorado, 16 September 2016 (2016). https://scholar.harvard.edu/mercecrosas/presentations/datatags-system-sharing-sensitive-data-confidence

  27. Barth, A., Datta, A., Mitchell, J.C., Nissenbaum, H.: Privacy and contextual integrity: framework and applications. In: 2006 IEEE Symposium on Security and Privacy, 15-pp. IEEE, May 2006

    Google Scholar 

  28. Blake, K., Collins, M.: Controlling chaos: management of electronic journal holdings in an academic library environment. Serials Rev. 36(4), 242–250 (2010)

    Article  Google Scholar 

  29. Higgins, D., Berkley, C., Jones, M.B.: Managing heterogeneous ecological data using morpho. In: The 14th International Conference on Scientific and Statistical Database Management, pp. 69–76 (2002). https://doi.org/10.1109/SSDM.2002.1029707

Download references

Acknowledgements

We acknowledge the support of the National Science Foundation/IIS/BD Spokes/Award #1636788; and thank Sam Maddden (MIT), Carsten Binnig (TU Darmstadt), and Tim Kraska (Brown University), and other individuals who provided us with data sharing agreements.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sam Grabus .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Grabus, S., Greenberg, J. (2017). Toward a Metadata Framework for Sharing Sensitive and Closed Data: An Analysis of Data Sharing Agreement Attributes. In: Garoufallou, E., Virkus, S., Siatri, R., Koutsomiha, D. (eds) Metadata and Semantic Research. MTSR 2017. Communications in Computer and Information Science, vol 755. Springer, Cham. https://doi.org/10.1007/978-3-319-70863-8_29

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-70863-8_29

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-70862-1

  • Online ISBN: 978-3-319-70863-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics