Skip to main content

Dead Science: Most Resources Linked in Biomedical Articles Disappear in Eight Years

  • Conference paper
  • First Online:
Information in Contemporary Society (iConference 2019)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 11420))

Included in the following conference series:

Abstract

Scientific progress critically depends on disseminating analytic pipelines and datasets that make results reproducible and replicable. Increasingly, researchers make resources available for wider reuse and embed links to them in their published manuscripts. Previous research has shown that these resources become unavailable over time but the extent and causes of this problem in open access publications has not been explored well. By using 1.9 million articles from PubMed Open Access, we estimate that half of all resources become unavailable after 8 years. We find that the number of times a resource has been used, the international (int) and organization (org) domain suffixes, and the number of affiliations are positively related to resources being available. In contrast, we found that the length of the URL, Indian (in), European Union (eu), and Chinese (cn) domain suffixes, and abstract length are negatively related to resources being available. Our results contribute to our understanding of resource sharing in science and provide some guidance to solve resource decay.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Koehler, W., et al.: A longitudinal study of web pages continued: a consideration of document persistence. Inf. Res. 9(2) (2004)

    Google Scholar 

  2. Habibzadeh, P.: Decay of references to web sites in articles published in general medical journals: mainstream vs small journals. Appl. Clin. Inf. 04(4), 455–464 (2013)

    Article  Google Scholar 

  3. Duda, J.J., Camp, R.J.: Ecology in the information age: patterns of use and attrition rates of internet-based citations in ESA journals, 1997–2005. Front. Ecol. Environ. 6(3), 145–151 (2008)

    Article  Google Scholar 

  4. Goh, D.H.-L., Ng, P.K.: Link decay in leading information science journals. J. Am. Soc. Inf. Sci. Technol. 58(1), 15–24 (2007)

    Article  Google Scholar 

  5. Klein, M., et al.: Scholarly context not found: one in five articles suffers from reference rot. PloS ONE 9(12), e115253 (2014)

    Article  Google Scholar 

  6. Jones, S.M., Van de Sompel, H., Shankar, H., Klein, M., Tobin, R., Grover, C.: Scholarly context adrift: three out of four URI references lead to changed content. PLoS ONE 11(12), e0167475 (2016)

    Article  Google Scholar 

  7. Mangul, S., et al.: A comprehensive analysis of the usability and archival stability of omics computational tools and resources. bioRxiv, p. 452532 (2018)

    Google Scholar 

  8. Collaboration, O.S., et al.: Estimating the reproducibility of psychological science. Science 349(6251), aac4716 (2015)

    Article  Google Scholar 

  9. Bonàs-Guarch, S., et al.: Re-analysis of public genetic data reveals a rare x-chromosomal variant associated with type 2 diabetes. Nature Commun. 9 (2018)

    Google Scholar 

  10. National Institutes of Health: Final NIH statement on sharing research data (2003). https://grants.nih.gov/grants/guide/notice-files/NOT-OD-03-032.html. Accessed 5 Dec 2018

  11. National Science Foundation: NSF data sharing policy (2017). https://www.nsf.gov/pubs/policydocs/pappguide/nsf13001/aag_6.jsp#VID4. Accessed 5 Dec 2018

  12. Van Horn, J.D., Gazzaniga, M.S.: Why share data? Lessons learned from the fMRIDC. NeuroImage 82, 677–682 (2013)

    Article  Google Scholar 

  13. Milham, M.P., et al.: Assessment of the impact of shared brain imaging data on the scientific literature. Nature commun. 9 (2018)

    Google Scholar 

  14. McCown, F., Chan, S., Nelson, M.L., Bollen, J.: The availability and persistence of web references in d-lib magazine. arXiv preprint cs/0511077 (2005)

    Google Scholar 

  15. Hennessey, J., Ge, S.X.: A cross disciplinary study of link decay and the effectiveness of mitigation techniques. BMC Bioinf. 14, S5 (2013)

    Article  Google Scholar 

  16. Zittrain, J., Albert, K., Lessig, L.: Perma: scoping and addressing the problem of link and reference rot in legal citations. Legal Inf. Manag. 14(2), 88–99 (2014)

    Google Scholar 

  17. Gourley, D., Totty, B., Sayer, M., Aggarwal, A., Reddy, S.: HTTP: The Definitive Guide. O’Reilly Media Inc. (2002)

    Google Scholar 

  18. Aronsky, D., Madani, S., Carnevale, R.J., Duda, S., Feyder, M.T.: The prevalence and inaccessibility of internet references in the biomedical literature at the time of publication. J. Am. Med. Inf. Assoc. 14(2), 232–234 (2007)

    Article  Google Scholar 

  19. Burton, R.E., Kebler, R.: The ‘half-life’ of some scientific and technical literatures. Am. Documentation 11(1), 18–22 (1960)

    Article  Google Scholar 

  20. Hastie, T., Tibshirani, R., Friedman, J.: The Elements of Statistical Learning. SSS. Springer, New York (2009). https://doi.org/10.1007/978-0-387-84858-7

    Book  MATH  Google Scholar 

  21. Zhou, K., Grover, C., Klein, M., Tobin, R.: No more 404s: predicting referenced link rot in scholarly articles for pro-active archiving. In: Proceedings of the 15th ACM/IEEE-CE on Joint Conference on Digital Libraries - JCDL 2015, pp. 233–236. ACM Press (2015)

    Google Scholar 

  22. Internet Archive: Wayback machine. https://archive.org/web/. Accessed 5 Dec 2018

  23. Eysenbach, G., Trudel, M.: Going, going, still there: using the webcite service to permanently archive cited web pages. J. Med. Internet Res. 7(5), e60 (2005)

    Article  Google Scholar 

  24. Eysenbach, G.: Preserving the scholarly record with webcite(r): an archiving system for long-term digital preservation of cited webpages. In: Proceedings ELPUB 2008 Conference on Electronic Publishing, pp. 378–389, Toronto, Canada (2008). www.webcitation.org

Download references

Acknowledgements

Tong Zeng was funded by the China Scholarship Council #201706190067. Daniel E. Acuna was funded by the National Science Foundation awards #1646763 and #1800956.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Daniel E. Acuna .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Zeng, T., Shema, A., Acuna, D.E. (2019). Dead Science: Most Resources Linked in Biomedical Articles Disappear in Eight Years. In: Taylor, N., Christian-Lamb, C., Martin, M., Nardi, B. (eds) Information in Contemporary Society. iConference 2019. Lecture Notes in Computer Science(), vol 11420. Springer, Cham. https://doi.org/10.1007/978-3-030-15742-5_16

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-15742-5_16

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-15741-8

  • Online ISBN: 978-3-030-15742-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics