Abstract
Digital Object Identifiers (DOIs) are regarded as persistent; however, they are sometimes deleted. Deleted DOIs are an important issue not only for persistent access to scholarly content but also for bibliometrics, because they may cause problems in correctly identifying scholarly articles. However, little is known about how much of deleted DOIs and what causes them. We identified deleted DOIs by comparing the datasets of all Crossref DOIs on two different dates, investigated the number of deleted DOIs in the scholarly content along with the corresponding document types, and analyzed the factors that cause deleted DOIs. Using the proposed method, 708,282 deleted DOIs were identified. The majority corresponded to individual scholarly articles such as journal articles, proceedings articles, and book chapters. There were cases of many DOIs assigned to the same content, e.g., retracted journal articles and abstracts of international conferences. We show the publishers and academic societies which are the most common in deleted DOIs. In addition, the top cases of single scholarly content with a large number of deleted DOIs were revealed. The findings of this study are useful for citation analysis and altmetrics, as well as for avoiding deleted DOIs.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Cornell University: New arXiv articles are now automatically assigned DOIs \(|\)arXiv.org blog (2022). https://blog.arxiv.org/2022/02/17/new-arxiv-articles-are-now-automatically-assigned-dois/
Crossref: January 2021 Public Data File from Crossref. Academic Torrents. https://doi.org/10.13003/gu3dqmjvg4
Crossref: Crossref Metadata API JSON Format (2021). https://github.com/CrossRef/rest-api-doc/blob/master/api_format.md
Crossref: Crossref REST API (2021). https://api.crossref.org/
Crossref: crossref.org : : crossref stats (2022). https://www.crossref.org/06members/53status.html
Farley, I.: Conflict report - Crossref (2020). https://www.crossref.org/documentation/reports/conflict-report/
Franceschini, F., Maisano, D., Mastrogiacomo, L.: Errors in DOI indexing by bibliometric databases. Scientometrics 102(3), 2181–2186 (2014). https://doi.org/10.1007/s11192-014-1503-4
Hendricks, G., Tkaczyk, D., Lin, J., Feeney, P.: Crossref: the sustainable source of community-owned scholarly metadata. Quantit. Sci. Stud. 1(1), 414–427 (2020). https://doi.org/10.1162/qss_a_00022
Himmelstein, D., Wheeler, K., Greene, C.: Metadata for all DOIs in Crossref: JSON MongoDB exports of all works from the Crossref API. figshare (2017). https://doi.org/10.6084/m9.figshare.4816720.v1
Kemp, J.: New public data file: 120+ million metadata records (2021). https://www.crossref.org/blog/new-public-data-file-120-million-metadata-records/
Kikkawa, J., Takaku, M., Yoshikane, F.: Dataset of the deleted DOIs extracted from the difference set between Crossref DOIs as of March 2017 and January 2021. Zenodo (2022). https://doi.org/10.5281/zenodo.6841257
Klein, M., Balakireva, L.: On the persistence of persistent identifiers of the scholarly web. In: Hall, M., Merčun, T., Risse, T., Duchateau, F. (eds.) TPDL 2020. LNCS, vol. 12246, pp. 102–115. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-54956-5_8
Klein, M., Balakireva, L.: An extended analysis of the persistence of persistent identifiers of the scholarly web. Int. J. Digit. Libr. 23(1), 5–17 (2021). https://doi.org/10.1007/s00799-021-00315-w
Levenshtein, V.I.: Binary codes capable of correcting deletions, insertions, and reversals. Sov. Phys.-Dokl. 10(8), 707–710 (1966)
Smulyan, S.: Defunct DOI - Crossref (2020). https://www.crossref.org/_deleted-doi/
Van de Sompel, H., Klein, M., Jones, S.M.: Persistent URIs must be used to be persistent. In: Proceedings of the 25th International Conference Companion on World Wide Web, WWW 2016 Companion, pp. 119–120. International World Wide Web Conferences Steering Committee (2016). https://doi.org/10.1145/2872518.2889352
The International DOI Foundation: Factsheet DOI Resolution Documentation - 4. Which RA? (2020). https://www.doi.org/factsheets/DOIProxy.html#whichra
Tkaczyk, D.: Double trouble with DOIs - Crossref (2020). https://www.crossref.org/blog/double-trouble-with-dois/
Zhu, J., Hu, G., Liu, W.: DOI errors and possible solutions for web of science. Scientometrics 118(2), 709–718 (2018). https://doi.org/10.1007/s11192-018-2980-7
Ziegler, A.: halostatue/diff-lcs: generate difference sets between Ruby sequences (2022). https://github.com/halostatue/diff-lcs
Acknowledgments
This work was partially supported by JSPS KAKENHI Grant Numbers JP21K21303, JP22K18147, JP20K12543, and JP21K12592. We would like to thank Editage (https://www.editage.com/) for the English language editing.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 Springer Nature Switzerland AG
About this paper
Cite this paper
Kikkawa, J., Takaku, M., Yoshikane, F. (2022). Analysis of the Deletions of DOIs. In: Silvello, G., et al. Linking Theory and Practice of Digital Libraries. TPDL 2022. Lecture Notes in Computer Science, vol 13541. Springer, Cham. https://doi.org/10.1007/978-3-031-16802-4_13
Download citation
DOI: https://doi.org/10.1007/978-3-031-16802-4_13
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-16801-7
Online ISBN: 978-3-031-16802-4
eBook Packages: Computer ScienceComputer Science (R0)