Skip to main content
Log in

Leveraging deep learning for automatic literature screening in intelligent bibliometrics

  • Original Article
  • Published:
International Journal of Machine Learning and Cybernetics Aims and scope Submit manuscript

Abstract

Intelligent bibliometrics, by providing sufficient statistical information based on large-scale literature data analytics, is promising for understanding innovative pathways, addressing meaningful insights with the assistance of expert knowledge, and indicating key areas of scientific inquiry. However, the exponential growth of global scientific publication output in most areas of modern science makes it extremely difficult and labor-intensive to analyze literature in large volumes. This study aims to accelerate intelligent bibliometrics-driven literature analysis by leveraging deep learning for automatic literature screening. The comparison of different machine learning algorithms for the automatic classification of literature regarding relevance to a given research topic reveals the outstanding performance of deep learning. This study also compares different features as model input and provides suggestions about training dataset size. By leveraging deep learning’s abilities in predictive and big data analytics, this study makes contributions to intelligent bibliometrics by promoting literature screening and is promising to track technological changes and scientific evolutionary pathways.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10

Similar content being viewed by others

Data availability

The datasets used in this study are available from the corresponding author upon reasonable request.

Notes

  1. https://fasttext.cc/docs/en/english-vectors.html

References

  1. Graham S, Depp C, Lee EE et al (2019) Artificial intelligence for mental health and mental illnesses: an overview. Curr Psychiatry Rep 21:116

    Google Scholar 

  2. Chen X, Xie H, Cheng G et al (2020) Trends and features of the applications of natural language processing techniques for clinical trials text analysis. Appl Sci 10:2157

    Google Scholar 

  3. Balakrishnan N, Rajendran A, Palanivel K (2019) Meticulous fuzzy convolution C means for optimized big data analytics: adaptation towards deep learning. Int J Mach Learn Cybern 10:3575–3586

    Google Scholar 

  4. Rowley J, Slack F (2004) Conducting a literature review. Manag Res news 27:31–39

    Google Scholar 

  5. Hart C (1998) Reviewing and the research imagination: doing a literature review. Sage, London

    Google Scholar 

  6. Webster J, Watson RT (2002) Analyzing the past to prepare for the future: Writing a literature review. MIS Q xiii–xxiii

  7. Cronin P, Ryan F, Coughlan M (2008) Undertaking a literature review: a step-by-step approach. Br J Nurs 17:38–43

    Google Scholar 

  8. Zhang Y, Wu M, Hu Z et al (2021) Profiling and predicting the problem-solving patterns in china’s research systems: a methodology of intelligent bibliometrics and empirical insights. Quant Sci Stud 2:409–432

    Google Scholar 

  9. Vom Brocke J, Simons A, Riemer K et al (2015) Standing on the shoulders of giants: challenges and recommendations of literature search in information systems research. Commun Assoc Inf Syst 37:9

    Google Scholar 

  10. Cobo MJ, López-Herrera AG, Herrera-Viedma E, Herrera F (2012) SciMAT: a new science mapping analysis software tool. J Am Soc Inf Sci Technol 63:1609–1630

    Google Scholar 

  11. Åström F, Danell R, Larsen B, Schneider J (2009) Celebrating scholarly communication studies: A Festschrift for Olle Persson at his 60th Birthday. ISSI

  12. Van Eck N, Waltman L (2010) Software survey: VOSviewer, a computer program for bibliometric mapping. Scientometrics 84:523–538

    Google Scholar 

  13. Chen C (2006) CiteSpace II: Detecting and visualizing emerging trends and transient patterns in scientific literature. J Am Soc Inf Sci Technol 57:359–377

    Google Scholar 

  14. Van Eck NJ, Waltman L (2014) CitNetExplorer: a new software tool for analyzing and visualizing citation networks. J Informetr 8:802–823

    Google Scholar 

  15. Team S (2009) Sci2 Tool: A Tool for Science of Science Research and Practice. https://sci2.cns.iu.edu.

  16. Bastian M, Heymann S, Jacomy M (2009) Gephi: an open source software for exploring and manipulating networks. In: Proceedings of the International AAAI Conference on Web and Social Media (Volume 3), pp 361–362. Retrieved from https://ojs.aaai.org/index.php/ICWSM/article/view/13937

  17. Grauwin S, Jensen P (2011) Mapping scientific institutions. Scientometrics 89:943–954

    Google Scholar 

  18. Belter CW (2016) Citation analysis as a literature search method for systematic reviews. J Assoc Inf Sci Technol 67:2766–2777

    Google Scholar 

  19. Hearst MA (1999) Untangling text data mining. In: Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics. Association for Computational Linguistics, pp 3–10

  20. Raudys S, Pikelis V (1980) On dimensionality, sample size, classification error, and complexity of classification algorithm in pattern recognition. IEEE Trans Pattern Anal Mach Intell 2:242–252

    MATH  Google Scholar 

  21. Taha A, Cosgrave B, Mckeever S (2022) Using feature selection with machine learning for generation of insurance insights. Appl Sci 12:3209

    Google Scholar 

  22. Langley P, Iba W (1993) Average-case analysis of a nearest neighbor algorithm. In: IJCAI. Citeseer, p 889

  23. Saarela M, Jauhiainen S (2021) Comparison of feature importance measures as explanations for classification models. SN Appl Sci 3:1–12

    Google Scholar 

  24. Kwon O, Sim JM (2013) Effects of data set features on the performances of classification algorithms. Expert Syst Appl 40:1847–1857

    Google Scholar 

  25. Guyon I, Elisseeff A (2003) An introduction to variable and feature selection. J Mach Learn Res 3:1157–1182

    MATH  Google Scholar 

  26. Peng Y, Wang G, Kou G, Shi Y (2011) An empirical study of classification algorithm evaluation for financial risk prediction. Appl Soft Comput 11:2906–2915

    Google Scholar 

  27. Althnian A, AlSaeed D, Al-Baity H et al (2021) Impact of dataset size on classification performance: an empirical evaluation in the medical domain. Appl Sci 11:796

    Google Scholar 

  28. Prusa J, Khoshgoftaar TM, Seliya N (2015) The effect of dataset size on training tweet sentiment classifiers. In: 2015 IEEE 14th International Conference on Machine Learning and Applications (ICMLA). IEEE, pp 96–102

  29. Rahman MS, Sultana M (2017) Performance of Firth-and logF-type penalized methods in risk prediction for small or sparse binary data. BMC Med Res Methodol 17:1–15

    Google Scholar 

  30. Collins JA, Fauser BCJM (2005) Balancing the strengths of systematic and narrative reviews. Oxford University Press, Oxford

    Google Scholar 

  31. Boell SK, Cecez-Kecmanovic D On Being ‘Systematic’in Literature Reviews in IS. In Formulating Research Methods for Information Systems. Springer, pp 8–78

  32. Bernardo WM, Nobre MRC, Jatene FB (2004) Evidence based clinical practice: part II-searching evidence databases. Rev Assoc Med Bras 50:104–108

    Google Scholar 

  33. Parahoo K (2006) Nursing research: principles, process and issues. Bloomsbury Publishing, London

    Google Scholar 

  34. Dunn TJ, Kennedy M (2019) Technology enhanced learning in higher education; motivations, engagement and academic achievement. Comput Educ 137:104–113

    Google Scholar 

  35. Xie H, Chu H-C, Hwang G-J, Wang C-C (2019) Trends and development in technology-enhanced adaptive/personalized learning: a systematic review of journal publications from 2007 to 2017. Comput Educ 140:103599

    Google Scholar 

  36. Ramos-Rodríguez A, Ruíz-Navarro J (2004) Changes in the intellectual structure of strategic management research: a bibliometric study of the Strategic Management Journal, 1980–2000. Strateg Manag J 25:981–1004

    Google Scholar 

  37. Gimenez E, Salinas M, Manzano-Agugliaro F (2018) Worldwide research on plant defense against biotic stresses as improvement for sustainable agriculture. Sustainability 10:391

    Google Scholar 

  38. Chen X, Xie H, Wang FL et al (2018) A bibliometric analysis of natural language processing in medical research. BMC Med Inform Decis Mak 18:1–14

    Google Scholar 

  39. Song Y, Chen X, Hao T et al (2019) Exploring two decades of research on classroom dialogue by using bibliometric analysis. Comput Educ 137:12–31

    Google Scholar 

  40. Howard BE, Phillips J, Miller K et al (2016) SWIFT-Review: a text-mining workbench for systematic review. Syst Rev 5:1–16

    Google Scholar 

  41. Scells H, Zuccon G, Koopman B, et al (2017) A test collection for evaluating retrieval of studies for inclusion in systematic reviews. In: Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval. pp 1237–1240

  42. Shojania KG, Sampson M, Ansari MT et al (2007) How quickly do systematic reviews go out of date? A survival analysis. Ann Intern Med 147:224–233

    Google Scholar 

  43. Zhang Y, Liang S, Feng Y et al (2022) Automation of literature screening using machine learning in medical evidence synthesis: a diagnostic test accuracy systematic review protocol. Syst Rev 11:1–7

    Google Scholar 

  44. Lee S, Kim D, Lee K et al (2016) BEST: next-generation biomedical entity search tool for knowledge discovery from biomedical literature. PLoS ONE 11:e0164680

    Google Scholar 

  45. Petticrew M, Roberts H (2008) Systematic reviews in the social sciences: a practical guide. John Wiley & Sons, New York

    Google Scholar 

  46. Kho ME, Brouwers MC (2012) The systematic review and bibliometric network analysis (SeBriNA) is a new method to contextualize evidence. Part 1: description. J Clin Epidemiol 65:1010–1015

    Google Scholar 

  47. Robinson KA, Dunn AG, Tsafnat G, Glasziou P (2014) Citation networks of related trials are often disconnected: implications for bidirectional citation searches. J Clin Epidemiol 67:793–799

    Google Scholar 

  48. Bernstam EV, Herskovic JR, Aphinyanaphongs Y et al (2006) Using citation data to improve retrieval from MEDLINE. J Am Med Informatics Assoc 13:96–105

    Google Scholar 

  49. Bunn F, Trivedi D, Alderson P et al (2014) The impact of Cochrane systematic reviews: a mixed method evaluation of outputs from Cochrane Review Groups supported by the UK National Institute for Health Research. Syst Rev 3:125

    Google Scholar 

  50. Royle P, Kandala N-B, Barnard K, Waugh N (2013) Bibliometrics of systematic reviews: analysis of citation rates and journal impact factors. Syst Rev 2:74

    Google Scholar 

  51. O’Mara-Eves A, Brunton G, McDaid D et al (2014) Techniques for identifying cross-disciplinary and ‘hard-to-detect’evidence for systematic review. Res Synth Methods 5:50–59

    Google Scholar 

  52. Shemilt I, Simon A, Hollands GJ et al (2014) Pinpointing needles in giant haystacks: use of text mining to reduce impractical screening workload in extremely large scoping reviews. Res Synth Methods 5:31–49

    Google Scholar 

  53. Adeva JJG, Atxa JMP, Carrillo MU, Zengotitabengoa EA (2014) Automatic text classification to support systematic reviews in medicine. Expert Syst Appl 41:1498–1508

    Google Scholar 

  54. Yu Z, Menzies T (2019) FAST2: an intelligent assistant for finding relevant papers. Expert Syst Appl 120:57–71

    Google Scholar 

  55. van Dinter R, Catal C, Tekinerdogan B (2021) A decision support system for automating document retrieval and citation screening. Expert Syst Appl 182:115261

    Google Scholar 

  56. Colón-Ruiz C, Segura-Bedmar I (2020) Comparing deep learning architectures for sentiment analysis on drug reviews. J Biomed Inform 110:103539

    Google Scholar 

  57. Kontonatsios G, Spencer S, Matthew P, Korkontzelos I (2020) Using a neural network-based feature extraction method to facilitate citation screening for systematic reviews. Expert Syst with Appl X 6:100030

    Google Scholar 

  58. Ros R, Bjarnason E, Runeson P (2017) A machine learning approach for semi-automated search and selection in literature studies. In: Proceedings of the 21st International Conference on Evaluation and Assessment in Software Engineering. pp 118–127

  59. Watanabe WM, Felizardo KR, Candido A Jr et al (2020) Reducing efforts of software engineering systematic literature reviews updates using text classification. Inf Softw Technol 128:106395

    Google Scholar 

  60. Xiong Z, Liu T, Tse G et al (2018) A machine learning aided systematic review and meta-analysis of the relative risk of atrial fibrillation in patients with diabetes mellitus. Front Physiol 9:835

    Google Scholar 

  61. Timsina P, Liu J, El-Gayar O, Shang Y (2016) Using semi-supervised learning for the creation of medical systematic review: An exploratory analysis. In: 2016 49th Hawaii International Conference on System Sciences (HICSS). IEEE, pp 1195–1203

  62. Yu Z, Kraft NA, Menzies T (2018) Finding better active learners for faster literature reviews. Empir Softw Eng 23:3161–3186

    Google Scholar 

  63. Wang D, Weisz JD, Muller M, et al (2019) Human-AI collaboration in data science: Exploring data scientists’ perceptions of automated AI. Proc ACM Human-Computer Interact, pp 1–14.

  64. Oussous A, Benjelloun FZ, Ait Lahcen A, Belfkih S (2018) Big Data technologies: a survey. J King Saud Univ Comput Inf Sci 30:431–448

    Google Scholar 

  65. Kim B, Yoo M, Park KC et al (2021) A value of civic voices for smart city: a big data analysis of civic queries posed by Seoul citizens. Cities 108:102941

    Google Scholar 

  66. Ha T, Beijnon B, Kim S et al (2017) Examining user perceptions of smartwatch through dynamic topic modeling. Telemat Informat 34:1262–1273

    Google Scholar 

  67. Barnett GA, Ruiz JB, Xu WW et al (2017) The world is not flat: evaluating the inequality in global information gatekeeping through website co-mentions. Technol Forecast Soc Change 117:38–45

    Google Scholar 

  68. Barnett GA, Benefield GA (2017) Predicting international Facebook ties through cultural homophily and other factors. New Media Soc 19:217–239

    Google Scholar 

  69. Cheah S, Wang S (2017) Big data-driven business model innovation by traditional industries in the Chinese economy. J Chinese Econ Foreign Trade Stud 10:229–251

    Google Scholar 

  70. Lewis DD (1998) Naive (Bayes) at forty: The independence assumption in information retrieval. In: European conference on machine learning. Springer, pp 4–15

  71. McCallum A, Nigam K (1998) A comparison of event models for naive bayes text classification. In: AAAI-98 workshop on learning for text categorization. Citeseer, pp 41–48

  72. Moschitti A (2003) A study on optimal parameter tuning for Rocchio text classifier. In: Sebastiani F (ed) European Conference on Information Retrieval. Springer, Berlin, pp 420–435

    MATH  Google Scholar 

  73. Jabbar MA, Deekshatulu BL, Chndra P (2014) Alternating decision trees for early diagnosis of heart disease. In: International Conference on Circuits, Communication, Control and Computing. IEEE, pp 322–328

  74. Ali J, Khan R, Ahmad N, Maqsood I (2012) Random forests and decision trees. Int J Comput Sci Issues 9:272

    Google Scholar 

  75. Fawagreh K, Gaber MM, Elyan E (2014) Random forests: from early developments to recent advancements. Syst Sci Control Eng An Open Access J 2:602–609

    Google Scholar 

  76. Roy K, Kar S, Das RN (2015) Understanding the basics of QSAR for applications in pharmaceutical sciences and risk assessment. Academic press, Cambridge

    Google Scholar 

  77. Socher R, Pennington J, Huang EH, et al (2011) Semi-supervised recursive autoencoders for predicting sentiment distributions. In: Proceedings of the conference on empirical methods in natural language processing. Association for Computational Linguistics, pp 151–161

  78. Iyyer M, Enns P, Boyd-Graber J, Resnik P (2014) Political ideology detection using recursive neural networks. In: Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp 1113–1122

  79. Kim Y (2014) Convolutional neural networks for sentence classification. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp 1746–1751

  80. Mikolov T, Grave E, Bojanowski P, et al (2017) Advances in pre-training distributed word representations. In: Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018). Retrieved from https://arxiv.org/pdf/1712.09405.pdf

  81. Iglesias LL, Bellón PS, del Barrio AP et al (2021) A primer on deep learning and convolutional neural networks for clinicians. Insights Imaging 12:1–11

    Google Scholar 

  82. Yih W, He X, Meek C (2014) Semantic parsing for single-relation question answering. In: Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), pp 643–648

  83. Kalchbrenner N, Grefenstette E, Blunsom P (2014) A convolutional neural network for modelling sentences. In: Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp 655–665.

  84. Shen Y, He X, Gao J, et al (2014) Learning semantic representations using convolutional neural networks for web search. In: Proceedings of the 23rd International Conference on World Wide Web. ACM, pp 373–374

  85. Collobert R, Weston J, Bottou L et al (2011) Natural language processing (almost) from scratch. J Mach Learn Res 12:2493–2537

    MATH  Google Scholar 

  86. Liu P, Qiu X, Huang X (2016) Recurrent neural network for text classification with multi-task learning. In: Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, pp 2873–2879.

  87. Golmohammadi M, Ziyabari S, Shah V, et al (2017) Gated recurrent networks for seizure detection. In: 2017 IEEE Signal Processing in Medicine and Biology Symposium, SPMB 2017—Proceedings. IEEE, pp 1–5.

  88. Cheng F, Zhao J (2019) A novel process monitoring approach based on feature points distance dynamic autoencoder. In: Computer Aided Chemical Engineering (Vol. 46). Elsevier, pp 757–762

  89. Hochreiter S, Schmidhuber J (1997) Long short-term memory. Neural Comput 9:1735–1780

    Google Scholar 

  90. Zhou C, Sun C, Liu Z, Lau F (2015) A C-LSTM neural network for text classification. Retrieved from https://doi.org/10.48550/arXiv.1511.08630

  91. García Adeva JJ, Pikatza Atxa JM (2007) Intrusion detection in web applications using text mining. Eng Appl Artif Intell 20:555–566

    Google Scholar 

  92. Hao T, Chen X, Song Y (2020) A topic-based bibliometric analysis of two decades of research on the application of technology in classroom dialogue. J Educ Comput Res 58:1311–1341

    Google Scholar 

  93. Chen X, Gao D, Lun Y, et al (2019) The Analysis of Worldwide Research on Artificial Intelligence Assisted User Modeling. In: International Symposium on Emerging Technologies for Education. Springer, pp 201–213

  94. Chen X, Zou D, Xie H et al (2022) A bibliometric analysis of game-based collaborative learning between 2000 and 2019. Int J Mob Learn Organ 16:20–51

    Google Scholar 

  95. Chen X, Zou D, Su F (2021) Twenty-five years of computer-assisted language learning: a topic modeling analysis. Lang Learn Technol 25:151–185

    Google Scholar 

  96. Yesir S, Soğukpinar İ (2021) Malware Detection and Classification Using fastText and BERT. In: 2021 9th International Symposium on Digital Forensics and Security (ISDFS). IEEE, pp 1–6

  97. Sia S, Dalmia A, Mielke SJ (2020) Tired of topic models? clusters of pretrained word embeddings make for fast and good topics too! In: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp 1728–1736.

  98. Oral B, Emekligil E, Arslan S, Eryiǧit G (2020) Information extraction from text intensive and visually rich banking documents. Inf Process Manag 57:102361

    Google Scholar 

  99. Dufter P, Kassner N, Schütze H (2021) Static Embeddings as Efficient Knowledge Bases? In: Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp 2353–2363

  100. Magge A, Tutubalina E, Miftahutdinov Z et al (2021) DeepADEMiner: a deep learning pharmacovigilance pipeline for extraction and normalization of adverse drug event mentions on Twitter. J Am Med Inform Assoc 28:2184–2192

    Google Scholar 

  101. Tawfik NS, Spruit MR (2020) Evaluating sentence representations for biomedical text: methods and experimental results. J Biomed Inform 104:103396

    Google Scholar 

  102. Immer A, Hennigen LT, Fortuin V, Cotterell R (2022) Probing as Quantifying Inductive Bias. In: Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp 1839–1851

  103. Balaji NNA, Bharathi B (2020) SSNCSE_NLP@ Fake news detection in the Urdu language (UrduFake) 2020. Health (Irvine Calif) 100:100

    Google Scholar 

  104. Zarate JMO de, Giovanni M Di, Feuerstein EZ, Brambilla M (2020) Measuring controversy in social networks through nlp. In: International Symposium on String Processing and Information Retrieval. Springer, pp 194–209

  105. Hennigen LT, Williams A, Cotterell R (2020) Intrinsic probing through dimension selection. In: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp 197–216.

  106. Liu Z, Winata GI, Fung P (2020) Zero-resource cross-domain named entity recognition. In: Proceedings of the 5th Workshop on Representation Learning for NLP, pp 1–6.

  107. Hofstätter S, Hanbury A (2019) Let’s measure run time! Extending the IR replicability infrastructure to include performance aspects. Retrieved from https://doi.org/10.48550/arXiv.1907.04614

  108. Islam KI, Islam MS, Amin MR (2020) Sentiment analysis in Bengali via transfer learning using multi-lingual BERT. In: 2020 23rd International Conference on Computer and Information Technology (ICCIT). IEEE, pp 1–5

  109. Kucukyilmaz T, Cambazoglu BB, Aykanat C, Can F (2008) Chat mining: predicting user and message attributes in computer-mediated communication. Inf Process Manag 44:1448–1466

    Google Scholar 

Download references

Acknowledgements

The research described in this article has been supported by a grant from the Research Grants Council of the Hong Kong Special Administrative Region, China (UGC/FDS16/E01/19), the Direct Grant (DR22A2) and the Faculty Research Grants (DB22B4 and DB22B7) of Lingnan University, Hong Kong, the One-off Special Fund from Central and Faculty Fund in Support of Research from 2019/20 to 2021/22 (MIT02/19-20), and Interdisciplinary Research Scheme of Dean’s Research Fund 2021/22 (FLASS/DRF/IDS-3) of The Education University of Hong Kong.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Haoran Xie.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Appendix

Appendix

See Tables 10, 11, 12, 13, 14, 15, 16, 17

Table 10 Performance of accuracy of different classification algorithms with title feature
Table 11 Performance of F1 of different classification algorithms with title feature
Table 12 Performance of accuracy of different classification algorithms with abstract feature
Table 13 Performance of F1 of different classification algorithms with abstract feature
Table 14 Performance of accuracy of different classification algorithms with all feature
Table 15 Performance of F1 of different classification algorithms with all feature
Table 16 Post Hoc Comparisons using Tukey’s Test with accuracy as dependent variable
Table 17 Post Hoc Comparisons using Tukey’s Test with F1-score as dependent variable

and Figs. 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22 

Fig. 11
figure 11

Accuracy and F1-score for classroom dialogue

Fig. 12
figure 12

Accuracy and F1-score for technology-enhanced classroom dialogue

Fig. 13
figure 13

Accuracy and F1-score for NLP-enhanced clinical trial

Fig. 14
figure 14

Accuracy and F1-score for game-based collaborative learning

Fig. 15
figure 15

Accuracy and F1-score for technology-enhanced language learning

Fig. 16
figure 16

Accuracy and F1-score for user modeling

Fig. 17
figure 17

Accuracy of different algorithms when using title information

Fig. 18
figure 18

F1-score of different algorithms when using title information

Fig. 19
figure 19

Accuracy of different algorithms when using abstract information

Fig. 20
figure 20

F1-score of different algorithms when using abstract information

Fig. 21
figure 21

Accuracy of different algorithms when using all information

Fig. 22
figure 22

F1-score of different algorithms when using all information

.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Chen, X., Xie, H., Li, Z. et al. Leveraging deep learning for automatic literature screening in intelligent bibliometrics. Int. J. Mach. Learn. & Cyber. 14, 1483–1525 (2023). https://doi.org/10.1007/s13042-022-01710-8

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s13042-022-01710-8

Keywords

Navigation