Skip to main content

An Investigation of Covid-19 Papers for a Content-Based Recommendation System

  • Conference paper
  • First Online:
Advances on P2P, Parallel, Grid, Cloud and Internet Computing (3PGCIC 2021)

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 343))

Abstract

The proliferation of scientific publications is a well-known phenomenon that was recently emphasized by the publications related to the Covid-19. The number of publications Covid-19 related that PubMed added in the period between January 17 and April 18, 2020 kept rising until it reached a number of 300 publications added in a single day. There are obvious issues related to this phenomenon, such as the difficulty for researchers to find papers strongly related to their applications. When searching for related papers, there could be issues with how a paper is preferred with respect to another. A paper could be recommended based on the greater number of citations, or on the connections between authors, that is as well related to the number of citations. For such reasons, the aim of this study is to build a recommendation system based exclusively on the abstracts of these publications. We provide a comparison between classical approaches—NLP-based such as TF-IDF and n-grams—and Deep Learning approaches for content-based recommendation systems, such as Transformers. We also provide an application to graphs that shows the relationships among related papers on the basis of the results obtained from the recommendation system developed.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Michels, C., Schmoch, U.: The growth of science and database coverage. In: Scientometrics 93.3, 831–846 (2012)

    Google Scholar 

  2. Kousha, K., Thelwall, M.: COVID-19 publications: database coverage, citations, readers, tweets, news, Facebook walls, Reddit posts. Quantitative Sci. Stud. 1(3), 1068–1091 (2020). ISSN: 2641–3337. https://doi.org/10.1162/qss_a_00066.eprint: https://direct.mit.edu/qss/article-pdf/1/3/1068/1870075/qss_a_00066.pdf, https://doi.org/10.1162/qss%5C_a%5C_00066

  3. Pazzani, M.J., Billsus, D.: Content-based recommendation systems, pp. 325–341 (2007)

    Google Scholar 

  4. Schafer, J.B., et al.: Collaborative filtering recommender systems, pp. 291–324 (2007)

    Google Scholar 

  5. Falagas, M.E., et al.: Comparison of PubMed, scopus, web of science, and google scholar: strengths and weaknesses. FASEB J. 22.2, 338–342 (2007)

    Google Scholar 

  6. Amato, F., Moscato, V., Picariello, A., Sperli’ı, G.: Recommendation in social media networks. In: 2017 IEEE Third International Conference on Multimedia Big Data (BigMM). IEEE, pp. 213–216 (2017)

    Google Scholar 

  7. Amato, F., Moscato, V., Picariello, A., Sperli’ı, G.: Extreme events management using multimedia social networks. Future Gener. Comput. Syst. 94, 444–452 (2019)

    Google Scholar 

  8. Amato, F., et al.: A model for verification and validation of law compliance of smart-contracts in IoT environment. IEEE Trans. Ind. Informat. 17(11), 7752–7759 (2021)

    Google Scholar 

  9. Ferguson, C., et al.: Europe PMC in 2020. Nucleic Acids Res. 49.D1, D1507–D1514 (2021). ISSN: 0305–1048. https://europepmc.org/articles/PMC7778976. https://doi.org/10.1093/nar/gkaa994

  10. Wang, W., et al.: MiniLM: Deep Self-Attention Distillation for Task- Agnostic Compression of Pre-Trained Transformers (2020). arXiv: 2002. 10957 [cs.CL]

  11. Vaswani, A., et al.: Attention is All You Need (2017). arXiv: 1706.03762 [cs.CL]

  12. Bastian, M., Heymann, S., Jacomy, M.: Gephi: an open source software for exploring and manipulating networks. In: Third International AAAI Conference on Weblogs and Social Media (2009)

    Google Scholar 

  13. Balzano, M., et al.: Smart destination-based parking for the optimization of waiting time. In: Workshops of the International Conference on Advanced Information Networking and Applications, pp. 1019–1027. Springer (2020). https://doi.org/10.1007/978-3-030-44038-1

  14. Balzano, W.: Network signal comparison through waves parameters: a local-alignment-based approach. In: 2019 IEEE International Symposium on Measurements & Networking (M&N), pp. 1–6. IEEE (2019)

    Google Scholar 

  15. Arif, M., et al.: Integration of 5G, VANETs and blockchain technology. In: 2020 IEEE 19th International Conference on Trust, Security and Privacy in Computing and Communications (TrustCom), pp. 2007–2013. IEEE (2020)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Barolli, L., Di Cicco, F., Fonisto, M. (2022). An Investigation of Covid-19 Papers for a Content-Based Recommendation System. In: Barolli, L. (eds) Advances on P2P, Parallel, Grid, Cloud and Internet Computing. 3PGCIC 2021. Lecture Notes in Networks and Systems, vol 343. Springer, Cham. https://doi.org/10.1007/978-3-030-89899-1_16

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-89899-1_16

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-89898-4

  • Online ISBN: 978-3-030-89899-1

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics