Natural Language Processing for Scientific Paper Evaluation: Comparing Human and Machine Judgements

  • Conference paper
HCI International 2022 – Late Breaking Posters (HCII 2022)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1655))

Abstract

A huge number of papers have been published about COVID-19, so many that the volume is overwhelming. Many papers appear on preprint servers such as arXiv before publication. Researchers and clinicians can get ahead of the curve by making use of these preprints, but how can they tell what is worth reading? Could there be an automated recommendation mechanism? In this paper we address the question by experimenting with SPECTER, a document-level vector embedding that builds its representations on state-of-the-art Transformer models such as SciBERT, a BERT variant tailored to scientific text. We apply the SPECTER embedding to the CORD-19 dataset.
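The abstract does not say how the embeddings are turned into recommendations. As a minimal sketch of one common approach, and not necessarily the authors' method, candidate preprints can be ranked by cosine similarity between their document vectors and the vector of a paper already known to be relevant. The function names, paper identifiers, and toy 3-dimensional vectors below are all hypothetical; real SPECTER embeddings have far more dimensions.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0  # convention chosen here: a zero vector matches nothing
    return dot / (norm_u * norm_v)


def rank_by_similarity(query, candidates):
    """Return (paper_id, score) pairs sorted from most to least similar."""
    scored = [(pid, cosine_similarity(query, vec)) for pid, vec in candidates.items()]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Toy vectors standing in for document embeddings (hypothetical values).
query_embedding = [1.0, 0.0, 1.0]
candidate_embeddings = {
    "preprint_a": [0.9, 0.1, 1.1],
    "preprint_b": [0.0, 1.0, 0.0],
    "preprint_c": [1.0, 1.0, 0.0],
}

ranking = rank_by_similarity(query_embedding, candidate_embeddings)
for paper_id, score in ranking:
    print(f"{paper_id}: {score:.3f}")
```

In this toy data, `preprint_a` points in almost the same direction as the query and ranks first, while `preprint_b` is orthogonal to it and ranks last. Cosine similarity is used because it ignores vector length and compares direction only, a frequent choice for comparing text embeddings.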

This work was supported by JST (JPMJMS2033). The last author would like to thank the Advanced Telecommunications Research Institute for hosting his research visit.

Acknowledgment

The authors are grateful to Ryohei Sasano for his help with the experimental part of this work.

Author information

Corresponding author

Correspondence to Tom Xu.

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Cite this paper

Xu, T., Hinton, N., Bennett, M.T., Maruyama, Y. (2022). Natural Language Processing for Scientific Paper Evaluation: Comparing Human and Machine Judgements. In: Stephanidis, C., Antona, M., Ntoa, S., Salvendy, G. (eds) HCI International 2022 – Late Breaking Posters. HCII 2022. Communications in Computer and Information Science, vol 1655. Springer, Cham. https://doi.org/10.1007/978-3-031-19682-9_90

  • DOI: https://doi.org/10.1007/978-3-031-19682-9_90

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-19681-2

  • Online ISBN: 978-3-031-19682-9

  • eBook Packages: Computer Science (R0)