Abstract
Deep learning for natural language processing acquires dense vector representations for n-grams from large-scale unstructured corpora. Converting static embeddings of n-grams into a dataset of interlinked concepts with explicit contextual semantic dependencies provides the foundation to acquire reusable knowledge. However, the validation of this knowledge requires cross-checking against ground truths that may be unavailable in an actionable or computable form. This paper presents a novel approach from the new field of explainable active learning that combines methods for learning static embeddings (word2vec models) with methods for learning dynamic contextual embeddings (transformer-based BERT models). We created a dataset for named entity recognition (NER) and relation extraction (REX) for the Coronavirus Disease 2019 (COVID-19). The COVID-19 dataset has 2,212 associations captured by 11 word2vec models, with additional examples of use from the biomedical literature. We propose interpreting the NER and REX tasks for COVID-19 as Question Answering (QA), incorporating general medical knowledge within the question, e.g. “does ‘cough’ (n-gram) belong to ‘clinical presentation/symptoms’ for COVID-19?”. We evaluated biomedical-specific pre-trained language models (BioBERT, SciBERT, ClinicalBERT, BlueBERT, and PubMedBERT) versus general-domain pre-trained language models (BERT and RoBERTa) for transfer learning with the COVID-19 dataset, i.e. task-specific fine-tuning considering NER as a sequence-level task. Using 2,060 QA pairs for training (associations from 10 word2vec models) and 152 QA pairs for validation (associations from 1 word2vec model), BERT obtained an F-measure of 87.38%, with precision = 93.75% and recall = 81.82%. SciBERT achieved the highest F-measure of 94.34%, with precision = 98.04% and recall = 90.91%.
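As a concrete illustration of the QA formulation and sequence-level fine-tuning described above, the following minimal sketch (not the authors' implementation) renders a word2vec association as a yes/no question and runs one fine-tuning step with a BERT-style classifier via the Hugging Face transformers library. The model checkpoint, question template, toy examples, and hyperparameters are illustrative assumptions rather than values from the paper.

# Minimal sketch: frame an n-gram/category association as a yes/no question and
# fine-tune a BERT-style model as a sequence-level binary classifier.
# Checkpoint, template, toy examples, and hyperparameters are assumptions.
import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

MODEL_NAME = "allenai/scibert_scivocab_uncased"  # swap for BioBERT, BERT, RoBERTa, etc.
tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
model = AutoModelForSequenceClassification.from_pretrained(MODEL_NAME, num_labels=2)

def make_question(ngram: str, category: str) -> str:
    # Hypothetical template mirroring the example question in the abstract.
    return f"does '{ngram}' belong to '{category}' for COVID-19?"

# Toy QA pairs: label 1 = the association holds, 0 = it does not.
examples = [
    (make_question("cough", "clinical presentation/symptoms"), 1),
    (make_question("cough", "diagnostic tests"), 0),
]

encodings = tokenizer([q for q, _ in examples], truncation=True,
                      padding=True, return_tensors="pt")
labels = torch.tensor([label for _, label in examples])

# One illustrative optimisation step; a real run would iterate over the
# 2,060 training QA pairs for several epochs and evaluate on the 152
# validation QA pairs with precision, recall, and F-measure.
optimizer = torch.optim.AdamW(model.parameters(), lr=2e-5)
model.train()
loss = model(**encodings, labels=labels).loss
loss.backward()
optimizer.step()

The reported F-measure is the harmonic mean of precision and recall; for example, the SciBERT figures give 2 × 0.9804 × 0.9091 / (0.9804 + 0.9091) ≈ 0.9434, matching the 94.34% quoted above.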
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Arguello-Casteleiro, M. et al. (2021). Named Entity Recognition and Relation Extraction for COVID-19: Explainable Active Learning with Word2vec Embeddings and Transformer-Based BERT Models. In: Bramer, M., Ellis, R. (eds) Artificial Intelligence XXXVIII. SGAI-AI 2021. Lecture Notes in Computer Science, vol 13101. Springer, Cham. https://doi.org/10.1007/978-3-030-91100-3_14
DOI: https://doi.org/10.1007/978-3-030-91100-3_14
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-91099-0
Online ISBN: 978-3-030-91100-3