Skip to main content

BioBERT for Multiple Knowledge-Based Question Expansion and Biomedical Extractive Question Answering

  • Conference paper
  • First Online:
Computational Collective Intelligence (ICCCI 2024)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 14810))

Included in the following conference series:

  • 427 Accesses

Abstract

Seeking a relevant answer to a biomedical question became a daily activity not only for experts but also for patients. In this perspective, biomedical extractive Question Answering systems have witnessed a rapid progress especially with the emergence of pre-trained language models such as BERT and its biomedical variant BioBERT. Those systems aim to extract an answer to a given question from a biomedical context and rely on two principal components question processing and exact answer identification. Several pre-trained language models-based systems have been proposed and focused only on the second component. In this paper, we proposed a BioBERT-based question answering system which rests on a question expansion phase. The Latter intends to extract question terms synonyms, as expansion terms, from multiple knowledge resource MeSH and WordNet. Indeed, we used firstly BioBERT pre-training model as a representation model in the selection of relevant expansion MeSH and WordNet terms. Secondly, in the fine-tuning phase to perform the question answering task and identify the exact answer. The experimental results on BioASQ dataset highlight the interest of the BioBert-based question expansion phase.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

Notes

  1. 1.

    https://huggingface.co/monologg/biobert_v1.1_pubmed.

  2. 2.

    https://github.com/dmis-lab/BioBERT-v1.1.

  3. 3.

    https://github.com/dmis-lab/bioasq-biobert.

References

  1. Xu, G., Rong, W., Wang, Y., Ouyang, Y., Xiong, Z.: External features enriched model for biomedical question answering. BMC Bioinform. 22(1), 272 (2021)

    Article  Google Scholar 

  2. Mutabazi, E., Ni, J., Tang, G., Cao, W.: A review on medical textual question answering systems based on deep learning approaches. Appl. Sci. 11(12), 5456 (2021)

    Article  Google Scholar 

  3. Badugu, S., Manivannan, R.: A study on different closed domain question answering approaches. Int. J. Speech Technol. 23(2), 315–325 (2020)

    Article  Google Scholar 

  4. Esposito, M., Damiano, E., Minutolo, A., De Pietro, G., Fujita, H.: Hybrid query expansion using lexical resources and word embeddings for sentence retrieval in question answering. Inf. Sci. 514, 88–105 (2020)

    Article  Google Scholar 

  5. Cao, Y., et al.: AskHERMES: an online question answering system for complex clinical questions. J. Biomed. Inform. 44(2), 277–288 (2011)

    Article  Google Scholar 

  6. Sarrouti, M., El Alaoui, S.O.: SemBioNLQA: a semantic biomedical question answering system for retrieving exact and ideal answers to natural language questions. Artif. Intell. Med. 102, 101767 (2020)

    Article  Google Scholar 

  7. Sharma, S., Patanwala, H., Shah, M., Deulkar, K.: A survey of medical question answering systems. Int. J. Eng. Tech. Res. (IJETR) ISSN 3, 131–133 (2015)

    Google Scholar 

  8. Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018)

  9. Lee, J., et al.: BioBERT: a pre-trained biomedical language representation model for biomedical text mining. Bioinformatics 36(4), 1234–1240 (2020)

    Article  MathSciNet  Google Scholar 

  10. Dimitriadis, D., Tsoumakas, G.: Word embeddings and external resources for answer processing in biomedical factoid question answering. J. Biomed. Inform. 92, 103118 (2019)

    Article  Google Scholar 

  11. Mikolov, T., et al.: Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781 (2013)

  12. Pennington, J., Socher, R., Manning, C.D.: Glove: global vectors for word representation. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP) (2014)

    Google Scholar 

  13. Fan, Y.: Pre-training methods in information retrieval. Found. Trends® Inf. Retrieval 16(3), 178–317 (2022). https://doi.org/10.1561/1500000100

    Article  Google Scholar 

  14. Nentidis, A., Bougiatiotis, K., Krithara, A., Paliouras, G.: Results of the seventh edition of the bioasq challenge. In: Cellier, P., Driessens, K. (eds.) ECML PKDD 2019, vol. 1168, pp. 553–568. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-43887-6_51

    Chapter  Google Scholar 

  15. Fu, E.R., Djoko, R., Mansor, M., Slater, R.: BERT for question answering on bioasq. SMU Data Sci. Rev. 3(3), 3 (2020)

    Google Scholar 

  16. Das, B., Nirmala, S.J.: Improving healthcare question answering system by identifying suitable answers. In: 2022 IEEE 2nd Mysore Sub Section International Conference (MysuruCon), pp. 1–6. IEEE (2022)

    Google Scholar 

  17. Du, Y., Yan, J., Lu, Y., Zhao, Y., Jin, X.: Improving biomedical question answering by data augmentation and model weighting. IEEE/ACM Trans. Comput. Biol. Bioinf. 20(2), 1114–1124 (2022)

    Article  Google Scholar 

  18. Alzubi, J.A., Jain, R., Singh, A., Parwekar, P., Gupta, M.: COBERT: COVID-19 question answering system using BERT. Arab. J. Sci. Eng. 48(8), 11003–11013 (2023)

    Article  Google Scholar 

  19. Lewandowski, B., Morris, R., Paul, P.M., Slater, R.: Question answering with distilled BERT models: a case study for biomedical data. SMU Data Sci. Rev. 7(1), 9 (2023)

    Google Scholar 

  20. Kammoun, H., Gabsi, I., Amous, I.: Mesh-based semantic indexing approach to enhance biomedical information retrieval. Comput. J. 65(3), 516–536 (2022)

    Article  Google Scholar 

  21. Selmi, W., Kammoun, H., Amous, I.: Semantic-based hybrid query reformulation for biomedical information retrieval. Comput. J. 66(9), 2296–2316 (2023)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hager Kammoun .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Gabsi, I., Kammoun, H., Wederni, A., Amous, I. (2024). BioBERT for Multiple Knowledge-Based Question Expansion and Biomedical Extractive Question Answering. In: Nguyen, N.T., et al. Computational Collective Intelligence. ICCCI 2024. Lecture Notes in Computer Science(), vol 14810. Springer, Cham. https://doi.org/10.1007/978-3-031-70816-9_16

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-70816-9_16

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-70815-2

  • Online ISBN: 978-3-031-70816-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics