Skip to main content

Chemical-Gene Relation Extraction with Graph Neural Networks and BERT Encoder

  • Conference paper
  • First Online:
Book cover Proceedings of the ICR’22 International Conference on Innovations in Computing Research (ICR 2022)

Abstract

Identifying named entities and their interactions from biomedical texts is essential for medical research. In this work, we modify the graph neural networks for relation extraction (RIFRE) model for biomedical text, named Bio-RIFRE that jointly extracts chemical-gene named entities and their interactions from biomedical texts. This model (Bio-RIFRE) utilizes the heterogeneous graph neural network model to strengthen the representation of biomedical text and relation extraction. As a result, Bio-RIFRE achieves 3% better F1-score on the CHEMPROT dataset than the other joint entity-relation extraction models. In addition, The results show that in the overlapping cases, and when the number of relations in a sentence increases, the model is robust and achieves a better F1-score.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 189.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 249.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Kambar, M.E.Z.N., Nahed, P., Cacho, J.R.F., Lee, G., Cummings, J., Taghva, K.: Clinical text classification of Alzheimer’s drugs’ mechanism of action. In: Yang, X.-S., Sherratt, S., Dey, N., Joshi, A. (eds.) Proceedings of Sixth International Congress on Information and Communication Technology. LNNS, vol. 235, pp. 513–521. Springer, Singapore (2022). https://doi.org/10.1007/978-981-16-2377-6_48

    Chapter  Google Scholar 

  2. Zhao, S., Su, C., Lu, Z., Wang, F.: Recent advances in biomedical literature mining. Brief. Bioinform. 22(3), bbaa057 (2020). https://doi.org/10.1093/bib/bbaa057

    Article  Google Scholar 

  3. Krallinger, M., et al.: The CHEMDNER corpus of chemicals and drugs and its annotation principles. J. Cheminform. 7(1), 1–17 (2015)

    Article  Google Scholar 

  4. Krallinger, M., et al.: Overview of the BioCreative VI chemical-protein interaction track. In: Proceedings of the Sixth BioCreative Challenge Evaluation Workshop, vol. 1, pp. 141–146 (2017)

    Google Scholar 

  5. Zeng, X., Zeng, D., He, S., Liu, K., Zhao, J.: Extracting relational facts by an end-to-end neural model with copy mechanism. In: Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 506–514 (2018)

    Google Scholar 

  6. Kambar, M.E.Z.N., Esmaeilzadeh, A., Heidari, M.: A survey on deep learning techniques for joint named entities and relation extraction. In: 2022 IEEE World AI IoT Congress (AIIoT) (IEEE AIIOT 2022), Virtual, USA, June 2022

    Google Scholar 

  7. Warikoo, N., Chang, Y.-C., Hsu, W.-L.: LPTK: a linguistic pattern-aware dependency tree kernel approach for the BioCreative VI CHEMPROT task. Database 2018, bay108 (2018). https://doi.org/10.1093/database/bay108

  8. Lung, P.-Y., He, Z., Zhao, T., Yu, D., Zhang, J.: Extracting chemical-protein interactions from literature using sentence structure analysis and feature engineering. Database 2019 (2019)

    Google Scholar 

  9. Zuo, M., Zhang, Y.: A span-based joint model for extracting entities and relations of bacteria biotopes. Bioinformatics 38(1), 220–227 (2021)

    Article  Google Scholar 

  10. Zhang, Q., Liu, C., Chi, Y., Xie, X., Hua, X.: A multi-task learning framework for extracting bacteria biotope information. In: Proceedings of the 5th Workshop on BioNLP Open Shared Tasks, pp. 105–109 (2019)

    Google Scholar 

  11. Xiao, S., Song, M., et al.: A text-generated method to joint extraction of entities and relations. Appl. Sci. 9(18), 3795 (2019)

    Article  Google Scholar 

  12. Zhao, K., Xu, H., Cheng, Y., Li, X., Gao, K.: Representation iterative fusion based on heterogeneous graph neural network for joint entity and relation extraction. Knowl.-Based Syst. 219, 106888 (2021)

    Article  Google Scholar 

  13. Veličković, P., Cucurull, G., Casanova, A., Romero, A., Liò, P., Bengio, Y.: Graph attention networks. In: International Conference on Learning Representations (2018, accepted as poster). https://openreview.net/forum?id=rJXMpikCZ

  14. Miranda, A., Mehryary, F., Luoma, J., Pyysalo, S., Valencia, A., Krallinger, M.: Overview of DrugProt BioCreative VII track: quality evaluation and large scale text mining of drug-gene/protein relations. In: Proceedings of the Seventh BioCreative Challenge Evaluation Workshop (2021)

    Google Scholar 

  15. Wei, Z., Su, J., Wang, Y., Tian, Y., Chang, Y.: A novel cascade binary tagging framework for relational triple extraction. In: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pp. 1476–1488 (2020)

    Google Scholar 

  16. Lee, J., et al.: BioBERT: a pre-trained biomedical language representation model for biomedical text mining. Bioinformatics 36(4), 1234–1240 (2019). https://doi.org/10.1093/bioinformatics/btz682

    Article  Google Scholar 

  17. Devlin, J., Chang, M.-W., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018)

  18. Vaswani, A.: Attention is all you need. In: Advances in Neural Information Processing Systems, vol. 30 (2017)

    Google Scholar 

  19. Esmaeilzadeh, A., Taghva, K.: Text classification using Neural Network Language Model (NNLM) and BERT: an empirical comparison. In: Arai, K. (ed.) IntelliSys 2021. LNNS, vol. 296, pp. 175–189. Springer, Cham (2022). https://doi.org/10.1007/978-3-030-82199-9_12

    Chapter  Google Scholar 

  20. LeCun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. Proc. IEEE 86(11), 2278–2324 (1998)

    Article  Google Scholar 

  21. Malekzadeh, M., Hajibabaee, P., Heidari, M., Zad, S., Uzuner, O., Jones, J.H.: Review of graph neural network in text classification. In: 2021 IEEE 12th Annual Ubiquitous Computing, Electronics Mobile Communication Conference (UEMCON), pp. 0084–0091 (2021)

    Google Scholar 

  22. Peng, Y., Rios, A., Kavuluru, R., Lu, Z.: Extracting chemical-protein relations with ensembles of SVM and deep learning models. Database 2018, bay073 (2018). https://doi.org/10.1093/database/bay073

  23. Sun, C., Yang, Z., Wang, L., Zhang, Y., Lin, H., Wang, J.: MRC4BioER: joint extraction of biomedical entities and relations in the machine reading comprehension framework. J. Biomed. Inf. 125, 103956 (2022)

    Article  Google Scholar 

  24. Luo, L., Yang, Z., Cao, M., Wang, L., Zhang, Y., Lin, H.: A neural network-based joint learning approach for biomedical entity and relation extraction from biomedical literature. J. Biomed. Inf. 103, 103384 (2020)

    Article  Google Scholar 

  25. Zhang, Q., Liu, C., Chi, Y., Xie, X., Hua, X.: A multi-task learning framework for extracting bacteria biotope information. In: BioNLP-OST@EMNLP-IJNCLP 2019 - Proceedings of the 5th Workshop on BioNLP Open Shared Tasks, pp. 105–109 (2019)

    Google Scholar 

  26. Zheng, S., Wang, F., Bao, H., Hao, Y., Zhou, P., Xu, B.: Joint extraction of entities and relations based on a novel tagging scheme. arXiv preprint arXiv:1706.05075 (2017)

Download references

Acknowledgment

This research is based upon work funded by the National Science Foundation under Grant No. 1625677.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Mina Esmail Zadeh Nojoo Kambar .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Esmail Zadeh Nojoo Kambar, M., Esmaeilzadeh, A., Taghva, K. (2022). Chemical-Gene Relation Extraction with Graph Neural Networks and BERT Encoder. In: Daimi, K., Al Sadoon, A. (eds) Proceedings of the ICR’22 International Conference on Innovations in Computing Research. ICR 2022. Advances in Intelligent Systems and Computing, vol 1431. Springer, Cham. https://doi.org/10.1007/978-3-031-14054-9_17

Download citation

Publish with us

Policies and ethics