Skip to main content
Log in

Biomedical event causal relation extraction based on a knowledge-guided hierarchical graph network

  • Neural Networks
  • Published:
Soft Computing Aims and scope Submit manuscript

Abstract

Biomedical Event Causal Relation Extraction (BECRE) is a challenging task in biological information extraction and plays a crucial role to serve for knowledge base and knowledge graph construction. Here a biomedical cause-effect relation is defined as an association between two events and requires that the cause-event must occur before the effect-event. Some current advances tend to apply deep learning for the BECRE task and have achieved comparable performances. However, because most of event causal relations are implicitly stated, the performances of these works based on contextual semantics and syntactics might be limited. This fact suggests that it is necessary to introduce external cues to improve the performance of the implicit BECRE especially in the low source scenario. To improve the potential of the designed model, an intuitive idea is to introduce hierarchical knowledge from biological knowledge bases to supplement domain cues for the contexts. Nevertheless, it is difficult to learn the hierarchy and cause-effect direction of knowledge in the model and also few works focus on this issue. Thus, to better fuse knowledge, we propose a Graph Edge-Cluster Attention Network (GECANet) for the BECRE task. Specifically, we introduce external knowledge and build hierarchical knowledge graphs for the contexts. Also, the proposed GECANet effectively aggregates the context and hierarchical knowledge semantics under the guidance of cause-effect direction. The experimental results confirm that fusing external knowledge can effectively guide the model to identify event causal relations and facilitate our approach to achieve state-of-the-art performances respectively on the Hpowell and BioCause datasets.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5

Similar content being viewed by others

Data Availability

The datasets generated during and/or analysed during the current study are available in the https://github.com/Zhangbeibei1991/KG-BCRE/tree/master/datasets repository and the program code has been also made available at: https://github.com/Zhangbeibei1991/KG-BCRE.

Notes

  1. Rel\( _\mathrm{A \rightarrow B} \) denotes the relation from nodes A to node B.

  2. Before the start of experiments, we conducted the operation of the knowledge discovery in advance for all texts in the evaluated datasets and construct a large number of triplets to pre-train knowledge representations.

  3. The reason using GRU in Eq. (5) is that GRU has a relatively simple structure compared with other variants of RNN.

  4. https://huggingface.co/dmis-lab/biobert-base-cased-v1.1

  5. https://huggingface.co/dmis-lab/biobert-v1.1.

  6. https://huggingface.co/monologg/biobert_v1.0_pubmed_pmc.

  7. https://huggingface.co/monologg/biobert_v1.1_pubmed.

  8. The bold in tables indicate the best scores of each column.

  9. The Bold and Italic text span and Bold span in sentence respectively denote E1 and E2 events.

References

  • Akkasi A, Moens M-F (2021) Causal relationship extraction from biomedical text using deep neural models: a comprehensive survey. J Biomed Inform 119:103820. https://doi.org/10.1016/j.jbi.2021.103820

    Article  Google Scholar 

  • Amos L, Anderson D, Brody S, Ripple A, Humphreys BL (2020) UMLS users and uses: a current overview. J Am Med Inf Assoc 27(10):1606–1611. https://doi.org/10.1093/jamia/ocaa084

    Article  Google Scholar 

  • Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT, Harris MA, Hill DP, Issel-Tarver L, Kasarskis A, Lewis S, Matese JC, Richardson JE, Ringwald M, Rubin GM, Sherlock G (2000) Gene ontology: tool for the unification of biology. Nat Genet 25(1):25–29. https://doi.org/10.1038/75556

    Article  Google Scholar 

  • Bordes A, Usunier N, Garcıa-Duran A, Weston J, Yakhnenko O (2013) Translating embeddings for modeling multi-relational data. In: Burges CJC, Bottou L, Ghahramani Z, Weinberger KQ, (eds) advances in neural information processing systems 26: 27th annual conference on neural information processing systems 2013. proceedings of a meeting held December 5–8, 2013, Lake Tahoe, Nevada, United States, pp. 2787–2795, NeurIPS . https://proceedings.neurips.cc/paper/2013/file/1cecc7a77928ca8133fa24680a88d2f9-Paper.pdf

  • Caselli T, Vossen P (2017) The event storyline corpus: A new benchmark for causal and temporal relation extraction. In: Caselli T, Miller B, van Erp M, Vossen P, Palmer M, Hovy EH, Mitamura T, Caswell D, (eds) proceedings of the events and stories in the news workshop@ACL 2017, Vancouver, Canada, August 4, 2017, pp. 77–86. ACL. https://aclanthology.org/W17-2711

  • Chen W, Hao Z, Cai R, Zhang X, Hu Y, Liu M (2016) Multiple-cause discovery combined with structure learning for high-dimensional discrete data and application to stock prediction. Soft Comput 20(11):4575–4588. https://doi.org/10.1007/s00500-015-1764-8

    Article  Google Scholar 

  • Chen J, Hu B, Peng W, Chen Q, Tang B (2022) Biomedical relation extraction via knowledge-enhanced reading comprehension. BMC Bioinform 23(1):1–19. https://doi.org/10.1186/s12859-021-04534-5

    Article  Google Scholar 

  • Chen Z, Feng Y, Zhao D (2022) Entailment graph learning with textual entailment and soft transitivity. In Muresan S, Nakov P, and Villavicencio A, (eds) proceedings of the 60th annual meeting of the association for computational linguistics, ACL 2022, Dublin, Ireland, May 22–27, 2022, pp. 5899–5910. ACL. https://aclanthology.org/2022.acl-long.406

  • Deleger L, Bossy R, Chaix E, Ba M, Ferre A, Bessi‘eres P, Nedellec C (2016) Overview of the bacteria biotope task at bionlp shared task 2016. In: Nedellec C, Bossy R, Kim J-D, (eds) Proceedings of the 4th BioNLP shared short article title 11 task workshop, BioNLP 2016, Berlin, Germany, August 13, 2016, pp. 12–22. ACL. https://aclanthology.org/W16-3002

  • Devlin J, Chang M, Lee K, Toutanova K (2019) BERT: pre-training of deep bidirectional transformers for language understanding. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Vol 1 (Long and Short Papers), Minneapolis, Minnesota. June 3–5, 2019, pp. 4171–4186. ACL. https://doi.org/10.18653/v1/n19-1423

  • Feder A, Keith KA, Manzoor E, Pryzant R, Sridhar D, Wood-Doughty Z, Eisenstein J, Grimmer J, Reichart R, Roberts ME, Stewart BM, Veitch V, Yang D (2021) Causal inference in natural language processing: estimation, prediction, interpretation and beyond. CoRR, abs/2109.00725. arXiv:2109.00725

  • Gao L, Choubey PK, Huang R (2019) Modeling document-level causal structures for event causal relation identification. In: Burstein J, Doran C, and Solorio T, (eds) Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2019, Minneapolis, MN, USA, June 2–7, 2019, pp. 1808–1817. ACL. https://aclanthology.org/N19-1179

  • Gopalan S, Devi SL (2017) Cause and effect extraction from biomedical corpus. Computacion Sistemas 21(4):749–757

    Google Scholar 

  • Hahn-Powell G, Bell D, Valenzuela-Escarcega MA, Surdeanu M (2016) This before that: causal precedence in the biomedical domain. In: Cohen KB, Demner-Fushman D, Ananiadou S, and Tsujii J, (eds) proceedings of the 15th workshop on biomedical natural language processing, BioNLP@ACL 2016, Berlin, Germany, August 12, 2016, pp. 146–155. ACL. https://aclanthology.org/W16-2920

  • Huang KH, Yang M, Peng N (2020) Biomedical event extraction on graph edge-conditioned attention networks with hierarchical knowledge graphs. In: Cohn T, He Y, Liu Y, (eds) findings of the association for computational linguistics: EMNLP 2020, online event, 16–20 November 2020, vol EMNLP 2020 of findings of ACL, pp 1277–1285. ACL. https://aclanthology.org/2020.findings-emnlp.114

  • Jie H, Lipi A, Dongxiao Z, Jianlin C (2015) An overview of bioinformatics methods for modeling biological pathways in Yeast. Briefings Funct Genom 15(2):95–108. https://doi.org/10.1093/bfgp/elv040

    Article  Google Scholar 

  • Kim J-D, Wang Y, Takagi T, Yonezawa A (2011) Overview of genia event task in bionlp shared task 2011. In: Tsujii J, Kim J-D, and Pyysalo S, (eds) proceedings of BioNLP shared task 2011 workshop, Portland, Oregon, USA, June 24, 2011, pp. 7–15. ACL. https://aclanthology.org/W11-1802

  • Kipf TN, Welling M (2017) Semi-supervised classification with graph convolutional networks. In: 5th international conference on learning representations, ICLR 2017, Toulon, France, April 24–26, 2017, conference track proceedings. OpenReview.net. https://openreview.net/forum?id=SJU4ayYgl

  • Lai T, Ji H, Zhai CX, Tran QH (2021) Joint biomedical entity and relation extraction with knowledge-enhanced collective inference. In: Zong C, Xia F, Li W, Navigli R, (eds) proceedings of the 59th annual meeting of the association for computational linguistics and the 11th international joint conference on natural language processing, ACL/IJCNLP 2021, Virtual Event, August 1–6, 2021, pp. 6248–6260. ACL. https://aclanthology.org/2021.acl-long.488

  • Lai T, Ji H, Zhai CX, Tran QH (2021) Joint biomedical entity and relation extraction with knowledge-enhanced collective inference. In: Zong C, Xia F, Li W, Navigli R, (eds) proceedings of the 59th annual meeting of the association for computational linguistics and the 11th international joint conference on natural language processing, ACL/IJCNLP 2021, Virtual Event, August 1–6, 2021, pp. 6248–6260. ACL. https://aclanthology.org/2021.acl-long.488

  • Li P, Mao K (2019) Knowledge-oriented convolutional neural network for causal relation extraction from natural language texts. Expert Syst Appl 115:512–523. https://doi.org/10.1016/j.eswa.2018.08.009

    Article  Google Scholar 

  • Liang Z, Noriega-Atala E, Morrison C, Surdeanu M (2022) Low resource causal event detection from biomedical literature. In: Smaranda M, Preslav N, Aline V, (eds) Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics, ACL 2022, Dublin, Ireland, May 22–27, 2022, pp. 252–263. ACL. https://aclanthology.org/2022.bionlp-1.24

  • Li Z, Ding X, Liu T (2018) Constructing narrative event evolutionary graph for script event prediction. In: Jerome L, (ed) proceedings of the twenty-seventh international joint conference on artificial intelligence, IJCAI 2018, July 13–19, 2018, Stockholm, Sweden, pp. 4201–4207. ijcai.org. https://doi.org/10.24963/ijcai.2018/584

  • Li D, Huang L, Ji H, Han J (2019) Biomedical event extraction based on knowledge-driven tree-lstm. In: Burstein J, Doran C, Solorio T, (eds) proceedings of the 2019 conference of the north American chapter of the association for computational linguistics: human language technologies, NAACL HLT 2019, Minneapolis, MN, USA, June 2–7, 2019, pp. 1421–1430. ACL. https://aclanthology.org/N19-1145

  • Liu J, Chen Y, Zhao J (2020) Knowledge enhanced event causality identification with mention masking generalizations. In: Bessiere C, (ed) proceedings of the twenty-ninth international joint conference on artificial intelligence, IJCAI 2020, pp. 3608–3614. ijcai.org. https://doi.org/10.24963/ijcai.2020/499

  • Mihăilă C, Ananiadou S (2014) Semi-supervised learning of causal relations in biomedical scientific discourse. Biomed Eng Online 13(2):1–24. https://doi.org/10.1186/1475-925X-13-S2-S1

    Article  Google Scholar 

  • Mihaila C, Ohta T, Pyysalo S, Ananiadou S (2013) Biocause: annotating and analysing causality in the biomedical domain. BMC Bioinform 14(1):1–18. https://doi.org/10.1186/1471-2105-14-2

    Article  Google Scholar 

  • Mirza P, S Tonelli (2016) CATENA: causal and temporal relation extraction from natural language texts. In: Calzolari N, Matsumoto Y, Prasad R, (eds) COLING 2016, 26th international conference on computational linguistics, proceedings of the conference: technical papers, December 11–16, 2016, Osaka, Japan, pp. 64–75. ACL. https://aclanthology.org/C16-1007

  • Mirza P, Tonelli S (2014) An analysis of causality between events and its relation to temporal information. In: Hajic J, Tsujii J, (eds) COLING 2014, 25th international conference on computational linguistics, proceedings of the conference: technical papers, August 23–29, 2014, Dublin, Ireland, pp. 2097–2106. ACL. https://aclanthology.org/C14-1198

  • Nedellec C, Bossy R, Kim JD, Kim JJ, Ohta T, Pyysalo S, Zweigenbaum P (2013) Overview of bionlp shared task 2013. In: Proceedings of the BioNLP shared task 2013 workshop, Sofia, Bulgaria, August 9, 2013, pp. 1–7. ACL. https://aclanthology.org/W13-2001

  • Prasad R, McRoy S, Frid N, Joshi AK, Yu H (2011) The biomedical discourse relation bank. BMC Bioinform 12(1):1–18. https://doi.org/10.1186/1471-2105-12-188

    Article  Google Scholar 

  • Sanh V, Debut L, Chaumond J, Wolf T (2019) Distilbert, a distilled version of bert: smaller, faster, cheaper and lighter. arXiv preprint arXiv:1910.01108

  • Saxena A, Chakrabarti S, Talukdar P (2021) Question answering over temporal knowledge graphs. In: C Zong, F Xia, W Li, and R Navigli, (eds) Proceedings of the 59th annual meeting of the association for computational linguistics and the 11th international joint conference on natural language processing, ACL/IJCNLP 2021, virtual event, August 1–6, 2021, pp. 6663–6676. ACL. https://aclanthology.org/2021.acl-long.520

  • Scholkopf B, Locatello F, Bauer S, Ke NR, Kalchbrenner N, Goyal A, Bengio Y (2021) Toward causal representation learning. Proc IEEE 109(5):612–634

    Article  Google Scholar 

  • Sharma R, Palshikar GK, Pawar S (2018) An unsupervised approach for cause-effect relation extraction from biomedical text. In: Silberztein M, Atigui F, Kornyshova E, Metais E, Meziane F, (eds) natural language processing and information systems–23rd international conference on applications of natural language to information systems, NLDB 2018, Paris, France, June 13–15, 2018, proceedings, volume 10859 of lecture notes in computer science, pp 419–427. Springer. https://doi.org/10.1007/978-3-319-91947-8_43

  • Shi Y, Huang Z, Feng S, Zhong H, Wang W, Sun Y (2021) Masked label prediction: Unified message passing model for semi-supervised classification. In: Zhou ZH, (ed) proceedings of the thirtieth international joint conference on artificial intelligence, IJCAI 2021, virtual event/Montreal, Canada, 19–27 August 2021, pp. 1548–1554. ijcai.org. https://doi.org/10.24963/ijcai.2021/214

  • Song L, Zhang Y, Wang Z, Gildea D (2018) N-ary relation extraction using graph-state LSTM, In: Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Association for Computational Linguistics, Brussels, Belgium, 2018, pp. 2226–2235. ACL. https://www.aclweb.org/anthology/D18-1246

  • Velickovic P, Cucurull G, Casanova A, Romero A, Li‘o P, Bengio Y (2018) Graph attention networks. In: 6th international conference on learning representations, ICLR 2018, Vancouver, BC, Canada, April 30–May 3, 2018, conference track proceedings. OpenReview.net. https://openreview.net/forum?id=rJXMpikCZ

  • Vo DT, Al-Obeidat F, Bagheri E (2020) Extracting temporal and causal relations based on event networks. Inf Process Manag 57(6):102319. https://doi.org/10.1016/j.ipm.2020.102319

    Article  Google Scholar 

  • Wang Z, Zhang J, Feng J, Chen Z (2014) Knowledge graph embedding by translating on hyperplanes. In: Brodley CE, Stone P, (eds) proceedings of the twenty-eighth aaai conference on artificial intelligence, July 27–31, 2014, Quebec City, Quebec, Canada, pp. 1112–1119. AAAI press. https://doi.org/10.1609/aaai.v28i1.8870

  • Wu Z, Jiang D, Hsieh CY, Chen G, Liao B, Cao D, Hou T (2021) Hyperbolic relational graph convolution networks plus: a simple but highly efficient QSAR-modeling method. Brief Bioinform 22(5):bbab112. https://doi.org/10.1093/bib/bbab112

    Article  Google Scholar 

  • Wu Q, Deng Z, Pan X, Shen HB, Choi KS, Wang S, Wu J, Yu DJ (2022) MDGF-MCEC: a multi-view dual attention embedding model with cooperative ensemble learning for CircRNA-disease association prediction. Brief Bioinform 23(5):bbac289. https://doi.org/10.1093/bib/bbac289

    Article  Google Scholar 

  • Xing W, Qi J, Yuan X, Li L, Zhang X, Fu Y, Xiong S, Hu L, Peng J (2018) A gene-phenotype relationship extraction pipeline from the biomedical literature using a representation learning approach. Bioinformatics 34(13):386–394. https://doi.org/10.1093/bioinformatics/bty263

    Article  Google Scholar 

  • Zhao D, Wang J, Lin H, Wang X, Yang Z, Zhang Y (2021) Biomedical cross-sentence relation extraction via multihead attention and graph convolutional networks. Appl Soft Comput 104:107230. https://doi.org/10.1016/j.asoc.2021.107230

    Article  Google Scholar 

  • Zhao S, Jiang M, Liu M, Qin B, Liu T (2018) Causaltriad: toward pseudo causal relation discovery and hypotheses generation from medical text data. In: Amarda S, Wu CH, Christina B, Li J, Liu H, Pop M, (eds), Proceedings of the 2018 ACM International Conference on Bioinformatics, Computational Biology, and Health Informatics, BCB 2018, Washington, DC, USA, August 29–September 01, 2018, pp. 184–193. ACM. https://doi.org/10.1145/3233547.3233555

  • Zhu G, Sun Z, Zhang S, Wei S, Li K (2022) Causality extraction model based on two-stage GCN. Soft Comput 26(24):13815–13828. https://doi.org/10.1007/s00500-022-07370-8

  • Zuo X, Cao P, Chen Y, Liu K, Zhao J, Peng W, Chen Y (2021) Improving event causality identification via self-supervised representation learning on external causal statement. In: Zong C, Xia F, Li W, and Navigli R, (eds) findings of the association for computational linguistics: ACL/IJCNLP 2021, Online Event, August 1–6, 2021, volume ACL/IJCNLP 2021 of findings of ACL, pp 2162–2172. ACL. https://aclanthology.org/2021.findings-acl.190

  • Zuo X, Chen Y, Liu K, Zhao J (2020) Knowdis: knowledge enhanced data augmentation for event causality detection via distant supervision. In: Donia S, Bel N, and Zong C, (eds) Proceedings of the 28th International Conference on computational linguistics, COLING 2020, Barcelona, Spain, December 8–13, 2020, pp. 1544–1550. ICCL. https://aclanthology.org/2020.coling-main.135

Download references

Funding

This work is supported by grant from the National Natural Science Foundation of China (No. 62076048), the Science and Technology Innovation Foundation of Dalian (2020JJ26GX035).

Author information

Authors and Affiliations

Authors

Contributions

BZ and LL conceived the experiment(s), BZ and YZ conducted the experiment(s), YZ and DS analysed the results. BZ and LL wrote and reviewed the manuscript.

Corresponding author

Correspondence to Lishuang Li.

Ethics declarations

Conflict of interest

The authors declare that they have no competing interests regarding the publication of this paper.

Ethical approval

This article does not contain any studies with human participants or animals performed by any of the authors.

Informed consent

Informed consent was obtained from all individual participants included in the study.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zhang, B., Li, L., Song, D. et al. Biomedical event causal relation extraction based on a knowledge-guided hierarchical graph network. Soft Comput 27, 17369–17386 (2023). https://doi.org/10.1007/s00500-023-08882-7

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00500-023-08882-7

Keywords

Navigation