Skip to main content
Log in

Traditional Chinese medicine entity relation extraction based on CNN with segment attention

  • S.I: Cognitive-inspired Computing and Applications
  • Published:
Neural Computing and Applications Aims and scope Submit manuscript

Abstract

Extracting medical entity relations from Traditional Chinese Medicine (TCM) related article is crucial to connect domain knowledge between TCM with modern medicine. Herb accounts for the majority of Traditional Chinese Medicine, so our work mainly focuses on herb. The problem would be effectively solved by extracting herb-related entity relations from PubMed literature. In order to realize the entity relation mining, we propose a novel deep-learning model with improved layers without manual feature engineering. We design a new segment attention mechanism based on Convolutional Neural Network, which enables extracting local semantic features through word embedding. Then we classify the relations by connecting different embedding features. We first test this method on the Chemical-Induced Disease task and the experiment show better result comparing to other state-of-the-art deep learning methods. Further, we apply this method to a herbal-related data set (Herbal-Disease and Herbal Chemistry, HD-HC) constructed from PubMed to explore entity relation classification. The experiment shows superior results than other baseline methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8

Similar content being viewed by others

References

  1. Liu J, Chen Z (2011) Traditional Chinese medicine in the new century. Front Med 5(2):111–114

    Article  Google Scholar 

  2. Chai H, Hai LU, Liu QC (2015) Overview of research methods for natural language processing in traditional Chinese medicine. J Med Inf 36(10):58–63

    Google Scholar 

  3. Wu Z, Zhou X, Liu B, Chen J (2004) ‘Text mining for finding functional community of related genes using TCM knowledge’. In European Conference on principles of data mining & knowledge discovery, pp.459–470

  4. Fang YC, Huang HC, Chen HH, Juan HF (2008) TCMGeneDIT: a database for associated traditional Chinese medicine, gene and disease information using text mining. Bmc Complement Altern Med 8:58

    Article  Google Scholar 

  5. Yu T, Li J, Yu Q, Tian Y, Shun X, Xu L, Zhu L, Gao H (2017) Knowledge graph for TCM health preservation: design, construction, and applications. Artif Intell Med 77:48–52

    Article  Google Scholar 

  6. Golshan PN, Dashti HAR, Azizi S, Safari L (2018) ‘A study of recent contributions on information extraction’. The 4th national conference on distributed computing and big data processing

  7. Haihong HE, Zhang WJ, Xiao SQ, Cheng R, Hu YX, Zhou XS, Niu PQ (2019) A survey of entity relationship extraction based on deep learning. J Softw 30(6):1793–1818

    MATH  Google Scholar 

  8. Li J, Sun Y, Johnson RJ, Sciaky D, Wei CH, Leaman R, Davis AP, Mattingly CJ, Wiegers TC, Lu Z (2016) ‘BioCreative V CDR task corpus: a resource for chemical disease relation extraction’, Database the J Biol Databases Curation, vol. 2016, Article Number: baw068

  9. Bai T, Gong L, Wang Y et al (2016) A method for exploring implicit concept relatedness in biomedical knowledge network. BMC Bioinf 17(9):53–56

    Google Scholar 

  10. Gu J, Qian L, Zhou G (2016)‘Chemical-Induced disease relation extraction with various linguistic features’, Database, vol. 2016, Article Number: baw042

  11. Gu J, Sun F, Qian L, Zhou G (2017) Chemical-Induced disease relation extraction via convolutional neural network. Database J Biol Databases Curation 1:2017

    Google Scholar 

  12. Zhou H, Deng H, Chen L, Yang Y, Chen J, Huang D (2016) ‘Exploiting syntactic and semantics information for chemical–disease relation extraction’, Database J Biol Databases Curation, vol. 2016, Article Number: w48

  13. Li H, Chen Q, Tang B, Wang X (2017) ‘Chemical-Induced disease extraction via convolutional neural networks with attention’, 2017 IEEE international conference on bioinformatics and biomedicine (BIBM), Kansas City, MO,USA, pp. 1276–1279

  14. Li H, Ming Y, Chen Q, Tang B, Wang X, Yan J (2018) Chemical-Induced disease extraction via recurrent piecewise convolutional neural networks. BMC Med Inform Decis Mak 18(S2):45–51

    Article  Google Scholar 

  15. Li Y, Jin R, Luo Y (2018) Classifying relations in clinical narratives using segment graph convolutional and recurrent neural networks (Seg-GCRNs). J Am Med Inform Assoc 26(3):262–268

    Article  Google Scholar 

  16. Wang D, Su J, Yu H (2020) Feature extraction and analysis of natural language processing for deep learning English language. IEEE Access 8:46335–46345

    Article  Google Scholar 

  17. Luo Y, Cheng Y, Uzuner Ã, Szolovits P, Starren J (2017) Segment convolutional neural networks (Seg-CNNs) for classifying relations in clinical notes. J Am Med Inform Assoc 25(1):93–98

    Article  Google Scholar 

  18. Bai T, Wang C et al (2020) A novel deep learning method for extracting unspecific biomedical relation. Concurr Comput Pract Exp 32(1):e5005

    Article  Google Scholar 

  19. Wan H, Moens MF, Luyten W, Zhou X, Mei Q, Liu L, Tang J (2016) Extracting relations from traditional chinese medicine literature via heterogeneous entity networks. J Am Med Inf Asmsociation Jaia 23(2):356–365

    Article  Google Scholar 

  20. Wang J, Poon J (2017) ‘Relation extraction from traditional Chinese medicine journal publication’. In IEEE international conference on bioinformatics & biomedicine, pp.15–18

  21. Yang XH, Shan YH, Xie D, Li XD (2017) Relation extraction of traditional Chinese medicine prescription and disease based on literature abstracts data. Mod Tradit Chin Med Mater Medica-World Sci Technol 19(7):1167–1172

    Google Scholar 

  22. Han H, Liu J, Liu G (2018) Attention-based memory network for text sentiment classification. IEEE Access 6:68302–68310

    Article  Google Scholar 

  23. Xiang Y, Xu Y, Yu Z et al (2019) CNN-based text multi-classifier using filters initialised by N-gram vector. Int J Inf Commun Technol 15(4):419

    Google Scholar 

  24. Vu NT, Adel H, Gupta P, Schütze H (2016) ‘Combining recurrent and convolutional neural networks for relation classification’. In: proceedings of NAACL-HLT, pp. 534–539

  25. Luong MT, Pham H, Manning CD (2015) ‘Effective approaches to attention-based neural machine translation’. In: proceedings of the 2015 conference on empirical methods in natural language processing. Association for Computational Linguistics, Lisbon, Portugal, pp. 1412–1421

  26. Ye W, Zhi Z, Shan J, Liu J, Mi L (2017)‘Comparisons and selections of features and classifiers for short text classification.’ In, IOP conference series-materials science and engineering (Iop Publishing Ltd) Vol. 261

  27. Amin S, Uddin MI, Hassan S et al (2020) Recurrent neural networks with TF-IDF embedding technique for detection and classification in tweets of dengue disease. IEEE Access 8:131522–131533

    Article  Google Scholar 

  28. Xu J, Wu Y, Zhang Y, Wang J, Lee HJ, Xu H (2016) ‘CD-REST: a system for extracting chemical-induced disease relation in literature’, Database, vol. 2016 Article Number:baw036

  29. Chika Onye S, Akkeleş A, Dimililer N (2018) RelSCAN—A system for extracting chemical-induced disease relation from biomedical literature. J Biomed Inf 87(2018):79–87

    Article  Google Scholar 

  30. Bai T, Ge Y et al (2019) BERST: an engine and tool for exploring biomedical entities and relationships. Chin J Electron 28(4):797–804

    Article  Google Scholar 

Download references

Funding

This work is supported by the Development Project of Jilin Province of China (Nos.20200801033GH, YDZJ202101ZYTS128), Jilin Provincial Key Laboratory of Big Data Intelligent Computing (No.20180622002JC), The Fundamental Research Funds for the Central University, JLU.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Lan Huang.

Ethics declarations

Conflict of interest

The author(s) declared no potential conflicts of interest with respect to the research, author- ship, and/or publication of this article.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Bai, T., Guan, H., Wang, S. et al. Traditional Chinese medicine entity relation extraction based on CNN with segment attention. Neural Comput & Applic 34, 2739–2748 (2022). https://doi.org/10.1007/s00521-021-05897-9

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00521-021-05897-9

Keywords

Navigation