Abstract
Extracting medical entity relations from Traditional Chinese Medicine (TCM) related article is crucial to connect domain knowledge between TCM with modern medicine. Herb accounts for the majority of Traditional Chinese Medicine, so our work mainly focuses on herb. The problem would be effectively solved by extracting herb-related entity relations from PubMed literature. In order to realize the entity relation mining, we propose a novel deep-learning model with improved layers without manual feature engineering. We design a new segment attention mechanism based on Convolutional Neural Network, which enables extracting local semantic features through word embedding. Then we classify the relations by connecting different embedding features. We first test this method on the Chemical-Induced Disease task and the experiment show better result comparing to other state-of-the-art deep learning methods. Further, we apply this method to a herbal-related data set (Herbal-Disease and Herbal Chemistry, HD-HC) constructed from PubMed to explore entity relation classification. The experiment shows superior results than other baseline methods.








Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Liu J, Chen Z (2011) Traditional Chinese medicine in the new century. Front Med 5(2):111–114
Chai H, Hai LU, Liu QC (2015) Overview of research methods for natural language processing in traditional Chinese medicine. J Med Inf 36(10):58–63
Wu Z, Zhou X, Liu B, Chen J (2004) ‘Text mining for finding functional community of related genes using TCM knowledge’. In European Conference on principles of data mining & knowledge discovery, pp.459–470
Fang YC, Huang HC, Chen HH, Juan HF (2008) TCMGeneDIT: a database for associated traditional Chinese medicine, gene and disease information using text mining. Bmc Complement Altern Med 8:58
Yu T, Li J, Yu Q, Tian Y, Shun X, Xu L, Zhu L, Gao H (2017) Knowledge graph for TCM health preservation: design, construction, and applications. Artif Intell Med 77:48–52
Golshan PN, Dashti HAR, Azizi S, Safari L (2018) ‘A study of recent contributions on information extraction’. The 4th national conference on distributed computing and big data processing
Haihong HE, Zhang WJ, Xiao SQ, Cheng R, Hu YX, Zhou XS, Niu PQ (2019) A survey of entity relationship extraction based on deep learning. J Softw 30(6):1793–1818
Li J, Sun Y, Johnson RJ, Sciaky D, Wei CH, Leaman R, Davis AP, Mattingly CJ, Wiegers TC, Lu Z (2016) ‘BioCreative V CDR task corpus: a resource for chemical disease relation extraction’, Database the J Biol Databases Curation, vol. 2016, Article Number: baw068
Bai T, Gong L, Wang Y et al (2016) A method for exploring implicit concept relatedness in biomedical knowledge network. BMC Bioinf 17(9):53–56
Gu J, Qian L, Zhou G (2016)‘Chemical-Induced disease relation extraction with various linguistic features’, Database, vol. 2016, Article Number: baw042
Gu J, Sun F, Qian L, Zhou G (2017) Chemical-Induced disease relation extraction via convolutional neural network. Database J Biol Databases Curation 1:2017
Zhou H, Deng H, Chen L, Yang Y, Chen J, Huang D (2016) ‘Exploiting syntactic and semantics information for chemical–disease relation extraction’, Database J Biol Databases Curation, vol. 2016, Article Number: w48
Li H, Chen Q, Tang B, Wang X (2017) ‘Chemical-Induced disease extraction via convolutional neural networks with attention’, 2017 IEEE international conference on bioinformatics and biomedicine (BIBM), Kansas City, MO,USA, pp. 1276–1279
Li H, Ming Y, Chen Q, Tang B, Wang X, Yan J (2018) Chemical-Induced disease extraction via recurrent piecewise convolutional neural networks. BMC Med Inform Decis Mak 18(S2):45–51
Li Y, Jin R, Luo Y (2018) Classifying relations in clinical narratives using segment graph convolutional and recurrent neural networks (Seg-GCRNs). J Am Med Inform Assoc 26(3):262–268
Wang D, Su J, Yu H (2020) Feature extraction and analysis of natural language processing for deep learning English language. IEEE Access 8:46335–46345
Luo Y, Cheng Y, Uzuner Ã, Szolovits P, Starren J (2017) Segment convolutional neural networks (Seg-CNNs) for classifying relations in clinical notes. J Am Med Inform Assoc 25(1):93–98
Bai T, Wang C et al (2020) A novel deep learning method for extracting unspecific biomedical relation. Concurr Comput Pract Exp 32(1):e5005
Wan H, Moens MF, Luyten W, Zhou X, Mei Q, Liu L, Tang J (2016) Extracting relations from traditional chinese medicine literature via heterogeneous entity networks. J Am Med Inf Asmsociation Jaia 23(2):356–365
Wang J, Poon J (2017) ‘Relation extraction from traditional Chinese medicine journal publication’. In IEEE international conference on bioinformatics & biomedicine, pp.15–18
Yang XH, Shan YH, Xie D, Li XD (2017) Relation extraction of traditional Chinese medicine prescription and disease based on literature abstracts data. Mod Tradit Chin Med Mater Medica-World Sci Technol 19(7):1167–1172
Han H, Liu J, Liu G (2018) Attention-based memory network for text sentiment classification. IEEE Access 6:68302–68310
Xiang Y, Xu Y, Yu Z et al (2019) CNN-based text multi-classifier using filters initialised by N-gram vector. Int J Inf Commun Technol 15(4):419
Vu NT, Adel H, Gupta P, Schütze H (2016) ‘Combining recurrent and convolutional neural networks for relation classification’. In: proceedings of NAACL-HLT, pp. 534–539
Luong MT, Pham H, Manning CD (2015) ‘Effective approaches to attention-based neural machine translation’. In: proceedings of the 2015 conference on empirical methods in natural language processing. Association for Computational Linguistics, Lisbon, Portugal, pp. 1412–1421
Ye W, Zhi Z, Shan J, Liu J, Mi L (2017)‘Comparisons and selections of features and classifiers for short text classification.’ In, IOP conference series-materials science and engineering (Iop Publishing Ltd) Vol. 261
Amin S, Uddin MI, Hassan S et al (2020) Recurrent neural networks with TF-IDF embedding technique for detection and classification in tweets of dengue disease. IEEE Access 8:131522–131533
Xu J, Wu Y, Zhang Y, Wang J, Lee HJ, Xu H (2016) ‘CD-REST: a system for extracting chemical-induced disease relation in literature’, Database, vol. 2016 Article Number:baw036
Chika Onye S, Akkeleş A, Dimililer N (2018) RelSCAN—A system for extracting chemical-induced disease relation from biomedical literature. J Biomed Inf 87(2018):79–87
Bai T, Ge Y et al (2019) BERST: an engine and tool for exploring biomedical entities and relationships. Chin J Electron 28(4):797–804
Funding
This work is supported by the Development Project of Jilin Province of China (Nos.20200801033GH, YDZJ202101ZYTS128), Jilin Provincial Key Laboratory of Big Data Intelligent Computing (No.20180622002JC), The Fundamental Research Funds for the Central University, JLU.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The author(s) declared no potential conflicts of interest with respect to the research, author- ship, and/or publication of this article.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Bai, T., Guan, H., Wang, S. et al. Traditional Chinese medicine entity relation extraction based on CNN with segment attention. Neural Comput & Applic 34, 2739–2748 (2022). https://doi.org/10.1007/s00521-021-05897-9
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00521-021-05897-9