Abstract
Electronic Health Records (EHRs) are generated in the clinical treatment process and contain a large number of medical knowledge, which is closely related to the health status of patients. Thus information extraction on unstructured clinical notes in EHRs is important which could contribute to huge improvement in patient health management. Besides, temporal related information extraction seems to be more essential since clinical notes are designed to capture states of patients over time. Previous studies mainly focused on English corpus. However, there are very limited research work on Chinese EHRs. Due to the challenges brought by the characteristics of Chinese, it is difficult to apply existing techniques for English on Chinese corpus directly. Considering this situation, we proposed a deep learning based temporal information extraction framework in this paper. Our framework contains three components: data preprocessing, temporal entity extraction and temporal relation extraction. For temporal entity extraction, we proposed a recurrent neural network based model, using bidirectional long short-term memory (LSTM) with Conditional Random Fields decoding (LSTM-CRF). For temporal relation extraction, we utilize Convolutional Neural Network (CNN) to classify temporal relations between clinical entities and temporal related expressions. To the best of our knowledge, this is the first framework to apply deep learning to temporal information extraction from clinical notes in Chinese EHRs. We conduct extensive sets of experiments on real-world datasets from hospital. The experimental results show the effectiveness of our framework, indicating its practical application value.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Ao, X., Luo, P., Wang, J., Zhuang, F., He, Q.: Mining precise-positioning episode rules from event sequences. IEEE Trans. Knowl. Data Eng. 30(3), 530–543 (2018)
Birkhead, G.S., Klompas, M., Shah, N.R.: Uses of electronic health records for public health surveillance to advance public health. Annu. Rev. Public Health 36, 345–359 (2015)
Chen, X., Zhang, Y., Xu, J., Xing, C., Chen, H.: Deep learning based topic identification and categorization: mining diabetes-related topics on Chinese health websites. In: Navathe, S.B., Wu, W., Shekhar, S., Du, X., Wang, X.S., Xiong, H. (eds.) DASFAA 2016. LNCS, vol. 9642, pp. 481–500. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-32025-0_30
Cheng, Y., Anick, P., Hong, P., Xue, N.: Temporal relation discovery between events and temporal expressions identified in clinical narrative. J. Biomed. Inform. 46, S48–S53 (2013)
Cherry, C., Zhu, X., Martin, J.D., de Bruijn, B.: À la recherche du temps perdu: extracting temporal relations from medical text in the 2012 i2b2 NLP challenge. JAMIA 20(5), 843–848 (2013)
Cho, K., et al.: Learning phrase representations using RNN encoder-decoder for statistical machine translation. arXiv preprint arXiv:1406.1078 (2014)
Collobert, R., Weston, J., Bottou, L., Karlen, M., Kavukcuoglu, K., Kuksa, P.: Natural language processing (almost) from scratch. J. Mach. Learn. Res. 12(Aug), 2493–2537 (2011)
Dligach, D., Miller, T., Lin, C., Bethard, S., Savova, G.: Neural temporal relation extraction. In: Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, Short Papers, vol. 2, pp. 746–751 (2017)
D’Souza, J., Ng, V.: Temporal relation identification and classification in clinical notes. In: ACM Conference on Bioinformatics, Computational Biology and Biomedical Informatics, ACM-BCB 2013, Washington, DC, USA, 22–25 September 2013, p. 392 (2013). https://doi.org/10.1145/2506583.2506654
Hristovski, D., Dinevski, D., Kastrin, A., Rindflesch, T.C.: Biomedical question answering using semantic relations. BMC Bioinform. 16(1), 6 (2015)
Lafferty, J., McCallum, A., Pereira, F.C.: Conditional random fields: probabilistic models for segmenting and labeling sequence data (2001)
Li, L., Zhang, J., He, Y., Wang, H.: Chinese temporal relation resolution based on Chinese-English parallel corpus. Int. J. Embed. Syst. 9(2), 101–111 (2017)
Lin, C., Miller, T., Dligach, D., Bethard, S., Savova, G.: Representations of time expressions for temporal relation extraction with convolutional neural networks. BioNLP 2017, 322–327 (2017)
Mikolov, T., Sutskever, I., Chen, K., Corrado, G.S., Dean, J.: Distributed representations of words and phrases and their compositionality. In: Advances in Neural Information Processing Systems, pp. 3111–3119 (2013)
Mirza, P., Tonelli, S.: Classifying temporal relations with simple features. In: Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, pp. 308–317 (2014)
Mishra, R., et al.: Text summarization in the biomedical domain: a systematic review of recent research. J. Biomed. Inform. 52, 457–467 (2014)
Musen, M.A., Middleton, B., Greenes, R.A.: Clinical decision-support systems. In: Shortliffe, E.H., Cimino, J.J. (eds.) Biomedical Informatics: Computer Applications in Health Care and Biomedicine, pp. 643–674. Springer, London (2014). https://doi.org/10.1007/978-1-4471-4474-8_22
Palangi, H., et al.: Deep sentence embedding using long short-term memory networks: analysis and application to information retrieval. IEEE/ACM Trans. Audio, Speech Lang. Process. (TASLP) 24(4), 694–707 (2016)
Sun, W., Rumshisky, A., Uzuner, O.: Evaluating temporal relations in clinical text: 2012 i2b2 challenge. J. Am. Med. Inform. Assoc. 20(5), 806–813 (2013)
Tang, B., Wu, Y., Jiang, M., Chen, Y., Denny, J.C., Xu, H.: A hybrid system for temporal information extraction from clinical text. JAMIA 20(5), 828–835 (2013)
Tang, D., Qin, B., Liu, T.: Document modeling with gated recurrent neural network for sentiment classification. In: Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, pp. 1422–1432 (2015)
Tourille, J., Ferret, O., Neveol, A., Tannier, X.: Neural architecture for temporal relation extraction: a Bi-LSTM approach for detecting narrative containers. In: Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, Short Papers, vol. 2, pp. 224–230 (2017)
Wang, J., Wang, Z., Zhang, D., Yan, J.: Combining knowledge with deep convolutional neural networks for short text classification. In: Proceedings of the 26th International Joint Conference on Artificial Intelligence, pp. 2915–2921. AAAI Press (2017)
Xu, Y., Wang, Y., Liu, T., Tsujii, J., Chang, E.I.C.: An end-to-end system to identify temporal relation in discharge summaries: 2012 i2b2 challenge. J. Am. Med. Inform. Assoc. 20(5), 849–858 (2013)
Xu, Y., Wang, Y., Liu, T., Tsujii, J., Chang, E.I.: An end-to-end system to identify temporal relation in discharge summaries: 2012 i2b2 challenge. JAMIA 20(5), 849–858 (2013)
Zhang, Y., Li, X., Wang, J., Zhang, Y., Xing, C., Yuan, X.: An efficient framework for exact set similarity search using tree structure indexes. In: 2017 IEEE 33rd International Conference on Data Engineering (ICDE), pp. 759–770. IEEE (2017)
Zheng, X., Li, P., Huang, Y., Zhu, Q.: An approach to recognize temporal relations between Chinese events. In: Lu, Q., Gao, H. (eds.) Chinese Lexical Semantics. LNCS (LNAI), vol. 9332, pp. 543–553. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-27194-1_55
Acknowledgement
Our work is supported by NSFC (91646202), the National High-tech R&D Program of China (SS2015AA020102), Research/Project 2017YB142 supported by Ministry of Education of The People’s Republic of China, the 1000-Talent program, Tsinghua University Initiative Scientific Research Program.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Switzerland AG
About this paper
Cite this paper
Tian, B., Xing, C. (2018). Deep Learning Based Temporal Information Extraction Framework on Chinese Electronic Health Records. In: Meng, X., Li, R., Wang, K., Niu, B., Wang, X., Zhao, G. (eds) Web Information Systems and Applications. WISA 2018. Lecture Notes in Computer Science(), vol 11242. Springer, Cham. https://doi.org/10.1007/978-3-030-02934-0_19
Download citation
DOI: https://doi.org/10.1007/978-3-030-02934-0_19
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-02933-3
Online ISBN: 978-3-030-02934-0
eBook Packages: Computer ScienceComputer Science (R0)