skip to main content
10.1145/3584376.3584501acmotherconferencesArticle/Chapter ViewAbstractPublication PagesricaiConference Proceedingsconference-collections
research-article

SERG: A Sequence-to-Sequence Model for Chinese ECG Report Generation

Published: 19 April 2023 Publication History

Abstract

Medical report generation is a rising application of big data, which mitigates burden on doctors in clinical trail. However, the serious imbalanced distribution of diseases leads to data bias, an important issue in qualified medical reports generation. To address this issue, a sequence-to-sequence model with incorporation of clinical experience is proposed to generate Chinese ElectroCardioGraph (ECG) reports. The proposed model consists of Electrocardiograph Feature Extractor (EFE), Posterior Knowledge Embedding (PKE) and Report Generator (RG). Firstly, we introduce a novel spatial-temporal information fusion module in EFE to extract robust features from ECG data. Then, embeddings of ECG tags extracted from clinical ECG reports combined with output of EFE are then feed into PKE, which builds a bridge between ECG tags and ECG data, alleviating the problem caused by data bias. Finally, a transformer-based decoder is used in RG to generate ECG reports step by step with output of PKE as key and value. Experiments conducted on private data show that the proposed model can obtain an accuracy 50.26% on BLEU-4, 7.69% higher than state-of-the-art. Our method can also achieve better fluent reports, as demonstrated by the performance on CIDEr, a commonly used content metric.

References

[1]
Connie W Tsao, Aaron W Aday, Zaid I Almarzooq, Alvaro Alonso, Andrea Z Beaton, Marcio S Bittencourt, Amelia K Boehme, Alfred E Buxton, April P Carson, Yvonne Commodore-Mensah, and others. 2022. Heart disease and stroke statistics—2022 update: a report from the american heart association. Circulation, vol. 145, no. 8, pp. e153–e639.
[2]
Babak Mohammadzadeh Asl, Seyed Kamaledin Setarehdan, and Maryam Mohebbi. 2008. Support vector machine-based arrhythmia classification using reduced features of heart rate variability signal. Artificial Intelligence in Medicine, vol. 44, no. 1, pp. 51–64. https://doi.org/10.1016/j.artmed.2008.04.007
[3]
Jing Zhang, Xiang Chen, Aiping Liu, Xun Chen, and Min Gao. 2020. Ecg-based multiclass arrhythmia detection using spatio-temporal attention-based convolutional recurrent neural network. Artificial Intelligence in Medicine, vol. 106, p. 101856.
[4]
Qihang Yao, Ruxin Wang, Xiaomao Fan, Jikui Liu, and Ye Li. 2020. Multi-class arrhythmia detection from 12-lead varied-length ecg using attention-based time-incremental convolutional neural network. Information Fusion, vol. 53, pp. 174–182.
[5]
Jikuo Wang, Xu Qiao, Changchun Liu, Xinpei Wang, Yuanyuan Liu, Lianke Yao, and Huan Zhang. 2021. Automated ecg classification using a non-local convolutional block attention module. Computer Methods and Programs in Biomedicine, vol. 203, no. 7, p. 106006. https://doi.org/10.1016/j.cmpb.2021.106006
[6]
Ilya Sutskever, Oriol Vinyals, and Quoc V. Le. 2014. Sequence to sequence learning with neural networks. In Proceedings of the 27th International Conference on Neural Information Processing Systems - Volume 2 (NIPS'14). MIT Press, Cambridge, MA, USA, 3104–3112.
[7]
Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Łukasz Kaiser, and Illia Polosukhin. 2017. Attention is all you need. In Proceedings of the 31st International Conference on Neural Information Processing Systems (NIPS'17). Curran Associates Inc., Red Hook, NY, USA, 6000–6010.
[8]
Oriol Vinyals, Alexander Toshev, Samy Bengio, and Dumitru Erhan. 2015. Show and tell: A neural image caption generator. 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3156-3164.
[9]
Lun Huang, Wenmin Wang, Jie Chen, amd Xiaoyong Wei. 2019. Attention on attention for image captioning. 2019 IEEE/CVF International Conference on Computer Vision (ICCV), pp. 4633–4642.
[10]
Jonas Gehring, Michael Auli, David Grangier, Denis Yarats, and Yann N. Dauphin. 2017. Convolutional sequence to sequence learning. In Proceedings of the 34th International Conference on Machine Learning - Volume 70 (ICML'17). JMLR.org, 1243–1252.
[11]
Mikhail S. Burtsev, Yuri Kuratov, Anton Peganov, and Grigory V Sapunov. 2020. Memory transformer. https://arxiv.org/abs/2006.11527
[12]
Yi Tay, Dara Bahri, Donald Metzler, Da-Cheng Juan, Zhe Zhao, and Che Zheng. 2021. Synthesizer: Rethinking self-attention in transformer models. Proceedings of the 38th International Conference on Machine Learning, PMLR 139:10183-10192.
[13]
Nikita Kitaev, Lukasz Kaiser, and Anselm Levskaya. 2020. Reformer: The efficient transformer. https://arxiv.org/abs/2001.04451.
[14]
Sinong Wang, Belinda Z. Li, Madian Khabsa, Han Fang and Hao Ma. 2020. Linformer: Self-attention with linear complexity. https://arxiv.org/abs/2006.04768.
[15]
Angelos Katharopoulos, Apoorv Vyas, Nikolaos Pappas, and François Fleuret. 2020. Transformers are rnns: Fast autoregressive transformers with linear attention. https://arxiv.org/abs/2006.16236.
[16]
Dan Hendrycks, and Kevin Gimpel. 2016. Gaussian error linear units (gelus). arXiv: Learning.
[17]
Jimmy Ba, J. Kiros, and Geoffrey E. Hinton. 2016. Layer normalization. https://arxiv.org/abs/1607.06450.
[18]
Tomas Mikolov, Kai Chen, G. S. Corrado, and Jeffrey Dean. 2013. Efficient estimation of word representations in vector space. International Conference on Learning Representations.
[19]
Kishore Papineni, Salim Roukos, Todd Ward, and Wei-Jing Zhu. 2002. BLEU: a method for automatic evaluation of machine translation. In Proceedings of the 40th Annual Meeting on Association for Computational Linguistics (ACL '02). Association for Computational Linguistics, USA, 311–318. https://doi.org/10.3115/1073083.1073135.
[20]
Michael Denkowski and Alon Lavie. 2011. Meteor 1.3: automatic metric for reliable optimization and evaluation of machine translation systems. In Proceedings of the Sixth Workshop on Statistical Machine Translation (WMT '11). Association for Computational Linguistics, USA, 85–91.
[21]
Chin-Yew Lin. 2004. ROUGE: A Package for Automatic Evaluation of Summaries. In Text Summarization Branches Out, pages 74–81, Barcelona, Spain. Association for Computational Linguistics.
[22]
Ramakrishna Vedantam, C. Lawrence Zitnick, and Devi Parikh. 2015. Cider: Consensus-based image description evaluation. 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 4566–4575.
[23]
Awni Y. Hannun, Andrew L. Maas, Daniel Jurafsky, and Andrew Y. Ng. 2014. First-pass large vocabulary continuous speech recognition using bi-directional recurrent dnns. https://arxiv.org/abs/1408.2873.

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
RICAI '22: Proceedings of the 2022 4th International Conference on Robotics, Intelligent Control and Artificial Intelligence
December 2022
1396 pages
ISBN:9781450398343
DOI:10.1145/3584376
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 19 April 2023

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

RICAI 2022

Acceptance Rates

Overall Acceptance Rate 140 of 294 submissions, 48%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 88
    Total Downloads
  • Downloads (Last 12 months)53
  • Downloads (Last 6 weeks)8
Reflects downloads up to 02 Mar 2025

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media