research-article

SERG: A Sequence-to-Sequence Model for Chinese ECG Report Generation

Authors:

Na WeiAuthors Info & Claims

RICAI '22: Proceedings of the 2022 4th International Conference on Robotics, Intelligent Control and Artificial Intelligence

Pages 708 - 712

https://doi.org/10.1145/3584376.3584501

Published: 19 April 2023 Publication History

Abstract

Medical report generation is a rising application of big data, which mitigates burden on doctors in clinical trail. However, the serious imbalanced distribution of diseases leads to data bias, an important issue in qualified medical reports generation. To address this issue, a sequence-to-sequence model with incorporation of clinical experience is proposed to generate Chinese ElectroCardioGraph (ECG) reports. The proposed model consists of Electrocardiograph Feature Extractor (EFE), Posterior Knowledge Embedding (PKE) and Report Generator (RG). Firstly, we introduce a novel spatial-temporal information fusion module in EFE to extract robust features from ECG data. Then, embeddings of ECG tags extracted from clinical ECG reports combined with output of EFE are then feed into PKE, which builds a bridge between ECG tags and ECG data, alleviating the problem caused by data bias. Finally, a transformer-based decoder is used in RG to generate ECG reports step by step with output of PKE as key and value. Experiments conducted on private data show that the proposed model can obtain an accuracy 50.26% on BLEU-4, 7.69% higher than state-of-the-art. Our method can also achieve better fluent reports, as demonstrated by the performance on CIDEr, a commonly used content metric.

References

[1]

Connie W Tsao, Aaron W Aday, Zaid I Almarzooq, Alvaro Alonso, Andrea Z Beaton, Marcio S Bittencourt, Amelia K Boehme, Alfred E Buxton, April P Carson, Yvonne Commodore-Mensah, and others. 2022. Heart disease and stroke statistics—2022 update: a report from the american heart association. Circulation, vol. 145, no. 8, pp. e153–e639.

[2]

Babak Mohammadzadeh Asl, Seyed Kamaledin Setarehdan, and Maryam Mohebbi. 2008. Support vector machine-based arrhythmia classification using reduced features of heart rate variability signal. Artificial Intelligence in Medicine, vol. 44, no. 1, pp. 51–64. https://doi.org/10.1016/j.artmed.2008.04.007

Digital Library

[3]

Jing Zhang, Xiang Chen, Aiping Liu, Xun Chen, and Min Gao. 2020. Ecg-based multiclass arrhythmia detection using spatio-temporal attention-based convolutional recurrent neural network. Artificial Intelligence in Medicine, vol. 106, p. 101856.

[4]

Qihang Yao, Ruxin Wang, Xiaomao Fan, Jikui Liu, and Ye Li. 2020. Multi-class arrhythmia detection from 12-lead varied-length ecg using attention-based time-incremental convolutional neural network. Information Fusion, vol. 53, pp. 174–182.

Digital Library

[5]

Jikuo Wang, Xu Qiao, Changchun Liu, Xinpei Wang, Yuanyuan Liu, Lianke Yao, and Huan Zhang. 2021. Automated ecg classification using a non-local convolutional block attention module. Computer Methods and Programs in Biomedicine, vol. 203, no. 7, p. 106006. https://doi.org/10.1016/j.cmpb.2021.106006

[6]

Ilya Sutskever, Oriol Vinyals, and Quoc V. Le. 2014. Sequence to sequence learning with neural networks. In Proceedings of the 27th International Conference on Neural Information Processing Systems - Volume 2 (NIPS'14). MIT Press, Cambridge, MA, USA, 3104–3112.

Digital Library

[7]

Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Łukasz Kaiser, and Illia Polosukhin. 2017. Attention is all you need. In Proceedings of the 31st International Conference on Neural Information Processing Systems (NIPS'17). Curran Associates Inc., Red Hook, NY, USA, 6000–6010.

Digital Library

[8]

Oriol Vinyals, Alexander Toshev, Samy Bengio, and Dumitru Erhan. 2015. Show and tell: A neural image caption generator. 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3156-3164.

[9]

Lun Huang, Wenmin Wang, Jie Chen, amd Xiaoyong Wei. 2019. Attention on attention for image captioning. 2019 IEEE/CVF International Conference on Computer Vision (ICCV), pp. 4633–4642.

[10]

Jonas Gehring, Michael Auli, David Grangier, Denis Yarats, and Yann N. Dauphin. 2017. Convolutional sequence to sequence learning. In Proceedings of the 34th International Conference on Machine Learning - Volume 70 (ICML'17). JMLR.org, 1243–1252.

[11]

Mikhail S. Burtsev, Yuri Kuratov, Anton Peganov, and Grigory V Sapunov. 2020. Memory transformer. https://arxiv.org/abs/2006.11527

[12]

Yi Tay, Dara Bahri, Donald Metzler, Da-Cheng Juan, Zhe Zhao, and Che Zheng. 2021. Synthesizer: Rethinking self-attention in transformer models. Proceedings of the 38th International Conference on Machine Learning, PMLR 139:10183-10192.

[13]

Nikita Kitaev, Lukasz Kaiser, and Anselm Levskaya. 2020. Reformer: The efficient transformer. https://arxiv.org/abs/2001.04451.

[14]

Sinong Wang, Belinda Z. Li, Madian Khabsa, Han Fang and Hao Ma. 2020. Linformer: Self-attention with linear complexity. https://arxiv.org/abs/2006.04768.

[15]

Angelos Katharopoulos, Apoorv Vyas, Nikolaos Pappas, and François Fleuret. 2020. Transformers are rnns: Fast autoregressive transformers with linear attention. https://arxiv.org/abs/2006.16236.

[16]

Dan Hendrycks, and Kevin Gimpel. 2016. Gaussian error linear units (gelus). arXiv: Learning.

[17]

Jimmy Ba, J. Kiros, and Geoffrey E. Hinton. 2016. Layer normalization. https://arxiv.org/abs/1607.06450.

[18]

Tomas Mikolov, Kai Chen, G. S. Corrado, and Jeffrey Dean. 2013. Efficient estimation of word representations in vector space. International Conference on Learning Representations.

[19]

Kishore Papineni, Salim Roukos, Todd Ward, and Wei-Jing Zhu. 2002. BLEU: a method for automatic evaluation of machine translation. In Proceedings of the 40th Annual Meeting on Association for Computational Linguistics (ACL '02). Association for Computational Linguistics, USA, 311–318. https://doi.org/10.3115/1073083.1073135.

Digital Library

[20]

Michael Denkowski and Alon Lavie. 2011. Meteor 1.3: automatic metric for reliable optimization and evaluation of machine translation systems. In Proceedings of the Sixth Workshop on Statistical Machine Translation (WMT '11). Association for Computational Linguistics, USA, 85–91.

[21]

Chin-Yew Lin. 2004. ROUGE: A Package for Automatic Evaluation of Summaries. In Text Summarization Branches Out, pages 74–81, Barcelona, Spain. Association for Computational Linguistics.

[22]

Ramakrishna Vedantam, C. Lawrence Zitnick, and Devi Parikh. 2015. Cider: Consensus-based image description evaluation. 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 4566–4575.

[23]

Awni Y. Hannun, Andrew L. Maas, Daniel Jurafsky, and Andrew Y. Ng. 2014. First-pass large vocabulary continuous speech recognition using bi-directional recurrent dnns. https://arxiv.org/abs/1408.2873.

Recommendations

Extraction of foetal ECG from abdominal ECG by nonlinear transformation and estimations
Highlights
- Extracts fECG, fHR, etc., even on complete overlap.
- Uses single abdomen ...
Abstract Background and objective
This paper proposes a simple yet effective method for the extraction of foetal ECG from abdominal ECG which is necessary due to similar spatial and temporal content of mother and foetal ECG.
...
Compensation of in-plane rigid motion for in vivo intracoronary ultrasound image sequence

Intracoronary ultrasound (ICUS) is an interventional imaging modality that is used to acquire a series of tomographic images from the vascular lumen, for diagnosis and treatment of coronary artery diseases in clinical settings. Motion artifacts caused ...
Heartbeat Recognition from ECG Signals Using Hidden Markov Model with Adaptive Features
SNPD '13: Proceedings of the 2013 14th ACIS International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing

A heartbeats recognition system for recognizing four different cardiac diseases was developed based on electrocardiogram (ECG) in this paper. The Hidden Markov model (HMM) was applied to the recognition of heartbeats from electrocardiogram (ECG). The ...

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences

RICAI '22: Proceedings of the 2022 4th International Conference on Robotics, Intelligent Control and Artificial Intelligence

December 2022

1396 pages

ISBN:9781450398343

DOI:10.1145/3584376

Copyright © 2022 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 19 April 2023

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Qualifiers

Research-article
Research
Refereed limited

Conference

RICAI 2022

RICAI 2022: 2022 4th International Conference on Robotics, Intelligent Control and Artificial Intelligence

December 16 - 18, 2022

Dongguan, China

Acceptance Rates

Overall Acceptance Rate 140 of 294 submissions, 48%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

0
Total Citations
88
Total Downloads

Downloads (Last 12 months)53
Downloads (Last 6 weeks)8

Reflects downloads up to 02 Mar 2025

Other Metrics

View Author Metrics

Citations

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

HTML Format

View this article in HTML Format.

Figures

Tables

Media

View Table of Conten