Processing math: 50%
Synthesizing Feature-Aligned and Category-Aware Electronic Medical Records for Intracranial Aneurysm Rupture Prediction | IEEE Journals & Magazine | IEEE Xplore

Synthesizing Feature-Aligned and Category-Aware Electronic Medical Records for Intracranial Aneurysm Rupture Prediction


The TransCGAN model features two main components: a Transformer framework with statistical loss, which enhances the alignment of feature-wise contextual relationship and ...

Abstract:

Rupture prediction is crucial for precise treatment and follow-up management of patients with intracranial aneurysms (IAs). Considerable machine learning (ML) methods hav...Show More

Abstract:

Rupture prediction is crucial for precise treatment and follow-up management of patients with intracranial aneurysms (IAs). Considerable machine learning (ML) methods have been proposed to improve rupture prediction by leveraging electronic medical records (EMRs), however, data scarcity and category imbalance strongly influence performance. Thus, we propose a novel data synthesis method i.e., Transformer-based conditional GAN (TransCGAN), to synthesize highly authentic and category-aware EMRs to address above challenges. Specifically, we first align feature-wise context relationship and distribution between synthetic and original data to enhance synthetic data quality. To achieve this, we first integrate the Transformer structure into GAN to match the contextual relationship by processing the long-range dependencies among clinical factors and introduce a statistical loss to maintain distributional consistency by constraining the mean and variance of the synthesis features. Additionally, a conditional module is designed to assign the category of the synthesis data, thereby addressing the challenge of category imbalance. Subsequently, the synthetic data are merged with the original data to form a large-scale and category-balanced training dataset for IAs rupture prediction. Experimental results show that using TransCGAN's synthetic data enhances classifier performance, achieving AUC of 0.89 and outperforming state-of-the-art resampling methods by 5\%–33\% in F1 score.
The TransCGAN model features two main components: a Transformer framework with statistical loss, which enhances the alignment of feature-wise contextual relationship and ...
Published in: IEEE Journal of Biomedical and Health Informatics ( Volume: 28, Issue: 12, December 2024)
Page(s): 7420 - 7433
Date of Publication: 23 August 2024

ISSN Information:

PubMed ID: 39178095

Funding Agency:


Contact IEEE to Subscribe

References

References is not available for this document.