Presentation + Paper
3 April 2023 Longitudinal robustness of a thoracic radiographic AI model for COVID-19 severity prediction
Author Affiliations +
Abstract
The ‘aging’ of artificial intelligence/machine learning (AI/ML) models after initial development and evaluation is known to frequently occur and can pose substantial problems. When there are changes in population, disease characteristics, imaging equipment, or protocols, model performance may start to deteriorate, and the performance predicted in a research setting may no longer hold after deployment (either in a clinical setting or in further research). This data shift phenomenon is a common problem in AI/ML.
We trained and evaluated a previously in-house developed AI/ML model for COVID severity prediction using two COVID-19-positive consecutive adult patient cohorts from a single institution. The first cohort was from the time that the Delta strain was dominant accounting for <95% of cases (June 24-December 11, 2021, 820 patients, 1331 chest radiographs (CXRs)) and the second cohort was from the time that the Omicron variant was dominant (Jan 1-21, 2022, 656 patients, 970 CXRs). Inclusion criteria were COVID-positivity and the availability of CXR imaging exams, in general for patients not admitted to ICU and prior to ICU admission for those patients admitted to ICU as part of their treatment. Exclusion criteria were image acquisition in ICU or the presence of mechanical ventilation. Our image-based AI/ML model was trained to predict, based on each frontal CXR from a COVID-positive patient, whether this patient would be admitted to ICU within a 24, 48, 72, or 96-hour window. The model was evaluated 1) in a cross-sectional test when trained on a subset/tested on an independent subset of the Delta cohort, 2) similarly for the Omicron cohort, and 3) in a longitudinal test when trained on the Delta cohort/tested on the Omicron cohort.
Cohorts were similar in ICU admission rate and fraction of portable CXRs, while immunization rate was higher for the Omicron cohort. The model did not demonstrate signs of aging with performances in the longitudinal test being very similar to those within the Delta cohort, e.g., an area under the ROC curve in the task of predicting ICU admission within 24 hours of 0.76 [0.68; 0.84] when trained/tested within the Delta cohort and 0.77 [0.73; 0.80] for the longitudinal test (p>0.05). The performance within the Omicron cohort was similar as well, at 0.76 [0.66; 0.84].
Our AI/ML model for COVID-severity prediction did not demonstrate signs of aging in a longitudinal test when trained on the Delta cohort and applied as-is to the Omicron cohort.
Conference Presentation
© (2023) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Karen Drukker, Hui Li, and Maryellen L. Giger "Longitudinal robustness of a thoracic radiographic AI model for COVID-19 severity prediction", Proc. SPIE 12467, Medical Imaging 2023: Image Perception, Observer Performance, and Technology Assessment, 124670C (3 April 2023); https://doi.org/10.1117/12.2653636
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Chest imaging

COVID 19

Performance modeling

Radiography

Machine learning

Artificial intelligence

Diseases and disorders

Back to Top