Statistical machine learning to identify traumatic brain injury (TBI) from structural disconnections of white matter networks

doi:10.1016/j.neuroimage.2016.01.056

NeuroImage

Volume 129, 1 April 2016, Pages 247-259

https://doi.org/10.1016/j.neuroimage.2016.01.056 Get rights and content

Highlights

•
Method to identify diffuse axonal injury in mild traumatic brain injury (mTBI) patients from structural connectivity patterns.
•
Network-based statistics (NBS) is used to find significant network differences in mTBI and controls.
•
Fractional anisotropy (FA) features of the different structural connections obtained from NBS used as features.
•
Random forest classifier discriminates between mTBI and controls based on FA features.
•
Discriminative and significant network differences obtained from feature importance of random forest.

Abstract

Identifying diffuse axonal injury (DAI) in patients with traumatic brain injury (TBI) presenting with normal appearing radiological MRI presents a significant challenge. Neuroimaging methods such as diffusion MRI and probabilistic tractography, which probe the connectivity of neural networks, show significant promise. We present a machine learning approach to classify TBI participants primarily with mild traumatic brain injury (mTBI) based on altered structural connectivity patterns derived through the network based statistical analysis of structural connectomes generated from TBI and age-matched control groups. In this approach, higher order diffusion models were used to map white matter connections between 116 cortical and subcortical regions. Tracts between these regions were generated using probabilistic tracking and mean fractional anisotropy (FA) measures along these connections were encoded in the connectivity matrices. Network-based statistical analysis of the connectivity matrices was performed to identify the network differences between a representative subset of the two groups. The affected network connections provided the feature vectors for principal component analysis and subsequent classification by random forest. The validity of the approach was tested using data acquired from a total of 179 TBI patients and 146 controls participants. The analysis revealed altered connectivity within a number of intra- and inter-hemispheric white matter pathways associated with DAI, in consensus with existing literature. A mean classification accuracy of 68.16% ± 1.81% and mean sensitivity of 80.0% ± 2.36% were achieved in correctly classifying the TBI patients evaluated on the subset of the participants that was not used for the statistical analysis, in a 10-fold cross-validation framework. These results highlight the potential for statistical machine learning approaches applied to structural connectomes to identify patients with diffusive axonal injury.

Graphical abstract

Introduction

Traumatic brain injuries (TBIs) are highly heterogeneous, with injury resulting from a unique combination of mechanical forces that interact with each individuals distinctive neuroanatomy ( Meaney and Smith, 2011 ). They are also dynamic, involving a complex cascade of metabolic events that affect important ionic fluxes, neurotransmitter concentrations, cerebral haemodynamic status, edema and neuro-inflammatory responses ( Bigler and Maxwell, 2012, Taber and Hurley, 2013). Numerous studies indicate that a hallmark neuropathological feature of TBI is the presence of diffuse axonal injury (DAI) resulting from damage to axolemma and neuro-filaments within the brain ( Bigler and Maxwell, 2012, Omalu et al., 2005). DAI is estimated to occur in approximately 40%–50% of all persons who sustain a TBI ( Meythaler et al., 2001 ). The pathology of DAI has been investigated in some detail ( Povlishock and Katz, 2005 ). If there is sufficient force to rupture the micro-vasculature, small hemorrhages or microbleeds may also be present, which are reliably detected using CT or MRI based susceptibility-weighted imaging (SWI) ( Benson, 2012 ). Although CT and conventional MRI have been proven to be useful for the clinical management of moderate to severe TBI, conventional neuroimaging does not reliably detect DAI in cases of mild TBI (mTBI) ( Hammoud and Wasserman, 2002, Iverson et al., 2000).

There is now growing evidence to suggest that diffusion tensor imaging (DTI) has the potential to revolutionize how we detect DAI in mTBI ( Arfanakis et al., 2002, Bigler and Maxwell, 2012, Taber and Hurley, 2013, Niogi and Mukherjee, 2010, Shenton et al., 2012, Lipton et al., 2012, Goh et al., 2014, Yeh et al., 2012). As DTI probes the subtle changes in the anisotropic nature of the diffusion of water in white matter (WM), it potentially provides a more sensitive and quantifiable measure of DAI within functionally distinct brain regions, thereby overcoming one of the main limitations of conventional MRI ( Huisman et al., 2004 ). Fractional anisotropy (FA), a widely used quantitative measure of anisotropy, reflects the degree of alignment or organization of axonal pathways and has the potential to identify axonal injury associated with TBI ( Johnson et al., 2013 ). For example, using region of interest (ROI) analyses, significant reduction in FA values in patients with mTBI have been reported within the corpus callosum, internal capsule and centrum semiovale ( Kumar et al., 2009, Inglese et al., 2005, Miles et al., 2008, Matsushita et al., 2011). A common finding was a reduction in FA within the genu or splenium of the corpus callosum for mTBI patients. Voxel wise analyses employing Tract-Based Spatial Statistics (TBSS) have revealed more wide spread injury extending to multiple white matter (WM) pathways projecting through the corpus callosum ( Kinnunen et al., 2010, Wada et al., 2012). In contrast to ROI or voxel wise analyses, diffusion imaging along with probabilistic tractography allows DAI to be investigated within specific WM pathways associated with multiple neural circuits. This approach is appealing as loss in connectivity of neural networks can be readily computed and may potentially guide the planning of personalized treatment and rehabilitation strategies ( Wilde et al., 2006, Xu et al., 2007). Using this approach, number of studies have reported loss of integrity of WM pathways within the corpus callosum, fornix, internal capsule and the peduncular projections in TBI patients compared to normal controls ( Wang et al., 2008, Sugiyama et al., 2013).

In recent years, there has been a trend toward using connectome-based strategies to assess the impact of DAI on network connectivity. For example, Pandit et al. (2013) used a graph-theoretic approach based on functional connectivities to show deviation of small-world characteristics i.e. reduced overall connectivity, longer average path lengths and reduced network efficiency in TBI patients compared to control participants, especially within networks involving the posterior cingulate cortex. Their findings also supported previous findings of reduced WM integrity along the corpus callosum, corticospinal tracts and superior longitudinal fasciculi. Similarly, ( Caeyenberghs et al., 2013 ) combined complementary information from the graph-theoretic analysis of the structural and functional hubs to discriminate between the TBI and the healthy groups, with a specific focus on the switching-motor network. Furthermore, graph-theoretic approach using DTI networks was adopted by Irimia et al., 2012a, Irimia et al., 2012b to introduce a patient-tailored approach to the graphical representation of WM change over time. Van Horn et al. (2012) also employed DTI connectomics to study the white matter damage in the simulated case of severe TBI of Phineas Gage, who was an American railroad construction foreman in the 19^th century. The study revealed that the impact on measures of network connectivity between the area of direct injury and other areas were profound and widespread, which probably contributed to acute and long-term behavioral changes.

Machine learning approaches are being increasingly used to identify discriminative features from the neural connections that best separate a diseased group from healthy cohorts. For example, Aribisala et al. (2010) applied a region-wise support vector machine (SVM) analysis to identify DAI using T1, T2 maps and mean diffusivity measures in both gray and white matter regions in mTBI patients compared to control participants. Recently, Lui et al. (2014) used features from conventional MRI, diffusion MRI and fMRI to build a set of features and employed a feature selection algorithm prior to applying several classifiers like support vector machines (SVMs), Bayesian networks and multilayer perceptrons to separate mTBI from controls.

Many of the studies in TBI research were based on the use of specific regions known to be associated with the loss of WM integrity in DAI. Such approaches may not identify WM injury elsewhere in more remote brain regions other than the point of impact, which impair a wide array of cerebral functions ( Love and Webb, 1992, Gioia et al., 2010). Our limited knowledge of how tract structure relates to cognitive function in normal brain ( Kinnunen et al., 2010 ), leads to the importance of assessment of white matter structure after traumatic brain injury with a highly comprehensive spatial coverage.

Network-based statistical analysis between the TBI and control groups may reveal structural connections based on a stochastic model with the assumption that the mean FA values are different between the groups. These connections being statistically significant in differentiating the two groups, may not however, represent an optimally discriminative feature set in classifying the TBI and control groups due to the following reasons: 1) the exclusive dependence on a stochastic model is an overspecialization when the analysis is performed on a limited number of datasets ( Breiman, 2001b ). Such an analysis therefore may fail in generalization when validated on a larger cohort. 2) The validation of these statistical methods is usually by goodness-of-fit measures such as hypothesis testing used in network-based statistical analysis ( Zalesky et al., 2010 ). The number of connections obtained with such methods, therefore, may be sensitive to the choice of the threshold used in t-tests ( Zalesky et al., 2010 ) and it is often challenging to reach a consensus on how many of these are actually affected due to TBI.

In machine learning, standard feature selection algorithms may find out features or neural connections that are discriminative in classification, while these may not be statistically significant in terms of difference in mean FA values between the TBI and control groups. In other words, the machine learning algorithms do not make any assumption for a data model, instead, a complex, unknown function is learned from the data to predict the responses ( Breiman, 2001b ). The robust cross-validation schemes used in machine learning approaches, i.e. building a classification model on a training set and validating on a test set, where the sets are mutually exclusive, also provides more confidence and generalization to the analysis. Unleashing the potential of both statistical and machine learning methods, therefore, in this work, we have adopted a hybrid strategy combining both the statistical analysis of network connections and then machine learning in order to identify the cortical/sub-cortical connections that differentiate between TBI patients and age-matched controls.

In our method, we used probabilistic tractography to build structural networks and estimate diffusion anisotropy in a population of TBI (mostly with mTBI) and matched controls, with DAI being characterized by the reduction in diffusion anisotropy ( Arfanakis et al., 2002 ). Network-based statistics (NBS) ( Zalesky et al., 2010 ), which are based on generalized linear model (GLM) and t-test statistics, were then applied to extract networks or connections that were different between a subset of the two groups. Tractography data from MRI is very noisy, hence it is very likely that the noise is propagated into pairwise associations measuring FA values over the tracts, thus resulting in a low contrast-to-noise ratio. NBS is known to work better than other generic statistical tests due to its ability to deal with low contrast-to-noise ratio ( Zalesky et al., 2010 ). NBS also involves extensive multiple comparisons correction and reduces the family-wise error rate in a weak sense, which is computationally more expensive in other statistical tests. As discussed before, statistical network separation using t-tests is however, sensitive to thresholds. To avoid a precise choice of the t-threshold that may vary with datasets, we selected the network from NBS comprising of a large number of structural connections and a high spatial coverage, using a reasonably high t-threshold. This ensured that all connections obtained from the NBS network were statistically significant with reduced FA values in TBI than controls.

Retaining a large number of connections may however, present serious challenges to classification methods ( Liu and Motoda, 2007 ), i.e. the curse of dimensionality ( Hastie et al., 2001 ), when many of these obtained from NBS may not be actually predictive in classifying the TBI and healthy groups. With the presence of a large number of features, a classification or prediction model tends to overfit, resulting in performance degradation on unknown test data ( Tang et al., 2014 ). Selecting the most relevant features is therefore, usually suboptimal in building a good predictor ( Guyon and Elisseeff, 2003 ), especially if the features are correlated and redundant. Principal component analysis (PCA) ( Pearson, 1901 ), a common feature selection technique was therefore, applied on the connections obtained from NBS, in order to reduce and re-express the correlated feature space with a set of linearly uncorrelated components.

Random forest (RF) ( Breiman, 2001a ) is known to be robust against overfitting and on noisy data. A RF model was therefore, built using the features obtained from NBS and PCA, which was then used to classify the remaining sets of TBI and normal groups that were not used for feature selection using NBS and PCA. The primary advantage of using RF over any other classifier is its uniqueness in providing the feature importance (Gini importance) within the cross-validation scheme based on the training samples. Unlike other traditional feature selection methods, Gini importance is known to capture the multivariate and non-linear relationships among the structural connections and the disease states ( Langs et al., 2011 ) i.e. TBI or non-TBI in our case. Gini importance has been shown to correlate well with the measures based on feature perturbations ( Breiman, 2001a, Archer and Kimes, 2008) and known to be stable across datasets without a need for explicit regularization, thereby, providing an alternative to the computationally more expensive statistical permutation tests. All these factors led to a conscientious combination of NBS, PCA and RF classifier that was particularly suited to our problem of finding a comprehensive set discriminative connections with a good spatial coverage that were affected in TBI. The novelty of our method, hence, is in the preliminary selection of signature connections using NBS and then refining the signature connections according to the feature importance provided by the Gini measure of random forest.

Section snippets

Participants

Data from 179 TBI participants (142 males, age: 42.8 ± 17 yrs. and 37 females, age: 45.8 ± 17.4 yrs) and 146 age-matched controls (81 males, age: 42.5 ± 16.8 yrs. and 65 females, age: 40.3 ± 17.6 yrs). TBI participants were classified as mild (n = 136), moderate (n = 21) and severe TBI (n = 22) following the the Glasgow Coma Scale (GCS) scores. GCS scores for the patients ranged between 1 and 15, with a mean GCS score of 13.1 ± 3.3, which were either recorded by ambulance officers at the accident scene or obtained

Method

Our approach consisted of two stages: (a) network connectivity analysis for feature selection; and (b) extraction of discriminative features using classification. A schematic diagram depicting these stages is provided in Fig. 1 .

Network connectivity analysis

A group-wise t-test comparison between the TBI (n = 70) patients and the age-matched controls (n = 40), performed using NBS, provided a list of statistically significant neural connections with p < 0.002 at t = 2.60. Fig. 2 shows the significant connections as network graphs obtained from NBS. A total of 115 connections were identified as being significantly different between the two groups, which were eventually reduced and refined by PCA and RF, respectively. A higher threshold used for NBS (t = 3.0)

Discussions

In this study we proposed a novel method to classify TBI and healthy control participants, and identified the statistically significant and discriminative structural connections between the two populations using a machine learning approach. NBS was used to identify statistically significant brain network differences between TBI and normal controls. The results from the NBS analysis revealed injury to a number of intra- and inter-hemispheric WM pathways known to be associated with DAI ( Wada et

Acknowledgments

This research was funded by a National Health and Medical Research Project Grant (ID 519220). The authors wish to thank Ms. V. Dennington and Dr. Y. Harman-Smith for their efforts in subject recruitment.

References (95)

O. Acosta et al.
Automated voxel-based 3D cortical thickness measurement in a combined Lagrangian–Eulerian PDE approach using partial volume maps
Med. Image Anal.
(2009)
P. Aljabar et al.
Multi-atlas based segmentation of brain images: atlas selection and its effect on accuracy
NeuroImage
(2009)
K. Archer et al.
Empirical characterization of random forest variable importance measures
Comput. Stat. Data Anal.
(2008)
K. Caeyenberghs et al.
Brain–behavior relationships in young traumatic brain injury patients: fractional anisotropy measures are highly correlated with dynamic visuomotor tracking performance
Neuropsychologia
(2010)
A. Irimia et al.
b. Neuroimaging of structural pathology and connectomics in traumatic brain injury: toward personalized outcome prediction
Neuroimage Clin.
(2012)
M. Jenkinson et al.
FSL. Neuroimage
(2012)
V.E. Johnson et al.
Axonal pathology in traumatic brain injury
Exp. Neurol.
(2013)
K. Kasahara et al.
Voxel- and atlas-based analysis of diffusion tensor imaging may reveal focal axonal injuries in mild traumatic brain injury — comparison with diffuse axonal injury
Magn. Reson. Imaging
(2012)
G. Langs et al.
Detecting stable distributed patterns of brain activation using Gini contrast
NeuroImage
(2011)
D. Meaney et al.
Biomechanics of concussion
Clin. Sports Med.
(2011)

B.S. Aribisala et al.

Classification of traumatic brain injury patients using multi-parametric automatic analysis of quantitative MRI scans

Y. Bai et al.

Model-based registration to correct for motion between acquisitions in diffusion MR imaging

R. Benson

Detection of hemorrahagic and axonal pathology in mild traumatic brain injury using advanced MRI: implications for neurorehabilitation

NeuroRehabilitation

(2012)

E. Bigler et al.

Neuropathology of mild traumatic brain injury: relationship to neuroimaging findings

Brain Imaging Behav.

(2012)

B. Boser et al.

A Training Algorithm For Optimal Margin Classifiers, In: Proc. Of the 5th annual Workshop On Computational Learning Theory (COLT’92)

(1992)

S. Bouix et al.

Increased gray matter diffusion anisotropy in patients with persistent post-concussive symptoms following mild traumatic brain injury

PLoS One

(2013)

L. Breiman

Random forests

Mach. Learn.

(2001)

L. Breiman

Statistical modeling: the two cultures

Stat. Sci.

I. Guyon et al.

An introduction to variable and feature selection

J. Mach. Learn. Res.

(2003)

I. Guyon et al.

Gene selection for cancer classification using support vector machines

Mach. Learn.

(2002)

Cited by (57)

Integrating unsupervised and supervised learning techniques to predict traumatic brain injury: A population-based study
2023, Intelligence-Based Medicine
This work aimed to identify pre-existing health conditions of patients with traumatic brain injury (TBI) and develop predictive models for the first TBI event and its external causes by employing a combination of unsupervised and supervised learning algorithms. We acquired up to five years of pre-injury diagnoses for 488,107 patients with TBI and 488,107 matched control patients who entered the emergency department or acute care hospitals between April 1st, 2002, and March 31st, 2020. Diagnoses were obtained from the Ontario Health Insurance Plan (OHIP) database which contains province-wide claims data by physicians in Ontario, Canada for inpatient and outpatient services. A screening process was conducted on the OHIP diagnostic codes to limit the subsequent analysis to codes that were predictive of TBI, which concluded that 314 codes were significantly associated with TBI. The Latent Dirichlet Allocation (LDA) model was applied to the diagnostic codes and generated an optimal number of 19 topics that concur with published literature but also suggest other unexplored areas. Estimated word-topic probabilities from the LDA model helped us detect pre-morbid conditions among patients with TBI by uncovering the underlying patterns of diagnoses, meanwhile estimated document-topic probabilities were utilized in variable creation as form of a dimension reduction. We created 19 topic scores for each patient in the cohort which were utilized along with socio-demographic factors for Random Forest binary classifier models. Test set performances evaluated using area under the receiver operating characteristic curve (AUC) were: TBI event (AUC = 0.85), external cause of injury: falls (AUC = 0.85), struck by/against (AUC = 0.83), cyclist collision (AUC = 0.76), motor vehicle collision (AUC = 0.83). Our analysis successfully demonstrated the feasibility of using machine learning to predict TBI due to various external causes and identified the most important factors that contribute to this prediction.
The detection of mild traumatic brain injury in paediatrics using artificial neural networks
2021, Computers in Biology and Medicine
Citation Excerpt :
Diagnostic and prognostic models based on ML have been booming as supportive tools in the medical field over the years [18,20]. Many studies have been conducted to predict TBI outcome using ML and have utilized various data types [21–25]. A hybrid ML model comprising decision tree (DT) and ANN was applied to predict the attributes crucial for the prediction of TBI in patients.
Head computed tomography (CT) is the gold standard in emergency departments (EDs) to evaluate mild traumatic brain injury (mTBI) patients, especially for paediatrics. Data-driven models for successfully classifying head CT scans that have mTBI will be valuable in terms of timeliness and cost-effectiveness for TBI diagnosis. This study applied two different machine learning (ML) models to diagnose mTBI in a paediatric population collected as part of the paediatric emergency care applied research network (PECARN) study between 2004 and 2006. The models were conducted using 15,271 patients under the age of 18 years with mTBI and had a head CT report. In the conventional model, random forest (RF) ranked the features to reduce data dimensionality and the top ranked features were used to train a shallow artificial neural network (ANN) model. In the second model, a deep ANN applied to classify positive and negative mTBI patients using the entirety of the features available. The dataset was divided into two subsets: 80% for training and 20% for testing using five-fold cross-validation. Accuracy, sensitivity, precision, and specificity were calculated by comparing the model's prediction outcome to the actual diagnosis for each patient. RF ranked ten clinical demographic features and twelve CT-findings; the hybrid RF-ANN model achieved an average specificity of 99.96%, sensitivity of 95.98%, precision of 99.25%, and accuracy of 99.74% in identifying positive mTBI from negative mTBI subjects. The deep ANN proved its ability to carry out the task efficiently with an average specificity of 99.9%, sensitivity of 99.2%, precision of 99.9%, and accuracy of 99.9%. The performance of the two proposed models demonstrated the feasibility of using ANN to diagnose mTBI in a paediatric population. This is the first study to investigate deep ANN in a paediatric cohort with mTBI using clinical and non-imaging data and diagnose mTBI with balanced sensitivity and specificity using shallow and deep ML models. This method, if validated, would have the potential to reduce the burden of TBI evaluation in EDs and aide clinicians in the decision-making process.
Feasibility study to identify machine learning predictors for a Virtual Environment Grocery Store
2024, Virtual Reality
Machine learning classification analysis for an adaptive virtual reality Stroop task
2023, Virtual Reality
Clinical Performance of a Multiparametric MRI-based Post Concussive Syndrome Index
2023, Research Square
Integrating Unsupervised and Supervised Learning Techniques to Predict Traumatic Brain Injury: A Population-Based Study
2023, SSRN

View all citing articles on Scopus

View full text

Statistical machine learning to identify traumatic brain injury (TBI) from structural disconnections of white matter networks

Highlights

Abstract

Graphical abstract

Introduction

Section snippets

Participants

Method

Network connectivity analysis

Discussions

Acknowledgments

Med. Image Anal.

NeuroImage

Comput. Stat. Data Anal.

Neuropsychologia

Neuroimage Clin.

FSL. Neuroimage

Exp. Neurol.

Magn. Reson. Imaging

NeuroImage

Clin. Sports Med.

Arch. Phys. Med. Rehabil.

Comput. Methods Prog. Biomed.

NeuroImage

NeuroImage

NeuroImage

NeuroImage

NeuroImage

NeuroImage

NeuroImage

NeuroImage

NeuroImage

Arch. Neurol.

NeuroImage

NeuroImage

Diffuse axonal injury due to non-missile head injury in humans: an analysis of 45 cases

Ann. Neurol.

Deep intracerebral (basal ganglia) haematomas in fatal non-missile head injury in man

J. Neurol. Neurosurg. Psychiatry

Diffusion tensor imaging studies of mild traumatic brain injury: a meta-analysis

J. Neurol. Neurosurg. Psychiatry

Diffusion tensor mr imaging in diffuse axonal injury

Am. J. Neuroradiol.

Classification of traumatic brain injury patients using multi-parametric automatic analysis of quantitative MRI scans

Model-based registration to correct for motion between acquisitions in diffusion MR imaging

Detection of hemorrahagic and axonal pathology in mild traumatic brain injury using advanced MRI: implications for neurorehabilitation

NeuroRehabilitation

Neuropathology of mild traumatic brain injury: relationship to neuroimaging findings

Brain Imaging Behav.

A Training Algorithm For Optimal Margin Classifiers, In: Proc. Of the 5th annual Workshop On Computational Learning Theory (COLT’92)

Increased gray matter diffusion anisotropy in patients with persistent post-concussive symptoms following mild traumatic brain injury

PLoS One

Random forests

Mach. Learn.

Statistical modeling: the two cultures

Stat. Sci.

Classification and Regression Trees

Topological correlations of structural and functional networks in patients with traumatic brain injury

Front. Hum. Neurosci.

Support-vector networks

Mach. Learn.

Prognostic and diagnostic potential of the structural neuroanatomy of depression

PLoS One

Prediction of individual brain maturity using fMRI

Science

Increased cortical–limbic anatomical network connectivity in major depression revealed by diffusion tensor imaging

PLoS One

Relative brain displacement and deformation during constrained mild frontal head impact

J. R. Soc. Interface

Executive function in the real world: BRIEF lessons from Mark Ylvisaker

J. Head Trauma Rehabil.

Neuroinformatics challenges to the structural, connectomic, functional, and electrophysiological multimodal imaging of human traumatic brain injury

Front. Neuroinformatics

An introduction to variable and feature selection

J. Mach. Learn. Res.

Gene selection for cancer classification using support vector machines

Mach. Learn.