Ensemble of SVM, Random-Forest and the BSWiMS Method to Predict and Describe Structural Associations with Fluid Intelligence Scores from T1-Weighed MRI

Tamez-Pena, Jose; Orozco, Jorge; Sosa, Patricia; Valdes, Alejandro; Nezhadmoghadam, Fahimeh

doi:10.1007/978-3-030-31901-4_6

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 11791))

Included in the following conference series:

Challenge in Adolescent Brain Cognitive Development Neurocognitive Prediction

1681 Accesses
4 Citations

Abstract

The degree of association between fluid intelligence and neuroanatomy is important in refining our understanding of brain development. The primary goal of this work is twofold: to predict fluid intelligence from T1-weighed MRI, and to describe the MRI features that are associated with fluid intelligence. In this paper, we propose to ensemble the predictions of three machine learning strategies: Support Vector Machine (SVM), Random Forest (RF), and Bootstrapped Step Wise Model Selection (BSWiMS). Gender-stratified SVM was trained on children using age (ages 9–10), plus 122 volumetric scores provided by the ABCD challenge team. RF and BSWiMS were gender-stratified and trained using cubic root transformed data, summarized by left-right mean and relative absolute differences, and augmented by 19 volumetric statistical descriptors of major anatomical regions. Then, the transformed-augmented feature set was adjusted by age and the mean volume of the training set. The predictions of the three models were averaged to get the final prediction on each one of the test subjects. The Mean Squared Error (MSE) of MRI-predicted fluid intelligence on the test subjects was 100.89. The top features associated with fluid intelligence were the volume of the pons white mater and the volume of the parahippocampal gyrus.

You have full access to this open access chapter, Download conference paper PDF

Machine Learning on High Dimensional Shape Data from Subcortical Brain Surfaces: A Comparison of Feature Selection and Classification Methods

NBS-Predict: An Easy-to-Use Toolbox for Connectome-Based Machine Learning

Identification of Cognitive Deficits Based on T1-Weighted Magnetic Resonance Imaging

Keywords

1 Introduction

Fluid intelligence is a major factor in measuring general intelligence [1]. Therefore, the ABCD, the largest long-term study of brain development and child health in the United States, has captured MRI data as well as genetics, neuropsychological, behavioral, and other health assessments to “determine how childhood experiences (such as sports, videogames, social media, unhealthy sleep patterns, and smoking) interact with each other and with a child’s changing biology to affect brain development and social, behavioral, academic, health, and other outcomes”. Hence, one specific challenge is determining if machine learning prediction methods can be used to predict fluid intelligence from T1-weighted MRI [2].

Regarding prediction methods, there are hundreds of methods that can be used to predict a continuous variable from a set of features. Random Forest [3], Support Vector Machines [4], Recursive partitioning for classification [5], and Least Absolute Shrinkage and Selection Operator (LASSO) [6] are among the main machine learning approaches that can learn a linear structure from a set of predictors. Furthermore, there are several ways to improve prediction by first selecting the features that are associated with the data. Once selected, they’re used in a regression equation. Bootstrap Stage-Wise Model Selection (BSWiMS) used this strategy, plus bootstrap samples, to extract a set of regression models that predict the desired outcome [7]. The aim of this work is to explore the behavior of these approaches on the ABCD challenge data, report the features used to predict the fluid intelligence, build a final prediction based on top three predictors, and finally predict validation and testing data sets.

2 Materials and Methods

Data was provided by the ABCD challenge team: 3739, 415, and 4515 children were used for training, validation, and testing, respectively. All the children were measured by standardized protocols, and MRI features were extracted from T1-weighted images using a standard procedure described by the ABCD challenge organizers [2]. In this paper, we describe the methods used to predict the fluid intelligence score from the provided data sets.

2.1 Data Processing

We transformed all the volume scores provided via the challenge organizers by computing the cubic root on all measurements. This operation transformed the inherent distribution of volumetric errors into a Gaussian distribution and consequently mitigating the influence of measurement errors on each one of the volumes’ distributions. Furthermore, brain asymmetry has been associated with several chronic neurological diseases such as Schizophrenia and Alzheimer [8, 9]; therefore, here we explore the hypothesis that structural asymmetry may also be associated with fluid intelligence. To explore this hypothesis, we computed the mean and absolute difference of the left and right volumes of each the brain structures. Finally, we augmented the set of transformed features by adding the descriptive features described in Table 1. All transformations and descriptive computations were done in Microsoft Excel (2013).

Table 1. Descriptive features derived from volumetric data. Transformed volumes were computed using the cubic root transformation.

Full size table

Once we computed the descriptive features, we adjusted all the features (transformed and descriptive) to any residual associations to volume or age, and we did this independently on males and females. The adjustment was done by computing the residuals of the following linear model:

$$ f_{i} = \beta_{o} + \beta_{1} Age + \beta_{2} AVGVolume + \beta_{3} Age*AVGVolume, $$

(1)

on all features using the FRESA.CAD package [7].

2.2 Benchmarking Machine Learning Methods

The adjusted training data was explored by a Benchmarking function provided by the FRESA.CAD R package. The FRESA.CAD benchmarking function does a repeated hold-out cross-validation (RHOCV) [10] of the following machine learning methods: Recursive partitioning (RPART), Random Forest (RF), Support Vector Machine (SVM), LASSO, and Bootstrapped Stage-Wise Model Selection (BSWiMS) [3,4,5,6, 11]. Furthermore, the benchmarking evaluates the efficiency of different feature selection methods like minimum redundancy maximum relevance mRMR [11] and univariate selection adjusted for false discovery rate [12]. Features selected by these methods were used to train robust regression, ridge regression, and linear regression models. Trained models were evaluated in the holdout testing data. Finally, internal-test performance results were reported. Besides exploring the regression models using the augmented data, we also explored the capability of the original ABCD challenge volume scores to model fluid intelligence.

All the benchmarks were run separately on males’ and females’ data set using 90% of the data for training and holding 10% of the data for test evaluation. The hold-out procedure was repeated 100 times. Figures 1 and 2 show the results of benchmarking the regression methods on the ABCD challenge data set. The Pearson correlation between fluid intelligence and benchmarking test results indicated that BSWiMS had an equivalent performance on both males’ and females’ data sets on the augmented training data. On the other hand, the CV results indicate that SVM was able to predict males’ and females’ datasets based only on features provided by the original training set.

2.3 Method Ensemble

The results of the Benchmarking function on the training data set were used to determine successful machine learning strategies that may be able to predict the fluid intelligence on the validation and testing data. We selected three methods according to the benchmark results. The first method was SVM. SVM showed a statistically significant association using original features. The second method was the BSWiMS that selected linear models composed on a small number of features (n < 8), and it had an equivalent performance on both males and females. Finally, we selected the Random Forest method because RF uses an independent learning methodology than SVM and BSWiMS, and the benchmarking results indicated that it was able to predict fluid intelligence. Regarding the selected modeling methods, SVM used the radial kernel, RF is based on decision trees, and BSWiMS built simple linear models. Hence, they approached the regression problem using three different strategies. Once we selected the methods, we refit the SVM, BSWiMS, and RF using the training data sets. SVM was fitted to male and female by training original data using the radial basis function, and we did not attempt to optimize any of the SVM parameters. Once trained, we predicted the validation and testing sets. For BSWiMS and RF we refit the models using the augmented training data set. First, we fitted the BSWiMS model using default parameters and all the sex-stratified training data. The BSWiMS procedure was repeated 10 times, and the final model coefficients were bagged. We also fitted the data utilizing the RF method, by using default parameters. Hence, no parameter optimization was attempted. To minimize prediction noise, we repeated the sex-stratified RF fitting 15 times. Once BSWiMS and RF were fitted on males and females, we proceeded to predict the validation and to test datasets. First by cubic-root transformation, then we augmented the features using the described features. Finally, we adjusted the data using the coefficients of Eq. 1. The final prediction was the average of the three methods.

2.4 Prediction Evaluation

The evaluation of the results was done using the ABCD challenge validation data set using the R code provided by challenge organizers. The code evaluated the mean square error (MSE) and the R² between predicted scores and the actual scores.

3 Results

3.1 Benchmarking Results

Figures 1 and 2 shows the Pearson correlation results of the repeated hold-out validation strategy using only validation data. The Pearson correlation varied from not being statistically significant, for BSWiMS method on the original data set, to a maximum correlation of 0.16 with 95% confidence interval of 0.12 to 0.21. Figure 3 shows the most commonly selected features by the different filtering strategies using the transformed data sets. It is clear that the features that are useful to predict fluid intelligence are different between males and females. Only five features: |frontalsup_gm|, |wm400_wm|, insula_gm, parahippocampal_gm, and parietalinf_gm were common between males and females. Among these five features, two were related to differences between left and right volumes, and three were related to volume size.

3.2 Feature Relevance

The BSWiMS procedure returns a summary analysis of the relevant features required to model the fluid intelligence score using the sex-stratified transformed data sets. Table 2 shows the required features needed to model the male intelligence scores. Table 3 shows the features involved in predicting female scores. The tables show that the volume of the parahippocampal gyrus and the volume of the pons are an important feature associated with fluid intelligence scores in both males and females. Moreover, the other features are unique to males and females. Among them, four features are associated with differences between the left and right volumes in males while only two features associated with differences between and right volumes affect females significantly.

Table 2. Male Model. |°| are features related to the left-right absolute difference

Full size table

Table 3. Female Model. |°| are features related to the left-right absolute difference

Full size table

3.3 Models Performance

The training results of the BSWiMS procedure indicated that the linear models associated with fluid intelligence can explain 3.6% and 2.2% of the male and the female variance, respectively. Table 4 shows the MSE of the internal cross-validation of the models based. The validation results of the ensemble procedure showed that structural MRI predicted 0.7% (MSE = 71.86) of the fluid intelligence variance and that this prediction was marginally significant (p-value = 0.06). Finally, the test results indicated that the ensemble method had an MSE of 100.89.

Table 4. MSE of the internal validation of the three models

Full size table

4 Discussion

Machine learning strategies can be used to explore systematically different methodologies to model and predict abstracts outcomes like fluid intelligence. The exploration of volumetric data and transformed data showed that data transformation can be useful in finding anatomical structures that may affect the expression or development of child fluid intelligence. Several brain structures were found to explain the difference in the intelligence score, and the aggregation of them in linear models increased the power to describe the difference between children. We also found that differences between the left and right structures in the brain are associated with the fluid intelligence scores. Clear gender differences were also observed. Males were modeled by simple linear models while explaining the female fluid intelligence required pattern matching methods. In other words, there are patterns in female fluid intelligence that are difficult to model by simple linear models. Even with this difficulty, we found that there are significant associations between changes in the pons white mater, parahippocampal gyrus, and fluid intelligence in both males in females and that this association can only be explained 0.7% of the variance. Hence, the reported findings are indicating that there are factors beyond differences in brain structures that play a more important role in the development of child fluid intelligence.

5 Conclusion

Machine learning methods can be used to discover the main factors associated with fluid intelligence. Here, we found that brain morphology plays a small role in explaining the difference in fluid intelligence in children between ages 9 and 10.

References

Carroll, J.B.: Human Cognitive Abilities by John B. Carroll (2019)
Google Scholar
Pohl, K.M., Thompson, W.K.: https://sibis.sri.com/abcd-np-challenge/
Breiman, L.: Random forests. Mach. Learn. 45, 5–32 (2001)
Article Google Scholar
Cortes, C., Vapnik, V.: Support-vector networks. Mach. Learn. 20, 273–297 (1995)
MATH Google Scholar
De’ath, G., Fabricius, K.E.: Classification and regression trees: a powerful yet simple technique for ecological data analysis. Ecology 81, 3178–3192 (2000)
Article Google Scholar
Tibshirani, R.: Regression shrinkage and selection via the Lasso. J. R. Stat. Soc. Ser. B-Methodol. 58, 267–288 (1996)
MathSciNet MATH Google Scholar
Tamez-Pena, J.G., Tamez-Pena, M.J.G.: Package ‘FRESA. CAD’ (2014)
Google Scholar
Toga, A.W., Thompson, P.M.: Mapping brain asymmetry. Nat. Rev. Neurosci. 4, 37–48 (2003)
Article Google Scholar
Thompson, P.M., et al.: Cortical variability and asymmetry in normal aging and Alzheimer’s disease. Cereb. Cortex 8, 492–509 (1998)
Article Google Scholar
Kim, J.H.: Estimating classification error rate: repeated cross-validation, repeated hold-out and bootstrap. Comput. Stat. Data Anal. 53, 3735–3745 (2009)
Article MathSciNet Google Scholar
Ding, C., Peng, H.: Minimum redundancy feature selection from microarray gene expression data. J. Bioinform. Comput. Biol. 3, 185–205 (2005)
Article Google Scholar
Benjamini, Y., Hochberg, Y.: Controlling the false discovery rate - a practical and powerful approach to multiple testing. J. R. Stat. Soc. Ser. B-Methodol. 57, 289–300 (1995)
MathSciNet MATH Google Scholar

Download references

Author information

Authors and Affiliations

Tecnologico de Monterrey, 64849, Monterrey, NL, Mexico
Jose Tamez-Pena, Jorge Orozco, Patricia Sosa, Alejandro Valdes & Fahimeh Nezhadmoghadam

Authors

Jose Tamez-Pena
View author publications
You can also search for this author in PubMed Google Scholar
Jorge Orozco
View author publications
You can also search for this author in PubMed Google Scholar
Patricia Sosa
View author publications
You can also search for this author in PubMed Google Scholar
Alejandro Valdes
View author publications
You can also search for this author in PubMed Google Scholar
Fahimeh Nezhadmoghadam
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Jose Tamez-Pena .

Editor information

Editors and Affiliations

Stanford University, Stanford, CA, USA
Kilian M. Pohl
University of California, San Diego, La Jolla, CA, USA
Wesley K. Thompson
Stanford University, Stanford, CA, USA
Ehsan Adeli
Children’s National Health System, Washington, DC, USA
Marius George Linguraru

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Tamez-Pena, J., Orozco, J., Sosa, P., Valdes, A., Nezhadmoghadam, F. (2019). Ensemble of SVM, Random-Forest and the BSWiMS Method to Predict and Describe Structural Associations with Fluid Intelligence Scores from T1-Weighed MRI. In: Pohl, K., Thompson, W., Adeli, E., Linguraru, M. (eds) Adolescent Brain Cognitive Development Neurocognitive Prediction. ABCD-NP 2019. Lecture Notes in Computer Science(), vol 11791. Springer, Cham. https://doi.org/10.1007/978-3-030-31901-4_6

Download citation

DOI: https://doi.org/10.1007/978-3-030-31901-4_6
Published: 10 October 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-31900-7
Online ISBN: 978-3-030-31901-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Ensemble of SVM, Random-Forest and the BSWiMS Method to Predict and Describe Structural Associations with Fluid Intelligence Scores from T1-Weighed MRI

Abstract

Similar content being viewed by others

Machine Learning on High Dimensional Shape Data from Subcortical Brain Surfaces: A Comparison of Feature Selection and Classification Methods

NBS-Predict: An Easy-to-Use Toolbox for Connectome-Based Machine Learning

Identification of Cognitive Deficits Based on T1-Weighted Magnetic Resonance Imaging

Keywords

1 Introduction

2 Materials and Methods

2.1 Data Processing

2.2 Benchmarking Machine Learning Methods

2.3 Method Ensemble

2.4 Prediction Evaluation

3 Results

3.1 Benchmarking Results

3.2 Feature Relevance

3.3 Models Performance

4 Discussion

5 Conclusion

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us