Abstract
Recent studies have shown that brain lesions following stroke can be probabilistically mapped onto disconnections of white matter tracts, and that the resulting “disconnectome” is predictive of the patient’s behavioral deficits. Disconnectome maps are sparse, high-dimensional 3D matrices that require unsupervised dimensionality reduction followed by supervised learning for prediction of the associated behavioral data. However, the optimal machine learning pipeline for disconnectome data still needs to be identified. We examined four dimensionality reduction methods at varying levels of compression and used the extracted features as input for cross-validated regularized regression to predict the associated language and motor deficits. Features extracted by Principal Component Analysis and Non-Negative Matrix Factorization were found to be the best predictors, followed by Independent Component Analysis and Dictionary Learning. Optimizing the number of extracted features improved predictive accuracy and greatly reduced model complexity. Moreover, the choice of dimensionality reduction technique was found to optimally combine with a specific type of regularized regression (ridge vs. LASSO). Overall, our findings represent an important step towards an optimal pipeline that yields high prediction accuracy with a small number of features, which can also improve model interpretability.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Corbetta, M., et al.: Common behavioral clusters and subcortical anatomy in stroke. Neuron 85, 927–941 (2015)
Rorden, C., Karnath, H.O.: Using human brain lesions to infer function: a relic from a past era in the fMRI age. Nat. Rev. Neurosci. 5, 813–819 (2004)
Price, C.J., Hope, T.M., Seghier, M.L.: Ten problems and solutions when predicting individual outcome from lesion site after stroke. Neuroimage 145, 200–208 (2017)
Siegel, J.S., et al.: Disruptions of network connectivity predict impairment in multiple behavioral domains after stroke. Proc. Natl. Acad. Sci. USA 113, E4367–E4376 (2016)
Thiebaut de Schotten, M., Foulon, C., Nachev, P.: Brain disconnections link structural connectivity with function and behaviour. Nat. Commun. 11, 5094 (2020)
Salvalaggio, A., de Filippo De Grazia, M., Zorzi, M., de Schotten, M.T., Corbetta, M.: Post-stroke deficit prediction from lesion and indirect structural and functional disconnection. Brain 143, 2173–2188 (2020)
Foulon, C., et al.: Advanced lesion symptom mapping analyses and implementation as BCBtoolkit. Gigascience 7, 1–17 (2018)
Chauhan, S., et al.: A comparison of shallow and deep learning methods for predicting cognitive performance of stroke patients from MRI lesion images. Front. Neuroinf. 13, 53 (2019)
Mwangi, B., Tian, T.S., Soares, J.C.: A review of feature reduction techniques in Neuroimaging. Neuroinformatics 12, 229–244 (2014)
Calesella, F., Testolin, A., De Filippo De Grazia, M., Zorzi M.: A comparison of feature extraction methods for prediction of neuropsychological scores from functional connectivi-ty data of stroke patients. Brain Inf. 8, 8 (2021)
Guyon, I., Elisseeff, A.: An introduction to variable and feature selection. J. Mach. Learn. Res. 3, 1157–1182 (2003)
Hua, J., Tembe, W.D., Dougherty, E.R.: Performance of feature-selection methods in the classification of high-dimension data. Pattern Recogn. 42, 409–424 (2009)
Carroll, M.K., Cecchi, G.A., Rish, I., Garg, R., Rao, A.R.: Prediction and interpretation of distributed neural activity with sparse models. Neuroimage 44, 112–122 (2009)
Teipel, S.J., Kurth, J., Krause, B., Grothe, M.J.: The relative importance of imaging markers for the prediction of Alzheimer’s disease dementia in mild cognitive impairment - beyond classical regression. NeuroImage Clin. 8, 583–593 (2015)
Cui, Z., Gong, G.: The effect of machine learning regression algorithms and sample size on individualized behavioral prediction with functional connectivity features. Neuroimage 178, 622–637 (2018)
Jollans, L., et al.: Quantifying performance of machine learning methods for neuroimaging data. Neuroimage 199, 351–365 (2019)
Jolliffe, I.T.: Principal component analysis. In: Encyclopedia of Statistics in Behavioral Science (2002)
Hyvärinen, A., Oja, E.: Independent component analysis: algorithms and applications. Neural Netw. 13, 411–430 (2000)
Lee, D.D., Seung, H.S.: Algorithms for non-negative matrix factorization. In: Advances in Neural Information Processing Systems, pp. 556–562 (2001)
Mairal, J., Bach, F., Ponce, J., Sapiro, G.: Online dictionary learning for sparse coding. In: ACM International Conference Proceeding Series, pp. 689–696 (2009)
Tibshirani, R.: Regression shrinkage and selection via the lasso. J. Roy. Stat. Soc. Ser. B (Methodol.) 58, 267–288 (1996)
Hastie, T., Tibshirani, R., Friedman, J.: The Elements of Statistical Learning. Springer Series in Statistics. Springer, New York (2009). https://doi.org/10.1007/978-0-387-84858-7
Schwarz, G.: Estimating the dimension of a model. Ann. Stat. 6, 461–464 (1978)
Vieira, S., Pinaya, W.H., Mechelli, A.: Using deep learning to investigate the neuroimaging correlates of psychiatric and neurological disorders: methods and applications. Neurosci. Biobehav. Rev. 74, 58–75 (2017)
Acknowledgments
This work was supported by grants from the Italian Ministry of Health (RF-2013–02359306 to MZ, Ricerca Corrente to IRCCS Ospedale San Camillo) and by MIUR (Dipartimenti di Eccellenza DM 11/05/2017 n. 262 to the Department of General Psychology). We are grateful to Prof. Maurizio Corbetta for providing the stroke dataset, which was collected in a study funded by grants R01 HD061117–05 and R01 NS095741.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Zorzi, M., De Filippo De Grazia, M., Blini, E., Testolin, A. (2021). Assessment of Machine Learning Pipelines for Prediction of Behavioral Deficits from Brain Disconnectomes. In: Mahmud, M., Kaiser, M.S., Vassanelli, S., Dai, Q., Zhong, N. (eds) Brain Informatics. BI 2021. Lecture Notes in Computer Science(), vol 12960. Springer, Cham. https://doi.org/10.1007/978-3-030-86993-9_20
Download citation
DOI: https://doi.org/10.1007/978-3-030-86993-9_20
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-86992-2
Online ISBN: 978-3-030-86993-9
eBook Packages: Computer ScienceComputer Science (R0)