Abstract
Recently, high-dimensional shape data (HDSD) has been demonstrated to be informative in describing subcortical brain morphometry in several disorders. While HDSD may serve as a biomarker of disease, its high dimensionality may require careful treatment in its application to machine learning. Here, we compare several possible approaches for feature selection and pattern classification using HDSD. We explore the efficacy of three candidate feature selection (FS) methods: Guided Random Forest (GRF), LASSO and no feature selection (NFS). Each feature set was applied to three classifiers: Random Forest (RF), Support Vector Machines (SVM) and Naïve Bayes (NB). Each model was cross-validated using two diagnostic contrasts: Alzheimer’s Disease and mild cognitive impairment; each relative to matched controls. GRF and NFS outperformed LASSO as FS methods and were comparably competitive. NB underperformed relative to RF and SVM, which were comparable in performance. Our results advocate the NFS-RF approach for its speed, simplicity and interpretability.
This is a preview of subscription content, log in via an institution.
Preview
Unable to display preview. Download preview PDF.
References
Gutman, B., Jahanshad, N., Ching, C., Wang, Y., Kochunov, P., Nichols, T., Thompson, P.: Medial demons registration localizes the degree of genetic influence over subcortical shape variability: an N = 1480 meta-analysis. In: International Symposium on Biomedical Imaging (2015)
Wade, B., Valcour, V., Busovaca, E., Esmaeili-Firidouni, P., Joshi, S., Wang, Y., Thompson, P.: Subcortical shape and volume abnormalities in an elderly HIV+ cohort. In: Proceedings of SPIE International Society of Optical Engineering (2015)
Mueller, S.G., Weiner, M.W., Thal, L.J., Petersen, R.C., Jack, C.R., Jagust, W., Trojanowski, J.Q., Toga, A.W., Beckett, L.: Ways toward an early diagnosis in Alzheimer’s disease: The Alzheimer’s Disease Neuroimaging Initiative (ADNI). Alzheimer’s & Dementia : The Journal of the Alzheimer’s Association 1, 55–66 (2005)
Dale, A.M., Fischl, B., Sereno, M.I.: Cortical surface-based analysis. I. Segmentation and surface reconstruction. NeuroImage 9, 179–194 (1999)
Gutman, B., Wang, Y., Rajagopalan, P., Toga, A., Thompson, P.: Shape matching with medial curves and 1-D group-wise registration. In: International Symposium on Biomedical Imaging, vol. 9, pp. 716–719 (2012)
Yeo, B.T.T., Sabuncu, M.R., Vercauteren, T., Ayache, N., Fischl, B., Golland, P.: Spherical Demons: Fast Diffeomorphic Landmark-Free Surface Registration. IEEE Transactions on Medical Imaging 29, 650–668 (2010)
Deng, H.: Guided Random Forest in the RRF Package (2013). arXiv:1306.0237 1-2
Tibshirani, R.: Regression Shrinkage and Selection Via the Lasso. Journal of the Royal Statistical Society, Series B 267–288 (1996)
Breiman, L.: Random Forests. Machine Learning 45, 5–32 (2001)
DeLong, E.R., DeLong, D.M., Clarke-Pearson, D.L.: Comparing the Areas under Two or More Correlated Receiver Operating Characteristic Curves: A Nonparametric Approach. Biometrics 44, 837–845 (1988)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Wade, B.S.C., Joshi, S.H., Gutman, B.A., Thompson, P.M. (2015). Machine Learning on High Dimensional Shape Data from Subcortical Brain Surfaces: A Comparison of Feature Selection and Classification Methods. In: Zhou, L., Wang, L., Wang, Q., Shi, Y. (eds) Machine Learning in Medical Imaging. MLMI 2015. Lecture Notes in Computer Science(), vol 9352. Springer, Cham. https://doi.org/10.1007/978-3-319-24888-2_5
Download citation
DOI: https://doi.org/10.1007/978-3-319-24888-2_5
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-24887-5
Online ISBN: 978-3-319-24888-2
eBook Packages: Computer ScienceComputer Science (R0)