Multiple instance learning for classification of dementia in brain MRI

doi:10.1016/j.media.2014.04.006

Medical Image Analysis

Volume 18, Issue 5, July 2014, Pages 808-818

Highlights

• Multiple instance learning technique is applied to classification of subjects with Alzheimer’s disease.
• Graphs are built from images to exploit information about the inherent structure of images for classification.
• Validation is carried out on different classification tasks, including CN versus AD and SMCI versus PMCI.
• Comparisons with two state-of-the-art methods show the effectiveness of the proposed method.
• The proposed method provides an alternative framework for the detection and prediction of neurodegenerative diseases.

Abstract

Machine learning techniques have been widely used to detect morphological abnormalities from structural brain magnetic resonance imaging data and to support the diagnosis of neurological diseases such as dementia. In this paper, we propose to use a multiple instance learning (MIL) method for the detection of Alzheimer’s disease (AD) and its prodromal stage, mild cognitive impairment (MCI). In our work, local intensity patches are extracted as features. However, not all the patches extracted from patients with dementia are equally affected by the disease, and some of them may not be characteristic of morphology associated with the disease. Therefore, there is some ambiguity in assigning disease labels to these patches. The problem of ambiguous training labels can be addressed by weakly supervised learning techniques such as MIL. A graph is built for each image to exploit the relationships among the patches and then to solve the MIL problem. The constructed graphs contain information about the appearances of patches and the relationships among them, which can reflect the inherent structures of images and aid the classification. Using the baseline MR images of 834 subjects from the ADNI study, the proposed method can achieve a classification accuracy of 89% between AD patients and healthy controls, and 70% between patients defined as stable MCI and progressive MCI in a leave-one-out cross validation. Compared with two state-of-the-art methods using the same dataset, the proposed method can achieve similar or improved results, providing an alternative framework for the detection and prediction of neurodegenerative diseases.

Introduction

Alzheimer’s disease (AD) is the aetiology most commonly responsible for clinical dementia worldwide. Its progression leads to a gradual decline of memory and cognitive functions. The prevalence of AD is predicted to quadruple in the next four decades (Brookmeyer et al., 2007). However, no drug or treatment has so far been reported to stop the progress of AD, and it remains difficult to predict whether individuals will develop AD. There is a critical need to develop biomarkers for the early diagnosis of AD and for measuring the outcomes of clinical drug trials (Clark et al., 2007). Although there is currently no cure for AD, some medications can delay the onset of symptoms such as memory loss, confusion, and cognitive problems (Yiannopoulou and Papageorgiou, 2013). Diagnosing AD early would allow doctors to treat patients sooner, which can limit the devastating physical and psychological impact on patients and their relatives and reduce the economic burden on society. Mild cognitive impairment (MCI) is an intermediate stage between normal cognition and clinical dementia. Individuals with MCI have been reported to progress to clinical dementia at a rate of 10–15% annually (Grundman et al., 2004). Research on identifying MCI individuals who will progress to clinical dementia has received increasing attention in recent years (Wolz et al., 2011, Coupé et al., 2012, Wee et al., 2012b, Liu et al., 2013, Gray et al., 2013).

Different imaging techniques, such as structural magnetic resonance imaging (MRI) (Wolz et al., 2011, Coupé et al., 2012, Liu et al., 2013), functional MRI (Pihlajamäki and Sperling, 2008, Wee et al., 2011), fluorodeoxyglucose positron emission tomography (FDG-PET) (Herholz et al., 2002, Gray et al., 2012) and diffusion tensor imaging (DTI) (Wee et al., 2012b, Keihaninejad et al., 2013), have been used to derive image-based biomarkers for AD. Studies have shown that combining biomarkers from different imaging modalities (MRI, FDG-PET, DTI, fMRI) can provide complementary information about AD pathology and thus improve the classification accuracy (Zhang et al., 2011, Hinrichs et al., 2011, Wee et al., 2012b, Gray et al., 2013). In comparison to DTI, fMRI or FDG-PET, structural MRI is the most standardized and the most widely available imaging modality in clinical practice. In addition, MRI examinations can provide an opportunity to track different clinical phases of AD (Jack et al., 2013). Therefore, we evaluated our method using structural MR images. However, multiple datasets could also be acquired from different imaging modalities for developing different biomarkers of AD.

Several types of features can be derived from structural MRI for classification, such as gray matter density maps (Cuingnet et al., 2011, Liu et al., 2012a), cortical thickness (Cho et al., 2012, Wee et al., 2012a, Eskildsen et al., 2013) as well as volume and shape measures (Gerardin et al., 2009, Wolz et al., 2010). The number of training images is typically small in comparison with the high dimensionality of the voxel-wise features. Therefore, a feature selection step is necessary to tackle the problem of overfitting. Feature selection has been shown to improve the classification accuracy, but the improvement depends on the approach adopted (Chu et al., 2011). To reduce the feature space and select the discriminative features, statistical approaches (Yoon et al., 2007, Chu et al., 2011, Wee et al., 2012a) or sparse regression methods (Ghosh and Chinnaiyan, 2005, Liu et al., 2012b) are often used. Another popular method is to segment the whole brain into multiple anatomical (Gray et al., 2012) or discriminative (Fan et al., 2007) regions and then extract regional features such as volume or shape measures for classification. It should be noted that the features extracted from neuroimaging data are not isolated and exhibit high correlations (Chu et al., 2011). To account for the relationships among these features, tree-guided sparse coding methods (Liu et al., 2012b) and re-sampling schemes using Elastic Net (Janousova et al., 2012) have recently been proposed. These approaches can select voxel-wise features in meaningful brain regions, which may be related to pathology.

The features derived from MRI can be extracted from very local regions or the whole brain. At the voxel level, intensities or gray matter densities can be directly used in classification (Cuingnet et al., 2011, Vounou et al., 2012). At the whole image level, similarities between images can be used to derive features (Wolz et al., 2012). However, the structural changes induced in the early stages of AD have been observed to occur in small local regions rather than isolated voxels or the whole brain (Hinrichs et al., 2009). Patches represent features at an intermediate scale between the voxel level and the image level, which can capture disease-induced changes in local regions. Recent approaches (Coupé et al., 2012, Liu et al., 2013) utilize local intensity patterns within patches to capture the local structural information for AD classification. In these approaches, patches from patients with AD are used as positive samples and patches from healthy subjects are regarded as negative samples for training. However, patches are relatively small regions in brain images and not all patches in the brain are characteristic of changes associated with pathology. For example, patches in close vicinity of the hippocampus are more likely to be affected by AD while patches in homogeneous regions may not be affected. This is illustrated in Fig. 1. In addition, different types of dementias have different aetiologies. This means that some patches may be affected by other aetiologies such as cerebrovascular disease rather than AD. Therefore, not all patches from patients necessarily represent positive training samples. This means that there is some ambiguity in assigning disease labels to the training patches extracted from patients. One solution to this problem is to use a weakly supervised method such as multiple instance learning (MIL) (Maron and Lozano-Pérez, 1998), which can learn classifiers from ambiguously labeled training data. 
Although MIL has been successfully applied to different applications in computer vision (Babenko et al., 2009) and recently in medical imaging (Bi and Liang, 2007, Xu et al., 2012), to the best of our knowledge, it has not been used in the context of classification of neurological diseases. In this paper, we propose to use MIL for the classification of AD and to address the problem of ambiguous patch labels. Specifically, each image is regarded as a bag; the patches extracted from the images are thus treated as inter-correlated instances in the bags. MIL is then used to learn a bag-level classifier to predict the bag labels of unseen images and therefore classify the subjects.
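The bag-and-instance view described above can be made concrete with a small sketch. It is illustrative only: the function name `extract_bag`, the patch centres, the patch size and the random volume are all assumptions introduced here, not values taken from the paper.

```python
import numpy as np

def extract_bag(volume, centers, half_size=2):
    """Return one bag: flattened cubic patches cut around each centre.

    `centers` and `half_size` are hypothetical choices for illustration.
    """
    instances = []
    for (x, y, z) in centers:
        patch = volume[x - half_size:x + half_size + 1,
                       y - half_size:y + half_size + 1,
                       z - half_size:z + half_size + 1]
        instances.append(patch.ravel())
    return np.stack(instances)

rng = np.random.default_rng(0)
volume = rng.random((32, 32, 32))                     # stand-in for a preprocessed MR image
centers = [(10, 12, 14), (16, 16, 16), (20, 8, 25)]   # hypothetical patch locations

bag = extract_bag(volume, centers)   # instances: one row per patch
bag_label = 1                        # only the subject-level diagnosis is known
print(bag.shape)                     # (3, 125)
```

The point of the representation is that `bag_label` is attached to the whole set of patches; no individual row of `bag` is asserted to be diseased or healthy.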

Most existing approaches utilize the intensity values of patches for classification. The relationships among patches are usually ignored since the patches are treated as independently and identically distributed. However, patches from the same subject are rarely independent and often exhibit shared information. This information across patches can convey information about the inherent structure of the images, which may be helpful for disease classification. In recent works, correlated features are extracted to exploit the relationships among patches (Liu et al., 2013) or ROIs (Wee et al., 2012a) of the same subject, which has been shown to improve the classification accuracy. In our work, a graph is constructed from each image in order to investigate the relationships among patches and to exploit the inherent structural information of each image. After that, a graph kernel, which utilizes both the intensity values and the relationships of the extracted patches, is used to distinguish the positive and negative bags. Finally, a bag-level classifier is trained via a kernel machine.
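As a rough illustration of how a kernel can combine patch intensities with the relationships among patches, the sketch below builds a simple graph inside each bag and uses it to weight a Gaussian kernel between patches. The exact graph construction and kernel are not given in this excerpt; the distance threshold `delta`, the width `gamma` and the weighting scheme are assumptions, chosen as one common construction for graph-based MIL.

```python
import numpy as np

def rbf(a, b, gamma):
    """Gaussian kernel matrix between the rows of a and the rows of b."""
    sq = (a ** 2).sum(1)[:, None] + (b ** 2).sum(1)[None, :] - 2.0 * a @ b.T
    return np.exp(-gamma * np.maximum(sq, 0.0))

def instance_weights(bag, delta):
    """Connect patches closer than delta; down-weight densely connected ones."""
    diff = bag[:, None, :] - bag[None, :, :]
    dist = np.sqrt((diff ** 2).sum(-1))
    adjacency = (dist < delta).astype(float)   # includes self-loops, so row sums >= 1
    return 1.0 / adjacency.sum(axis=1)

def graph_kernel(bag_a, bag_b, gamma=0.1, delta=1.0):
    """Weighted average of patch-to-patch kernel values between two bags."""
    w_a = instance_weights(bag_a, delta)
    w_b = instance_weights(bag_b, delta)
    k = rbf(bag_a, bag_b, gamma)
    return float(w_a @ k @ w_b) / (w_a.sum() * w_b.sum())

rng = np.random.default_rng(1)
bag_1 = rng.normal(0.0, 1.0, (6, 4))   # 6 patches, 4 features each (toy sizes)
bag_2 = rng.normal(0.5, 1.0, (8, 4))
print(graph_kernel(bag_1, bag_2), graph_kernel(bag_2, bag_1))
```

Under these assumptions, a group of near-identical patches shares its influence instead of dominating the comparison, and the resulting value is an inner product of weighted mean embeddings, so a Gram matrix built from it can be passed to a standard kernel machine.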

A preliminary version of the presented framework has been published as a conference paper (Tong et al., 2013). The major difference in this work is that we adopted a more robust feature selection method, as proposed in Janousova et al. (2012). In addition, an extended evaluation on the whole brain is presented and more detailed comparisons with state-of-the-art methods are provided. The remainder of this paper is organized as follows: The demographic information of the image dataset used in the preparation of this article is introduced in Section 2.1. This is followed by a description of the preprocessing pipeline of these images in Section 2.3 and a description of how patches are extracted from the images to form corresponding bags in Section 2.4. We then introduce the methodology of MIL and how we apply it to the classification of AD in Section 2.5. The performance of the proposed method has been evaluated using 834 subjects from the ADNI study. In Section 3, the influence of different parameters is studied and the performance of the proposed method is compared with state-of-the-art techniques. The strengths and weaknesses of the proposed method are analyzed in the discussion section, and finally we conclude the paper in Section 5.

Section snippets

Subjects

Data used in the preparation of this article were obtained from the ADNI database (adni.loni.ucla.edu). The ADNI was launched in 2003 by the National Institute on Aging (NIA), the National Institute of Biomedical Imaging and Bioengineering (NIBIB), the Food and Drug Administration (FDA), private pharmaceutical companies and non-profit organizations, as a $60 million, 5-year public–private partnership. The primary goal of ADNI has been to test whether serial MRI, PET, other biological markers,

Experiments and results

The performance of the proposed mi-Graph was evaluated on different classification tasks, including CN vs AD, CN vs PMCI and SMCI vs PMCI. Experiments were performed using leave-one-out cross validation since this validation is known to be an almost unbiased estimator (Cawley and Talbot, 2004). For a fair comparison with the study in Wolz et al. (2011), we also utilized a leave 5% out cross validation as adopted in their work. There are five important parameters in our proposed method: the size
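The leave-one-out protocol mentioned above can be sketched on a precomputed kernel matrix. This is a minimal illustration under stated assumptions: the kernel ridge classifier, the `ridge` value, the linear kernel and the synthetic data are stand-ins chosen here, not the classifier, kernel or data used in the paper.

```python
import numpy as np

def loo_accuracy(gram, labels, ridge=1e-3):
    """Leave-one-out accuracy of a kernel ridge classifier on a Gram matrix."""
    n = len(labels)
    correct = 0
    for i in range(n):
        train = np.delete(np.arange(n), i)            # hold out subject i
        k_train = gram[np.ix_(train, train)]
        alpha = np.linalg.solve(k_train + ridge * np.eye(n - 1), labels[train])
        score = gram[i, train] @ alpha                # decision value for subject i
        correct += int(np.sign(score) == labels[i])
    return correct / n

rng = np.random.default_rng(0)
x = np.vstack([rng.normal(-2.0, 0.5, (10, 3)),       # toy "negative" subjects
               rng.normal(2.0, 0.5, (10, 3))])       # toy "positive" subjects
y = np.array([-1.0] * 10 + [1.0] * 10)
gram = x @ x.T   # linear kernel as a stand-in for a bag-level kernel

acc = loo_accuracy(gram, y)
print(acc)
```

Each subject is predicted by a model that never saw it during training, which is why the estimate is close to unbiased; the price is one model fit per subject.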

Discussion

In this paper, we have developed a patch-based approach for the classification of subjects with diseases such as AD. Since patches that are extracted from images of patients with AD may not be affected by AD, or may be affected by other types of diseases (e.g. cerebrovascular disease), there is some ambiguity in assigning disease labels to these patches. We proposed to use MIL to address the problem of ambiguous labels of the training patches. The intensities of the patches and the relationships among

Conclusion

In this study, we have shown that the multiple instance learning technique can be successfully applied to the classification of AD. The proposed method was evaluated on a large database using all 834 baseline MR scans in the ADNI study. Direct comparisons with two recent methods demonstrate the effectiveness of the proposed method. In future work, we plan to extend the proposed framework using longitudinal datasets and other imaging modalities, such as FDG-PET images.

Acknowledgments

This project was partially funded by the China Scholarship Council. The ADNI Data collection and sharing for this project was funded by the Alzheimer’s Disease Neuroimaging Initiative (ADNI; Principal Investigator: Michael Weiner; NIH Grant U01 AG024904). ADNI is funded by the National Institute on Aging, the National Institute of Biomedical Imaging and Bioengineering (NIBIB), and through generous contributions from the following: Pfizer Inc., Wyeth Research, Bristol-Myers Squibb, Eli Lilly and

References (63)

S.-i. Amari et al. Improving support vector machine classifiers by modifying kernel functions. Neural Networks (1999)

R. Brookmeyer et al. Forecasting the global burden of Alzheimer’s disease. Alzheimer’s & Dementia (2007)

Y. Cho et al. Individual subject classification for Alzheimer’s disease based on incremental learning using a spatial frequency representation of cortical thickness data. NeuroImage (2012)

P. Coupé et al. Scoring by nonlocal image patch estimator for early detection of Alzheimer’s disease. NeuroImage: Clinical (2012)

R. Cuingnet et al. Automatic classification of patients with Alzheimer’s disease from structural MRI: a comparison of ten methods using the ADNI database. NeuroImage (2011)

S.F. Eskildsen et al. Prediction of Alzheimer’s disease in subjects with mild cognitive impairment from the ADNI cohort using patterns of cortical thinning. NeuroImage (2013)

E. Gerardin et al. Multidimensional classification of hippocampal shape features discriminates Alzheimer’s disease and mild cognitive impairment from normal aging. NeuroImage (2009)

K.R. Gray et al. Multi-region analysis of longitudinal FDG-PET for the classification of Alzheimer’s disease. NeuroImage (2012)

K.R. Gray et al. Random forest-based similarity measures for multi-modal classification of Alzheimer’s disease. NeuroImage (2013)

K. Herholz et al. Discrimination between Alzheimer dementia and controls by automated analysis of multicenter FDG-PET. NeuroImage (2002)

Babenko, B., Yang, M.-H., Belongie, S., 2009. Visual tracking with online multiple instance learning. In: IEEE...

Bi, J., Liang, J., 2007. Multiple instance learning of pulmonary embolism detection with geodesic distance along...

G.C. Cawley et al. Fast exact leave-one-out cross-validation of sparse least-squares support vector machines. Neural Networks (2004)

¹: Data used in the preparation of this article were obtained from the ADNI database (http://www.loni.ucla.edu/ADNI). As such, the investigators within the ADNI contributed to the design and implementation of ADNI and/or provided data but did not participate in analysis or writing of this report. A complete listing of ADNI investigators can be found at: www.loni.ucla.edu/ADNI/Collaboration/ADNI_Authorship_list.pdf.
