Hypergraph based multi-task feature selection for multimodal classification of Alzheimer's disease
Introduction
Alzheimer's disease (AD), which usually affects the elderly, is the sixth leading cause of death in the United States (Association, 2012). The progression of AD gradually leads to a widespread loss of mental function, such as memory loss, language impairment, disorientation and personality change, ultimately leading to death. It was reported in Association (2013) that the total prevalence of AD is expected to reach 13.8 million by 2050. However, no treatment has so far been shown to reverse or stop the progression of AD. Therefore, many studies focus on the early diagnosis of AD and mild cognitive impairment (MCI) based on neuroimaging data (Sui et al., 2012, Ye et al., 2011), which plays a crucial role in subsequent treatment.
Neuroimaging, which offers great potential for discovering features corresponding to the early course of dementing illness, is a powerful tool for the diagnosis of neurodegenerative diseases such as AD. Recently, magnetic resonance imaging (MRI) and positron emission tomography (PET) have been shown to be useful for investigating the neurophysiological characteristics of AD and MCI (Davatzikos et al., 2011, Fan et al., 2008, Chetelat et al., 2003, Foster et al., 2007). Furthermore, multiple neuroimaging biomarkers have been shown to be sensitive for the diagnosis of AD and MCI, such as structural atrophy, pathological amyloid depositions, and metabolic alterations in the brain.
In recent decades, machine learning and pattern recognition methods have been widely used in neuroimaging analysis of AD and MCI, including group-based comparison approaches and individual classification (Ye et al., 2011, Orrú et al., 2012). However, most existing studies mainly focus on extracting features from a single modality. For example, researchers have extracted features from structural MRI, such as voxel-wise tissue density (Desikan et al., 2009, Fan et al., 2007) and hippocampal volume (Gerardin et al., 2009), for the diagnosis of AD. In addition to structural MRI, PET images have also been utilized for the classification of AD and MCI (Chetelat et al., 2003, Foster et al., 2007, Hinrichs et al., 2009).
However, since the structure and function of the brain are very complex, it is challenging to accurately detect all the disease-related features from a single modality. With the development of biomedical imaging techniques, multi-modality based methods have become promising in medical image analysis, since multi-modality information is naturally available in the data acquisition procedures of various clinical tasks. Different modalities provide different views of brain structure or function and reveal different aspects of the pathological changes related to AD. For example, structural MRI provides information on the tissue types of the brain, while FDG-PET measures the cerebral metabolic rate of glucose. Numerous studies have shown that complementary neuroimaging modalities can reveal information that may be missed by a single modality, and that fusing information from different modalities can enhance diagnostic performance. Hence, multiple modalities are preferred to improve the accuracy of AD diagnosis (Liu et al., 2014, Liu et al., 2015, Suk et al., 2014, Zhu et al., 2014, Gray et al., 2013, Ahmed et al., 2017, Lei et al., 2017). For instance, Liu et al. (2015) and Suk et al. (2014) used two modalities, MRI and PET, for the diagnosis of AD. Zhu et al. (2014) combined MRI, PET and cerebrospinal fluid (CSF) for the regression and classification of AD. Gray et al. (2013) used MRI, PET, CSF and categorical genetic information for AD/MCI classification.
Although the existing multi-modality based methods have achieved promising results, some problems still limit the classification performance. In neuroimaging, even after feature extraction, the feature dimension is relatively high compared to the sample size, and the subsequent classification performance may suffer because of redundant or irrelevant features. Therefore, feature selection, which removes such redundant or irrelevant features, has become an important step in the diagnosis of AD. Several feature selection methods have been used to identify the disease-related regions in AD. For example, Zhu et al. (2016) combined two subspace learning methods, namely linear discriminant analysis and locality preserving projection, to select features in neuroimaging. Jie et al. (2015) proposed a manifold regularized multi-task feature learning method, which uses multi-task learning and a manifold-based Laplacian regularization to preserve both the intrinsic relatedness among multiple modalities and the data distribution within each modality, thereby inducing more discriminative features. Zu et al. (2016) proposed a label-aligned multi-task feature learning method, which adds a label-aligned regularization term to the objective function of standard multi-task feature selection to ensure that multi-modal subjects with the same class labels remain close in the new feature-reduced space.
However, one disadvantage of the existing methods is that they only consider the pairwise relationships between subjects, ignoring the high-order relationships that constitute important prior information for the learning task. In many real-world problems, the relationships among the objects of interest are more complex than pairwise. Naively squeezing these complex relationships into pairwise ones inevitably loses information that can be expected to be valuable for the learning task (Zhou et al., 2007). Intuitively, modeling the high-order relationships among subjects can induce more discriminative features and further boost the performance of the subsequent classification. In many applications, researchers use a hypergraph to model complex relationships among subjects: a hyperedge can connect more than two vertices at the same time and thus capture high-order structure. For example, Bu et al. (2010) adopted hypergraphs for music recommendation, and Hong et al. (2016) proposed to recover human pose via a hypergraph Laplacian.
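To make the hypergraph idea concrete, the sketch below builds a k-nearest-neighbour hypergraph over a set of subjects (each subject acting as the centroid of one hyperedge) and computes the normalized hypergraph Laplacian of Zhou et al. (2007). The construction details here (centroid-based hyperedges, uniform weights, the function name `hypergraph_laplacian`) are illustrative assumptions, not necessarily the exact choices made in this paper.

```python
import numpy as np

def hypergraph_laplacian(X, k=3):
    """Build a k-NN hypergraph over the rows of X (subjects x features)
    and return the normalized hypergraph Laplacian
    L = I - Dv^{-1/2} H W De^{-1} H^T Dv^{-1/2}  (Zhou et al., 2007)."""
    n = X.shape[0]
    # pairwise squared Euclidean distances between subjects
    d2 = ((X[:, None, :] - X[None, :, :]) ** 2).sum(-1)
    H = np.zeros((n, n))                   # incidence: vertices x hyperedges
    for e in range(n):
        nbrs = np.argsort(d2[e])[:k + 1]   # centroid plus its k nearest neighbours
        H[nbrs, e] = 1.0
    w = np.ones(n)                         # uniform hyperedge weights (assumption)
    Dv = H @ w                             # vertex degrees
    De = H.sum(axis=0)                     # hyperedge degrees
    Dv_isqrt = np.diag(1.0 / np.sqrt(Dv))
    return np.eye(n) - Dv_isqrt @ H @ np.diag(w / De) @ H.T @ Dv_isqrt
```

Unlike an ordinary graph Laplacian, each column of `H` here joins k+1 subjects at once, so smoothness penalties built on this Laplacian act on whole neighbourhood groups rather than on pairs.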
In this paper, we propose a hypergraph based multi-task feature selection method in which a hypergraph-based regularization is developed to explicitly capture the high-order relationships in each modality. Our proposed method consists of three steps: (1) hypergraph construction, (2) hypergraph based multi-task feature learning, and (3) multimodal classification. First, we construct a hypergraph in each modality by building multiple hyperedges that reflect the high-order relationships among subjects. Then, we treat feature learning in each modality as a single learning task and formulate the multimodal classification as a multi-task learning (MTL) problem. MTL exploits the intrinsic task relatedness, so that information can be shared across tasks and facilitate the learning of each individual task. In particular, the ℓ2,1-norm is introduced to select features jointly, which guarantees that the features of different modalities in the same brain regions are selected at the same time, and hypergraph-based regularization terms are added to the standard multi-task objective function. Finally, we use a multi-kernel support vector machine (SVM) to fuse the selected features from the multimodal data for the final classification. To validate our method, we conduct experiments on the ADNI dataset, and the experimental results show the effectiveness of the proposed method compared with state-of-the-art methods.
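The feature learning step outlined above can be sketched as a proximal-gradient solver for a squared loss per modality plus a hypergraph-Laplacian smoothness term and a joint ℓ2,1 penalty. The exact objective, step size rule, and the names `hmtfs` and `l21_prox` are assumptions for illustration, not the paper's verified formulation.

```python
import numpy as np

def l21_prox(W, t):
    """Row-wise soft-thresholding: proximal operator of t * ||W||_{2,1}."""
    norms = np.linalg.norm(W, axis=1, keepdims=True)
    return np.maximum(0.0, 1.0 - t / np.maximum(norms, 1e-12)) * W

def hmtfs(Xs, y, Ls, lam=0.1, mu=0.1, step=None, iters=200):
    """Hypergraph-regularized multi-task feature selection (sketch).
    Xs: list of (n, d) modality matrices; y: (n,) labels in {-1, +1};
    Ls: list of (n, n) hypergraph Laplacians, one per modality.
    Minimizes  sum_m ||y - X_m w_m||^2 + mu * (X_m w_m)^T L_m (X_m w_m)
               + lam * ||W||_{2,1}
    by proximal gradient descent, with W = [w_1, ..., w_M]."""
    d, M = Xs[0].shape[1], len(Xs)
    W = np.zeros((d, M))               # one weight column per modality/task
    if step is None:                   # 1 / Lipschitz constant of the smooth part
        step = 1.0 / max(2 * np.linalg.norm(X.T @ (np.eye(len(y)) + mu * L) @ X, 2)
                         for X, L in zip(Xs, Ls))
    for _ in range(iters):
        # gradient of the smooth part, one column per task
        G = np.column_stack([2 * X.T @ ((X @ W[:, m] - y) + mu * (L @ (X @ W[:, m])))
                             for m, (X, L) in enumerate(zip(Xs, Ls))])
        W = l21_prox(W - step * G, step * lam)
    return W                           # rows with nonzero norm = jointly selected features
```

Because the prox operates on whole rows of `W`, a feature (brain region) is kept or discarded for all modalities simultaneously, which is exactly the joint-selection behaviour the ℓ2,1-norm is meant to enforce.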
Method
Fig. 1 illustrates the framework of the proposed method, which includes three main steps: hypergraph construction, feature selection and classification. In this section, we first introduce the dataset used in our experiments and then give the details of the proposed method.
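The classification step of the framework, multi-kernel SVM fusion of the selected features from the two modalities, can be sketched as below. The convex combination of precomputed kernels follows the common multi-kernel SVM formulation; the helper name `multikernel_svm` and the fixed `C` are illustrative assumptions.

```python
import numpy as np
from sklearn.svm import SVC

def multikernel_svm(K_list, betas, y_train, train_idx, test_idx):
    """Multi-kernel SVM fusion (sketch): combine one precomputed Gram
    matrix per modality with convex weights, then train an SVM on the
    mixed kernel. K_list: full (n, n) kernels; betas should sum to 1."""
    K = sum(b * K for b, K in zip(betas, K_list))       # weighted kernel fusion
    clf = SVC(kernel="precomputed", C=1.0)
    clf.fit(K[np.ix_(train_idx, train_idx)], y_train)   # train-train block
    return clf.predict(K[np.ix_(test_idx, train_idx)])  # test-train block
```

In practice the weights `betas` would be tuned (e.g. by a grid search on the training folds) rather than fixed, so that the more informative modality receives the larger kernel weight.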
Experiments and results
To validate the effectiveness of our proposed method, we perform experiments in different scenarios, including AD vs. normal controls (NC), late MCI (LMCI) vs. NC, and early MCI (EMCI) vs. LMCI. Classification performance is assessed on the MRI and FDG-PET modalities from ADNI participants. In our experiments, a 10-fold cross-validation strategy is adopted to evaluate the classification performance. Specifically, the whole set is equally divided into 10 subsets. For each cross-validation fold, we take 9 subsets as the training set and the remaining subset as the test set.
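The evaluation protocol described above is standard 10-fold cross-validation. A minimal sketch, using a plain linear SVM as a stand-in for the full pipeline and a hypothetical helper name `cv_accuracy`:

```python
import numpy as np
from sklearn.model_selection import StratifiedKFold
from sklearn.svm import SVC

def cv_accuracy(X, y, n_splits=10, seed=0):
    """10-fold cross-validation: split the subjects into 10 equal parts,
    train on 9, test on the held-out part, and average the accuracies."""
    skf = StratifiedKFold(n_splits=n_splits, shuffle=True, random_state=seed)
    accs = []
    for tr, te in skf.split(X, y):
        clf = SVC(kernel="linear").fit(X[tr], y[tr])
        accs.append((clf.predict(X[te]) == y[te]).mean())
    return float(np.mean(accs))
```

Stratified folds keep the class proportions roughly equal in every split, which matters here because the diagnostic groups (e.g. AD vs. NC) are not equally sized.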
Discussion
In this paper, we propose a new multi-modality based classification framework, namely hypergraph based multimodal classification. Our proposed method includes three steps: hypergraph construction, hypergraph based multi-task feature selection, and multi-kernel classification. To validate the effectiveness of our proposed method, three sets of experiments are performed on 831 subjects from the ADNI dataset. The results show that the proposed method can not only take advantage of the multi-modality information but also exploit the high-order relationships among subjects.
Conclusion
In summary, this paper proposed a novel multi-task learning method to jointly select features from multimodal neuroimaging data for AD/MCI classification. MTL captures the intrinsic task relatedness, based on which the information from each task can be shared across multiple tasks. By introducing the hypergraph based regularization term into the multi-task learning framework, the proposed method can utilize the high-order relationships among the subjects to seek the most discriminative brain regions.
Conflict of interest
The authors of this manuscript have nothing to disclose.
Acknowledgements
This work was supported by the National Natural Science Foundation of China (Nos. 61902183, 61876082, 61861130366, 61732006), the National Key Research and Development Program of China (Nos. 2018YFC2001600, 2018YFC2001602), the Royal Society-Academy of Medical Sciences Newton Advanced Fellowship (No. NAFR1180371), and a China Postdoctoral Science Foundation funded project (No. 2019M661831).
References (61)
- et al., Recognition of Alzheimer's disease and mild cognitive impairment with multimodal image-derived biomarkers and multiple kernel learning, Neurocomputing (2017)
- et al., Sparse shared structure based multi-task learning for MRI based cognitive performance prediction of Alzheimer's disease, Pattern Recogn. (2017)
- et al., Structural and functional biomarkers of prodromal Alzheimer's disease: a high-dimensional pattern classification study, NeuroImage (2008)
- et al., Multidimensional classification of hippocampal shape features discriminates Alzheimer's disease and mild cognitive impairment from normal aging, NeuroImage (2009)
- et al., Random forest-based similarity measures for multi-modal classification of Alzheimer's disease, NeuroImage (2013)
- et al., Spatially augmented LPboosting for AD classification with evaluations on the ADNI dataset, NeuroImage (2009)
- et al., Predictive markers for AD in a multi-modality framework: an analysis of MCI progression in the ADNI population, NeuroImage (2011)
- et al., Hypergraph regularized autoencoder for image-based 3D human pose recovery, Signal Process. (2016)
- et al., Boosting power for clinical trials using classifiers based on multiple biomarkers, Neurobiol. Aging (2010)
- et al., View-aligned hypergraph learning for Alzheimer's disease diagnosis with incomplete multi-modality data, Med. Image Anal. (2017)
- Hypergraph regularized sparse feature learning, Neurocomputing
- Using support vector machine to identify imaging biomarkers of neurological and psychiatric disease: a critical review, Neurosci. Biobehav. Rev.
- Amygdala atrophy is prominent in early Alzheimer's disease and relates to symptom severity, Psychiatry Res.: Neuroimaging
- APOE effect on Alzheimer's disease biomarkers in older adults with significant memory concern, Alzheimer's Dementia
- A review of multivariate methods for multimodal fusion of brain imaging data, J. Neurosci. Methods
- Hierarchical feature representation and multimodal fusion with deep learning for AD/MCI diagnosis, NeuroImage
- Automated anatomical labeling of activations in SPM using a macroscopic anatomical parcellation of the MNI MRI single-subject brain, NeuroImage
- Enriched white matter connectivity networks for accurate identification of MCI patients, NeuroImage
- Accurate multimodal probabilistic prediction of conversion to Alzheimer's disease in patients with mild cognitive impairment, NeuroImage: Clinical
- Multimodal classification of Alzheimer's disease and mild cognitive impairment, NeuroImage
- A novel matrix-similarity based loss function for joint regression and classification in AD diagnosis, NeuroImage
- 2012 Alzheimer's disease facts and figures, Alzheimer's Dementia
- 2013 Alzheimer's disease facts and figures, Alzheimer's Dementia
- Network structure inference, a survey: motivations, methods, and applications, ACM Comput. Surv.
- Music recommendation by unified hypergraph: combining social media information and music content, Proceedings of the 18th ACM International Conference on Multimedia
- LIBSVM: a library for support vector machines, ACM Trans. Intell. Syst. Technol.
- Accelerated gradient method for multi-task sparse learning problem, Ninth IEEE International Conference on Data Mining, ICDM'09
- Mild cognitive impairment: can FDG-PET predict who is to rapidly convert to Alzheimer's disease?, Neurology
- Prediction of MCI to AD conversion, via MRI, CSF biomarkers, and pattern classification, Neurobiol. Aging
- Automated MRI measures identify individuals with mild cognitive impairment and Alzheimer's disease, Brain