Sparse SPM: Group Sparse-dictionary learning in SPM framework for resting-state functional connectivity MRI analysis
Introduction
Spontaneous low-frequency fluctuations (< 0.1 Hz) of blood oxygen level-dependent (BOLD) signals during resting states have been shown to represent cognitive functions and neural physiology (Biswal et al., 1995, Cordes et al., 2001, Damoiseaux et al., 2006). Spatiotemporally distinct resting-state networks have been consistently identified in the primary visual network, default mode network, salience network, fronto-parietal network, and sensory motor network, among others (De Luca et al., 2006). Among the various resting-state subnetworks, the default mode network (DMN), which significantly deactivates during cognitive task-related experiments, has been studied extensively in functional connectivity analyses. It has been shown that the DMN is closely involved with episodic memory processing (Lustig et al., 2003, Greicius et al., 2004). Furthermore, previous works have provided evidence that the PCC, which shows a neural deactivation in early Alzheimer's disease (AD), is the first brain region to exhibit decreased metabolism in AD patients (Minoshima et al., 1997).
Seed-based approaches (Lowe et al., 1998, Rombouts et al., 2003, Fransson, 2005, Fox et al., 2009) and independent component analysis (ICA)-based approaches (van de Ven et al., 2004, Beckmann et al., 2005) are the most commonly used analysis methods in resting-state functional connectivity studies. The seed-based approach extracts BOLD signal time courses from a region of interest (ROI), called a “seed” region, and computes the cross-correlation between time course signals from the ROI and all other voxels in the brain to obtain a map of neuronal connectivity (Fox and Raichle, 2007). Despite their popularity, seed-based correlation analyses have limitations such that they require a prior determination of the seed's location. On the other hand, ICA automatically decomposes the entire BOLD dataset into maximally independent components. However, the brain networks are not independent of each other due to their complex, interconnected regions. Another issue in using ICA is that the individual analysis is usually not sensitive in detecting networks compared to seed-based analysis. Moreover, the unified theory that links the individual analysis results to group analysis is still not fully established. Additionally, graph theory-based quantitative analyses of brain connectivity have been developed to study structural and functional brain networks and their interactions (Bullmore and Sporns, 2009). However, graph theory-based analysis is dependent on pre-defined parcellations. Therefore, parcellation-independent graph theoretical analyses are required.
Unlike the conventional approaches, here we present a novel parcellation-free functional connectivity analysis that is inspired by the graph theoretical approach for brain networks. More specifically, our method is derived from signal decomposition based on a sparse graph model that regards the temporal dynamics at each voxel as a sparse combination of unknown global information flow. Interestingly, we can show that the sparse dictionary learning algorithm and the concept of a spatially adaptive design matrix used for our fMRI analysis in Lee et al. (2011b) can be used to represent local connectivities based on the sparse graph model. However, one of the technical difficulties of using Lee et al. (2011b) for functional connectivity fMRI analysis is that the extracted temporal dynamics corresponding to each network highly depend on the individual. Moreover, subject-dependent regressors should be estimated, after which the group-level statistical inferences should be performed using group average activation maps that are extracted using the subject-specific regressors. This complicates the group sparse learning and statistical inference. Similar difficulties have been observed in other data-driven approaches, such as ICA. In group ICA, the problem has been addressed by concatenating the data or by using tensor factorization. However, even though group-wise activation maps can be detected using these types of approaches, more advanced group analyses, such as a two-sample t-test, or an analysis of variance (ANOVA), are often difficult. There are some recent methods for ICA to obtain such components, such as dual regression (Zuo et al., 2010), and GRAICAR (generalized ranking and averaging independent component analysis by reproducibility) (Yang et al., 2012). However, a unified framework from individual to group level using standard statistical analysis tools still appears to be lacking.
To overcome such technical difficulties in group analysis, one of the main contributions of this paper is to propose a novel unified mixed-effect model framework where group-level sparse dictionary learning and group inference can be performed in a unified linear mixed model and the restricted maximum likelihood (ReML) variance estimation framework. More specifically, to estimate the unknown global dynamics and local network structures at a group level, we first concatenated the time series across the subject and performed a group sparse dictionary learning for the concatenated temporal data. We showed that the sparse learning for the concatenated time series is equivalent to imposing a constraint that the network structures within a group are similar. Using this constraint, a global dictionary was estimated from the concatenated data, after which the dictionaries from the concatenated time series were separated to obtain each subject-level sparse dictionary. Then, the SPM-type analysis was performed using individualized dictionaries. Under the homoscedasticity variance assumption, we showed that the aforementioned group sparse dictionary learning and inference can be rigorously derived using the unified linear mixed model framework and the restricted maximum likelihood (ReML) variance estimation.
As the mathematical framework for inference turns out to be similar to that of a standard statistical parameter mapping (SPM) analysis with only the exception of a spatially adaptive design matrix (which still retains the homogeneous degree of the freedom), rich statistical analysis tools, such as p-value correction using random field theory and hypothesis-driven inference, can be used. Accordingly, we call the proposed method as sparse SPM (SSPM).
To confirm the validity of the proposed method, we provide extensive comparisons using group data from normal, MCI, and Alzheimer subjects from both our clinical data set and the ADNI (Alzheimer's Disease Neuroimaging Initiative) data set (http://www.loni.usc.edu/ADNI).
Section snippets
Theory
Throughout the paper, xi and xj correspond to the i-th row and the j-th column of matrix X, respectively. When S is an index set, XS and AS correspond to a submatrix collecting the corresponding rows of X and columns of A, respectively; xS denotes a subvector collecting the corresponding elements of X. The superscripts ' and † denote the adjoint operator and pseudo-inverse, respectively. A vector 1L denotes a L-dimensional vector with elements of ones, and Ik × k is a k × k identity matrix.
Data acquisition
We collected three groups of resting-state fMRI data: 1) 22 normal subjects (8 male, mean age 70 years), 2) 37 MCI patients (21 male, mean age 72 years), and 3) 20 AD patients with CDR 0.5 (5 male, mean age 72 years). During the task period, subjects were instructed to be awake and alert but not actively involved in a task and with eye closed. A 3.0 T fMRI system (Philips, Netherlands) was used to measure the BOLD response. The echo planar imaging (EPI) sequence was used with TR/TE = 3000/35 ms, flip
Parameter selection
The determination of the number of atoms in a dictionary is an important issue, because it represents the number of linearly independent temporal dynamics across the whole brain. Group results with various dictionary numbers are shown in Fig. 5. In general, when a small number of dictionary components are chosen, they tend to aggregate different networks. On the other hand, a large dictionary size tends to segregate a subnetwork into multiple fragments. We tested various dictionary component
Discussion
The ANOVA results in Fig. 9 clearly indicate progression of Alzheimer's disease. The results imply that the PCC is the first area to deteriorate and is followed by MPFC and IPL areas. These findings with SSPM coincide with the biological findings that the posterior components of the default network, including the precuneus and posterior cingulate, are particularly vulnerable to early deposition of amyloid β-protein, one of the hallmark pathologies of AD (Sperling et al., 2009). This clearly
Conclusion
In this paper, we developed a unified mixed model called sparse SPM for group sparse dictionary learning and inference for resting-state fMRI analysis. Unlike ICA methods, the new algorithm exploits the fact that temporal dynamics at each voxel can be represented as a sparse combination of global dynamics because of the property of small-worldness of brain networks. In addition, the sparse coding step in the sparse dictionary learning step of our proposed method enabled SSPM for
Acknowledgment
JCY was supported by the Korea Science and Engineering Foundation (KOSEF) grant funded by the Korea government (NRF-2014R1A2A1A11052491). YJ was supported by the Brain Research Program (NRF-2010-0018843), Basic Science Research Program (NRF-2012R1A1A2044776) through the National Research Foundation of Korea funded by the Ministry of Science, ICT and Future Planning.
References (42)
- et al.
fMRI resting state networks define distinct modes of long-distance interactions in the human brain
NeuroImage
(2006) - et al.
Learning brain connectivity of Alzheimer's disease by sparse inverse covariance estimation
NeuroImage
(2010) - et al.
Functional connectivity in single and multislice echoplanar imaging using resting-state fluctuations
NeuroImage
(1998) - et al.
Identifying confounds to increase specificity during a “no task condition”
NeuroImage
(2003) - et al.
Estimation of functional connectivity in fMRI data using stability selection-based sparse partial correlation with elastic net penalty
NeuroImage
(2012) - et al.
Altered effective connectivity during working memory performance in schizophrenia: a study with fmri and structural equation modeling
NeuroImage
(2003) - et al.
Amyloid deposition is associated with impaired default network function in older persons without dementia
Neuron
(2009) - et al.
Generalized RAICAR: discover homogeneous subject (sub) groups by reproducibility of their intrinsic connectivity networks
NeuroImage
(2012) - et al.
Reliable intrinsic connectivity networks: test–retest evaluation using ICA and dual regression approach
NeuroImage
(2010) - et al.
Fast and incoherent dictionary learning algorithms with application to fmri
SIViP
(2015)
K-SVD: an algorithm for designing overcomplete dictionaries for sparse representation
IEEE Trans. Signal Process.
Investigations into resting-state connectivity using independent component analysis
Philos. Trans. R. Soc. B Biol. Sci.
Functional connectivity in the motor cortex of resting human brain using echo-planar mri
Magn. Reson. Med.
Complex brain networks: graph theoretical analysis of structural and functional systems
Nat. Rev. Neurosci.
Frequencies contributing to functional connectivity in the cerebral cortex in “resting-state” data
Am. J. Neuroradiol.
Consistent resting-state networks across healthy subjects
Proc. Natl. Acad. Sci.
Sparse dictionary learning of resting state fMRI networks
Spontaneous fluctuations in brain activity observed with functional magnetic resonance imaging
Nat. Rev. Neurosci.
The global signal and observed anticorrelated resting state brain networks
J. Neurophysiol.
Spontaneous low-frequency BOLD signal fluctuations: an fMRI investigation of the resting-state default mode of brain function hypothesis
Hum. Brain Mapp.
Statistical Parametric Mapping: The Analysis of Functional Brain Images: The Analysis of Functional Brain Images
Cited by (42)
Functional brain network identification and fMRI augmentation using a VAE-GAN framework
2023, Computers in Biology and MedicineA deep learning method for autism spectrum disorder identification based on interactions of hierarchical brain networks
2023, Behavioural Brain ResearchLearning brain representation using recurrent Wasserstein generative adversarial net
2022, Computer Methods and Programs in BiomedicineCitation Excerpt :To understand the mapping of mind and brain, learning brain representation from fMRI, both of the temporal features and spatial features, has been under extensive active research in the past few years. In previous studies, researchers have proposed a variety of computational methods and tools for brain network mapping, such as general linear model (GLM) [3,4], independent component analysis (ICA) [5–7], and sparse dictionary learning (SDL) [8–11]. Among these methods, GLM is widely known in task-based fMRI data analysis, and ICA is dominant in resting-state fMRI data analysis.
Deep learning for brain disorder diagnosis based on fMRI images
2022, NeurocomputingCitation Excerpt :Similarities in the temporal fluctuations of BOLD signals are measured, and ROIs that share similar temporal patterns are considered as functionally connected. Functional connectivity is an observable phenomenon that can be quantified with mathematical measures such as the ROI based temporal correlation, aka seed-based method, extracted by using data driven methods such as independent component analysis (ICA), dictionary learning etc., or analyzed by using graph theory based methods [59–64]. In common, many functional connectivity networks are static by construction.
R-fMRI reconstruction from k–t undersampled data using a subject-invariant dictionary model and VB-EM with nested minorization
2020, Medical Image AnalysisCitation Excerpt :For connectivity analysis, Eavani et al. (2012) learn the dictionary using K-SVD (Aharon et al., 2006) that uses a Gaussian model on the residual, without spatial regularization on the dictionary fits. Similarly, for connectivity analysis, Lee et al. (2016) learn the dictionary by concatenating the R-fMRI time series of a group of subjects. Many methods use dictionary priors for reconstruction of general dynamic MRI (other than R-fMRI) from undersampled data.
White matter fiber analysis using kernel dictionary learning and sparsity priors
2019, Pattern Recognition
- 2
Data used in the preparation of this article were obtained from the Alzheimer's Disease Neuroimaging Initiative (ADNI) database (http://www.loni.usc.edu/ADNI). As such, the investigators within the ADNI contributed to the design and implementation of ADNI and/or provided data but did not participate in analysis or writing of this report. ADNI investigators include (complete listing available at http://www.loni.usc.edu/ADNI/Collaboration/ADNIAuthorshiplist.pdf).