Abstract
For decades, task-based functional MRI (tfMRI) has been widely used in exploring functional brain networks and modeling brain activities. A variety of brain activity analysis methods for tfMRI data have been developed. However, these methods are mainly shallow models and are limited in faithfully modeling the complex spatial-temporal diverse and concurrent functional brain activities. Recently, recurrent neural networks (RNNs) demonstrate great superiority in modeling temporal dependency signals and autoencoder models have been proven to be effective in automatically estimating the optimal representations of the original data. These characteristics meet the requirement of modeling hemodynamic response patterns in tfMRI data. In order to take the advantages of both models, we proposed a novel unsupervised framework of deep recurrent autoencoder (DRAE) for modeling tfMRI data in this work. The basic idea of the DRAE model is to combine the deep recurrent neural network and autoencoder to automatically characterize the meaningful functional brain networks and corresponding diverse and complex hemodynamic response patterns underlying tfMRI data simultaneously. The proposed DRAE model has been tested on the motor tfMRI dataset of HCP 900 subjects release and all seven tfMRI datasets of HCP Q1 release. Extensive experimental results demonstrated the great superiority of the proposed method.
S. Zhao and Y. Cui—Co-first authors.
You have full access to this open access chapter, Download conference paper PDF
Similar content being viewed by others
Keywords
1 Introduction
Task-based functional magnetic resonance imaging (tfMRI) has been a powerful and popular noninvasive neuroimaging methodology for the study of brain activity patterns and cognitive behaviors of the human brain [1]. To model the very informative but complex tfMRI time series data, a variety of brain network reconstruction and hemodynamic response modeling techniques have been developed in the literature. These methods include model-driven methods such as the general linear model (GLM) [2], and data-driven methods like principal component analysis (PCA) [3], independent component analysis (ICA) [4] and sparse representation/dictionary learning methods [5,6,7]. In general, these methods reconstructed hundreds of meaningful functional brain networks and corresponding hemodynamic response patterns from tfMRI datasets and greatly advanced our understanding of the regularity of brain activities [2, 5].
However, the ability to represent and describe the tfMRI data still limited the performance of hemodynamic response patterns modeling. Therefore, developing a descriptive model that can sufficiently deal with diverse and complex tfMRI data, as well as large noises, is the key towards automatic, effective and accurate modeling of those hemodynamic response patterns in tfMRI data. Under tfMRI condition, participants need to participate in predefined sequential tasks during the whole scan session and the functional brain activity is modulated from the interactions of brain networks in different time periods, which quite coincides with the characteristics of recurrent neural network (RNN) models [8]. Therefore, it is straightforward and justified to adopt RNNs to explore and represent hemodynamic response patterns. RNNs are feed forward neural networks, which can use their internal memory units to process arbitrary sequences of inputs and model the sequential and time dependencies. They are connectionist models with the ability to selectively pass information across sequence steps. In order to characterize the tfMRI brain activities, a deep recurrent neural network (DRNN) model [9] was proposed to reconstruct the whole brain tfMRI signals from stimulus task design patterns. This framework not only identified typical brain networks by traditional methods (e.g., GLM), but also simultaneously obtain a variety of temporal brain activity patterns at multiple time scales. These results proved the great advantage of RNN model in charactering the temporal dependency signals in tfMRI data. However, the DRNN model still highly relies on the prior knowledge of task stimulus patterns which greatly limited the analysis power of the model.
In order to overcome current limitations in DRNN model, in this study, we proposed a novel unsupervised framework of deep recurrent autoencoder (DRAE) for modeling diverse and complex hemodynamic response patterns in tfMRI data. The basic idea is combing the DRNN model and autoencoder to automatically estimate the optimal task stimulus patterns of the tfMRI data and reconstruct the meaningful functional brain networks simultaneously. Autoencoder [10] is an unsupervised model that automatically learns a latent or compressed representation of the input data by minimizing the error between the input and its reconstruction. In this study, we take advantage of the autoencoder to automatically estimate the task stimulus patterns from the original tfMRI data. When the model is converged, the learned weight matrix between the FC layers and reconstructed signals represents the spatial distributions of functional brain networks underlying the tfMRI data and the output of the top RNN layer in decoding part represents the diverse and complex hemodynamic response patterns under the task condition. We adopted the motor tfMRI dataset of HCP 900 subjects release and the whole HCP Q1 release tfMRI datasets as test beds. Extensive experimental results demonstrated that the proposed DRAE model can not only automatically estimate the task stimulus patterns, but also reconstruct the meaningful functional brain networks and corresponding complex and concurrent hemodynamic response patterns with different time delays.
2 Materials and Methods
2.1 Data Acquisition and Pre-processing
The Human Connectome Project (HCP) dataset has been considered as one of the most systematic and comprehensive neuroimaging datasets. Importantly, this dataset is publicly available which makes it a good test bed for different research studies. The design paradigms are available in [11]. There are 68 subjects in HCP Q1 release dataset and over 800 subjects in HCP 900 subjects release dataset. The detailed acquisition parameters of these HCP tfMRI data are as follows: 220 mm FOV, in-plane FOV: 208 × 180 mm, flip angle = 52, BW = 2290 Hz/Px, 2 × 2×2 mm spatial resolution, 90 × 104 matrix, 72 slices, TR = 0.72 s, TE = 33.1 ms. The preprocessing of the task fMRI data sets includes skull removal, motion correction, slice time correction, spatial smoothing, and global drift removal (high-pass filtering). All these preprocessing steps were implemented in FSL FEAT. All of these individual fMRI datasets are first registered to the MNI common space for further study.
2.2 Deep Recurrent Autoencoder
The proposed deep recurrent autoencoder (DRAE) model is summarized as Fig. 1. It consists of two components, the encoder (Fig. 1(b)) and the decoder (Fig. 1(d)). First, for each subject, the extracted and normalized whole brain tfMRI signals are aggregated into a big signal matrix (m voxels’ signals with t time series, Fig. 1(a)). During the encoding stage, the signal matrix is compressed and mapped into a lower dimensional subspace representing a latent structure through a fully connected layer ([m, k], k < m), and then propagated through stacked RNN layers (k input units and n output units) to extract a feature map (n features with t time series, Fig. 1(c)). Next, the decoder passes the extracted feature map through another group of stacked RNN layers (n input units and k output units) to simulate diverse and complex brain activities and then maps the output of the top RNN layer into higher dimensional space (same as the original signals) by a fully connected layer ([k, m]) to reconstruct the whole brain signals (Fig. 1(e)). Specifically, the sequential output of each unit in the top RNN layer represents a temporal brain activity pattern and the corresponding weight vector in the fully connected layer which connects this unit to the reconstructed signals represents the spatial distribution of a functional brain network. The objective of the DRAE model is to minimize the reconstruction errors over all subjects of the training dataset, and the entire training progress is completely data-driven and unsupervised.
Pipeline of the DRAE model. (a) Signal matrix of whole brain tfMRI signals. (b) The encoder which consists of a fully connected layer and deep RNN layers. (c) Extracted features maps. (d) The decoder which consists of deep RNN layers and a fully connected layer. (e) Matrix of reconstructed whole brain signals.
2.3 Estimation of Hemodynamic Responses
In order to further explore the hemodynamic brain response patterns, for each feature, we replaced the feature map with a testing one (Fig. 2(a)) by keeping one selected feature a single impulse and setting the others to zeros, and propagated it through the trained decoder. The decoder was stimulated by the impulse and simulates the brain activities, and the output of each unit in the top RNN layer (Fig. 2(b)) represents the hemodynamic response of the corresponding functional brain network to the certain feature.
A sketch map of exploring derive hemodynamic response patterns. (a) Testing feature map which keeps one feature a single impulse and set the others to zeros to stimulate the trained decoder. (b) Output of each unit in the top RNN layer representing the hemodynamic response patterns to the certain feature.
3 Experimental Results
In this work, the training was applied on the DRAE model with 2 RNN layers of 32 LSTM units and a feature size of 16. To be specific, we extracted 244,341 voxels’ signals for the motor tfMRI dataset of HCP 900 subjects release and 223,945 voxels’ signals for the HCP Q1 release tfMRI dataset. During the training stage, all subjects’ signals were used during the training stage, since training on grouped subjects’ data will help avoid overfitting and either L1- or L2-norm regularization will increase the training loss rapidly, only MSE was taken as the loss function.
3.1 Interpretation of Feature Maps and Spatial Patterns
After training of the DRAE model, a group of feature maps can be obtained for each subject. That is, the whole brain activities can be divided and represented by several feature activities. Since individual feature maps are unique, we work out group feature maps by calculating the group-average values for further interpretation. Among these group-averaged feature maps, a few feature maps which are quite correlated with the task design patterns were identified, as shown in Fig. 3, which suggests that the DRAE model has the ability to extract the whole brain activities to a lower dimensional representation.
When the model is converged, we can also obtain a trained weight matrix of the fully connected layer from the decoder. Specifically, each vector of the weight matrix represents the spatial distribution of a typical functional brain network. Figure 4 illustrates a few identified functional brain networks on the motor task of HCP tfMRI datasets using the DRAE model. As shown, the DRAE model can identify similar functional brain networks for almost all task designs and high spatial overlap rates suggest the reliability of the proposed method. Specifically, the spatial overlap rate is defined as the intersection of the identified brain networks and corresponding GLM activation results. However, the DRAE model was trained in a completely unsupervised process and the functional brain networks can be obtained without prior knowledge of the task designs. These results demonstrates the superiority of the proposed method in modeling functional brain networks.
3.2 Hemodynamic Response Patterns
After model training, the feature maps represent the task stimulus patterns of the whole brain signals and these signals can be reconstructed from this feature maps. The decoder simulates the complex brain activities with its hierarchical inner cells of stacked RNN units. Diverse hemodynamic response activities are invoked by these features, thus the output of each unit in the top RNN layer just represents a temporal response pattern.
When the formatted feature maps were passed through the decoder as described in Sect. 2.3, both positive and negative response patterns were obtained. These response patterns look similar to the theoretical HRF responses but have different shapes and time delays. Figure 5 detailed shows a few typical and HRF-correlated hemodynamic response patterns. There are several minor differences among these patterns, for example, they have different raising speed when meeting the impulse; some patterns start falling down after a period of time while some don’t fall down until the falling edge of the impulse; some patterns have significant undershoots while some don’t have. These hemodynamic response patterns are all possible and meaningful brain activity patterns which are more specific but still interpretable, which suggests that the proposed DRAE model can obtain many more meaningful brain activities through an unsupervised method on unlabeled datasets. In order to further analyze the hemodynamic response patterns, we drew the delay estimation maps trained on the DRAE model with different depths of RNN layers (Fig. 6). Table 1 represents the average values and standard deviations of the response delays. Since positive and negative responses have similar response time delays, the negative response patterns were inverted when calculating the response delays. For the DRAE model with just one RNN layer, peak delays are almost before the falling edge (12.29 s delay) of the impulse, since the network is very simple. As RNN depth goes deeper, response peaks have larger time delays around the falling edge and larger standard deviations. These response patterns also have various undershoots delays. In general, with deep recurrent layers, the DRAE model is able to estimate diverse and complex hemodynamic response patterns.
Estimation of the hemodynamic response delays. X-axis represents delay of peaks, Y-axis represents delay of undershoots; red dots indicate positive response patterns, blue dots indicate negative response patterns; solid lines represent raising edge of the testing impulse and dashed lines represent falling edge of the testing impulse. Subfigures (a) to (f) show the results of 1, 2, 3, 4, 6 and 8 RNN layer(s), respectively. (Color figure online)
4 Discussion and Conclusion
In this work, we proposed a novel framework of deep recurrent autoencoder (DRAE) for modeling diverse and complex hemodynamic response patterns and functional brain networks. The proposed DRAE model combines the deep recurrent neural network (DRNN) model and autoencoder to automatically estimate the optimal task stimulus patterns of the tfMRI data, reconstruct the meaningful functional brain networks and characterize the corresponding hemodynamic response patterns underlying tfMRI data simultaneously. Diverse and complex hemodynamic response patterns can be obtained, which brings a new way to reverse engineering of the brain’s response function patterns. Furthermore, with deeper stacked RNN layers, the DRAE model is able to simulate more complex hemodynamic response patterns with different time delay estimations. In general, extensive experiment results demonstrated the superiority and effectiveness of our proposed method.
References
Logothetis, N.K.: What we can do and what we cannot do with fMRI. Nature 453, 869–878 (2008)
Friston, K.J., Holmes, A.P., Worsley, K.J., Poline, J.P., Frith, C.D., Frackowiak, R.S.: Statistical parametric maps in functional imaging: a general linear approach. Hum. Brain Mapp. 2, 189–210 (1994)
Andersen, A.H., Gash, D.M., Avison, M.J.: Principal component analysis of the dynamic response measured by fMRI: a generalized linear systems framework. Magn. Reson. Imaging 17, 795–815 (1999)
Biswal, B.B., Ulmer, J.L.: Blind source separation of multiple signal sources of fMRI data sets using independent component analysis. J. Comput. Assist. Tomogr. 23, 265–271 (1999)
Lv, J., et al.: Holistic atlases of functional networks and interactions reveal reciprocal organizational architecture of cortical function. IEEE Trans. Biomed. Eng. 62, 1120–1131 (2015)
Zhao, S., et al.: Supervised dictionary learning for inferring concurrent brain networks. IEEE Trans. Med. Imaging 34, 2036–2045 (2015)
Zhang, W., et al.: Experimental comparisons of sparse dictionary learning and independent component analysis for brain network inference from fMRI data. IEEE Trans. Biomed. Eng. 66, 289–299 (2018)
Lipton, Z.C., Berkowitz, J., Elkan, C.: A critical review of recurrent neural networks for sequence learning. Computer Science (2015)
Cui, Y., et al.: Identifying brain networks at multiple time scales via deep recurrent neural network. IEEE J. Biomed. Health Inform. (2018)
Bourlard, H., Kamp, Y.: Auto-association by multilayer perceptrons and singular value decomposition. Biol. Cybern. 59, 291–294 (1988)
Barch, D.M., et al.: Function in the human connectome: task-fMRI and individual differences in behavior. Neuroimage 80, 169–189 (2013)
Acknowledgements
This work was supported by the National Science Foundation of China (61806167, 61603399, 31627802 and U1801265), the Fundamental Research Funds for the Central Universities (3102019PJ005), Natural Science Basic Research Plan in Shaanxi Province of China (2019JQ-630) and the China Postdoctoral Science Foundation (2019T120945).
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Zhao, S. et al. (2019). Exploring Brain Hemodynamic Response Patterns via Deep Recurrent Autoencoder. In: Zhu, D., et al. Multimodal Brain Image Analysis and Mathematical Foundations of Computational Anatomy. MBIA MFCA 2019 2019. Lecture Notes in Computer Science(), vol 11846. Springer, Cham. https://doi.org/10.1007/978-3-030-33226-6_8
Download citation
DOI: https://doi.org/10.1007/978-3-030-33226-6_8
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-33225-9
Online ISBN: 978-3-030-33226-6
eBook Packages: Computer ScienceComputer Science (R0)