Multi-source manifold feature transfer learning with domain selection for brain-computer interfaces

doi:10.1016/j.neucom.2022.09.124

Neurocomputing

Volume 514, 1 December 2022, Pages 313-327

https://doi.org/10.1016/j.neucom.2022.09.124

Abstract

Transfer learning uses knowledge from source domains to improve learning performance in the target domain, which is useful in electroencephalogram (EEG) based brain-computer interfaces (BCIs) with small training datasets. However, existing transfer learning methods for EEG-based BCIs mainly consider single-to-single (STS) domain transfer or simply merge different source domains into a larger one. In this paper, we propose a multi-source manifold feature transfer learning (MMFT) framework to transfer multi-source knowledge for EEG signal classification. MMFT minimizes the marginal probability distribution difference on the Riemannian manifold using Riemannian alignment and Grassmann manifold feature learning, then transfers the manifold features with conditional probability distribution adaptation in the structural risk minimization (SRM) function. Based on MMFT, w-MMFT is proposed to tackle the class imbalance issue for SRM, and label similarity analysis (LSA) is proposed to select source domains for MMFT, forming a new LSA-MMFT framework. Experimental results on six datasets demonstrate that the proposed MMFT achieves superior classification accuracy and computational efficiency compared to state-of-the-art methods. LSA-MMFT achieves more stable performance than two other domain selection methods.

Introduction

A brain-computer interface (BCI) offers a direct communication pathway for users to interact with the environment via their brain signals, which contain abundant information about the users’ cognitive states or intentions [1]. BCIs have great research importance and application value in many fields, such as auxiliary control, entertainment and rehabilitation training [2]. Electroencephalogram (EEG) is the most widely used technology for BCI systems because of its non-invasive nature and high temporal resolution. Different neurophysiological paradigms of EEG have been used to operate BCIs, such as motor imagery (MI), event-related potentials (ERPs) and steady-state visual evoked potentials (SSVEP) [3]. In MI tasks, users imagine movements of their body parts (e.g., hands, feet, and tongue) without actually performing the movement or even tensing the muscles, which modulates brain rhythms in the involved cortical areas. In ERP tasks, the user is presented with many non-target stimuli and a few target stimuli, and a specific ERP pattern appears in the EEG response after the user perceives a target stimulus. However, EEG signals are notoriously difficult to analyze due to their low signal-to-noise ratio and significant subject-to-subject variations. Therefore, it is of great interest to extract and recognize EEG features using machine learning methods with shallow and deep structures [4], [5], [6], [7], [8].

Despite several recent advances, most BCI systems still face the major challenge of long calibration time. Different users have different neural responses to the same stimulus, and even the same user responds differently to the same stimulus, especially at different times or locations. Moreover, calibrating a BCI system requires sufficient labeled data to train the subject-specific model, so the calibration phase is usually time-consuming. Transfer learning helps BCIs overcome these challenges [9]. The knowledge in one or more source domains is used to help classification in a target domain, which improves calibration performance. Cross-subject classification has become one of the most popular directions, in which the generalization ability of the classification model is enhanced by improving cross-subject performance. Recently, there has been a trend to utilize the covariance matrices of EEG trials, which are symmetric positive definite (SPD) and can be viewed as points on an SPD manifold; this has strengthened the role of Riemannian geometry in BCIs. The Riemannian alignment (RA) framework was proposed to align EEG covariance matrices from different source domains; it quickly reduces the distribution difference between the source and target domains by changing the reference position on the SPD manifold [10]. However, Riemannian alignment approaches are computationally expensive and not compatible with machine learning approaches defined in Euclidean space.

Transfer learning methods for BCIs usually aim to solve single-to-single (STS) transfer problems. In practice, a good source domain can help to obtain high classification accuracy even with a simple transfer learning algorithm, so the quality of the source domain is important. Multiple source domains are in fact often available for EEG signal classification, such as labelled data from other subjects or from earlier sessions. With multiple source domains, good source domains are more likely to be found. Because more data are available in multi-source transfer, good source domains can reduce the negative transfer (NT) caused by bad source domains. Therefore, multi-source transfer learning usually achieves more stable and higher classification accuracy than STS transfer learning. Multi-source unsupervised domain adaptation has proved valuable for improving performance [11], [12], [13].

Despite the progress mentioned above, some difficulties and challenges remain unresolved. Firstly, traditional transfer learning methods usually seek to adapt marginal and conditional probability distributions [14], [15]. As an efficient approach to EEG transfer, Riemannian alignment usually aligns distribution centers and cannot completely minimize the marginal probability distribution difference. If the aligned features are to be transferred further by traditional transfer learning methods, the marginal probability distribution still needs to be adapted in the subspace. Moreover, existing transfer learning methods for BCIs lack information exploration across multiple source domains. A multi-source transfer learning framework with a simple structure that integrates traditional transfer learning methods with the Riemannian alignment approach is therefore needed. Secondly, class imbalance exists in many BCI scenarios. Most transfer learning methods ignore this issue by treating the classes as balanced across domains, or they only handle the bias in one domain [14], [16], which may hinder the effectiveness of transfer learning. How to handle class imbalance in transfer learning for BCIs therefore needs to be addressed. Thirdly, to overcome NT, many domain selection methods have been proposed, but most of them are based on mathematical similarity calculation [17], [18]; they explore the relationship between one source domain and the target domain but not the relationships among multiple source domains.

In contrast to existing work, this paper presents a multi-source manifold feature transfer learning (MMFT) framework with a simple structure and strong stability. More specifically, in the proposed MMFT the distribution means of the source and target domains are aligned on the SPD manifold, and the tangent space features are then extracted using tangent space mapping. Next, the tangent space features are reconstructed into Grassmann manifold features via the geodesic flow kernel (GFK) approach [18]. Finally, the classifier of MMFT is trained by adapting the conditional probability distribution in the structural risk minimization (SRM) function. Compared with existing multi-source transfer learning algorithms, the new method has several noteworthy advantages:

1) MMFT minimizes the marginal probability distribution difference on the Riemannian manifold, and then transfers the manifold features with conditional probability distribution adaptation. To the best of our knowledge, this is the first attempt to align the two probability distributions separately in the field of EEG signal classification, which avoids calculating their weights. Additionally, MMFT has a simple structure with only three hyperparameters to be determined, and its classification accuracy is not sensitive to changes in these hyperparameters.
2) To better align conditional probability distributions for each pair of source and target domains, MMFT uses a voting mechanism to transfer knowledge from each source domain to the target domain individually, instead of transferring all source domains together.
3) To tackle class-imbalanced scenarios, a weighted MMFT algorithm is proposed, which boosts the performance of MMFT on imbalanced training datasets.
4) To reduce negative transfer in multi-source transfer learning, a label similarity analysis for MMFT (LSA-MMFT) is proposed for domain selection, which improves the classification accuracy and the time efficiency of MMFT.
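Items 2) and 3) can be made concrete with a small sketch. The exact voting rule and weighting formula of MMFT and w-MMFT are not given in this excerpt, so the code below assumes plain majority voting over the target-label predictions of the per-source classifiers, and inverse-frequency sample weights for the imbalanced case. The function names `majority_vote` and `inverse_frequency_weights` are hypothetical.

```python
import numpy as np


def inverse_frequency_weights(labels):
    """Per-sample weights that give every class the same total weight.

    Assumption: weight = n_samples / (n_classes * class_count), so a class
    with fewer samples receives a larger weight.
    """
    labels = np.asarray(labels)
    classes, counts = np.unique(labels, return_counts=True)
    class_weight = len(labels) / (len(classes) * counts)
    return class_weight[np.searchsorted(classes, labels)]


def majority_vote(predictions):
    """Combine per-source predictions for the target trials.

    predictions: array of shape (n_sources, n_target_trials) holding the
    label predicted for each target trial by the classifier trained on
    each source domain. Ties go to the smallest label.
    """
    predictions = np.asarray(predictions)
    classes = np.unique(predictions)
    # votes[k, j] = number of sources that predicted classes[k] for trial j
    votes = (predictions[None, :, :] == classes[:, None, None]).sum(axis=1)
    return classes[votes.argmax(axis=0)]
```

In such a scheme each source domain is adapted to the target domain on its own, and only the resulting label predictions are combined, so a single poorly matched source can be outvoted by the others.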

The remainder of this paper is organized as follows. Section 2 introduces related work on Riemannian geometry, subspace adaptation and domain selection. Section 3 describes the details of the proposed MMFT, w-MMFT and LSA-MMFT. In Section 4, experiments verify the performance of MMFT compared with several state-of-the-art transfer learning approaches, and the performance of LSA-MMFT compared with two other state-of-the-art domain selection approaches. Finally, the conclusion is given in Section 5.

Section snippets

Riemannian geometry

Riemannian geometry is the branch of mathematics that studies smoothly curved spaces that locally behave like Euclidean spaces. There are three main types of Riemannian manifolds, i.e., the symmetric positive definite (SPD) manifold, the Stiefel manifold, and the Grassmann manifold [19]. The proposed MMFT minimizes the marginal probability distribution difference by aligning different domains on the SPD manifold and learning Grassmann features on the Grassmann manifold [18]. Three abovementioned manifolds can be
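As an illustration of how a point on the SPD manifold can be turned into an ordinary Euclidean feature vector, the following sketch implements a common form of tangent space mapping. It is a sketch under stated assumptions rather than the paper's implementation: the reference point defaults to the identity (which is appropriate after the domains have been re-centered), and off-diagonal entries are scaled by the square root of 2 so that the Euclidean norm of the vector equals the Frobenius norm of the matrix logarithm. The function name `tangent_vector` is hypothetical.

```python
import numpy as np


def _sym_func(mat, fn):
    """Apply a scalar function to the eigenvalues of a symmetric matrix."""
    eigvals, eigvecs = np.linalg.eigh(mat)
    return (eigvecs * fn(eigvals)) @ eigvecs.T


def tangent_vector(cov, ref=None):
    """Map an SPD matrix to a vector in the tangent space at `ref`.

    cov: SPD matrix of shape (n, n).
    ref: SPD reference point; the identity is assumed when omitted.
    Returns a vector of length n * (n + 1) / 2.
    """
    n = cov.shape[0]
    if ref is None:
        ref = np.eye(n)
    inv_sqrt_ref = _sym_func(ref, lambda v: v ** -0.5)
    log_cov = _sym_func(inv_sqrt_ref @ cov @ inv_sqrt_ref, np.log)
    rows, cols = np.triu_indices(n)
    # Off-diagonal entries appear twice in the symmetric matrix, hence sqrt(2).
    scale = np.where(rows == cols, 1.0, np.sqrt(2.0))
    return scale * log_cov[rows, cols]
```

The resulting vectors live in a Euclidean space, so they can be passed to standard classifiers or to subspace methods defined on Euclidean features.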

Proposed method

The MMFT framework is proposed for EEG signal classification in this section. The goal is to transfer the knowledge from multiple source subjects to the target subject. To take advantage of the properties of the Riemannian manifold and of knowledge transfer from multiple source domains, Section 3.1 introduces the MMFT framework. To solve the class imbalance problem, Section 3.2 proposes w-MMFT, in which the class with fewer samples in the source domain receives a larger weight. To

Experiment and evaluations

In this section, we evaluate the performance of MMFT and LSA-MMFT on six EEG datasets in comparison with several state-of-the-art algorithms. The code is available at https://github.com/rick604/MMFT. All evaluations are implemented in MATLAB R2017a on a laptop with an i7-6700HQ CPU @ 2.60 GHz, 16 GB memory, and 64-bit Windows 10 Home Edition.

Conclusion

Transfer learning is a useful tool in EEG-based BCIs to tackle variations among different subjects and/or tasks. This paper considered offline unsupervised multi-source EEG classification. The proposed MMFT framework learned manifold features to minimize the marginal probability distribution difference, and then transferred the manifold features using a voting mechanism. Moreover, we addressed the class imbalance problem by proposing weighted MMFT (w-MMFT). The experimental results and its analogous

CRediT authorship contribution statement

Qingshan She: Methodology, Conceptualization, Funding acquisition. Yinhao Cai: Writing - original draft, Software, Validation. Shengzhi Du: Writing - review & editing, Formal analysis. Yun Chen: Project administration.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgements

This work was partly supported by the Zhejiang Provincial Natural Science Foundation of China under Grant No. LZ22F010003 and by the National Natural Science Foundation of China under Grant Nos. 61871427, 61971168 and 62171171.

Qing-Shan She was born in Hubei Province, China. He received the B.E. degree in 2001 and M.E. degree in 2004, both from Lanzhou University of Technology, Lanzhou, China and the Ph.D. degree in 2007 from Zhejiang University, Hangzhou, China. He is currently a Professor with the School of Automation, Hangzhou Dianzi University, Hangzhou, China. His research interests include machine learning and pattern recognition, brain-computer interface and its applications.

References (47)

X. Zhang et al.
Sub-band target alignment common spatial pattern in brain-computer interface
Comput. Methods Programs Biomed.
(2021)
A. Barachant et al.
Classification of covariance matrices using a Riemannian-based kernel for BCI applications
Neurocomputing
(2013)
Y. Cai et al.
Motor imagery EEG decoding using manifold embedded transfer learning
J. Neurosci. Methods
(2022)
B. Blankertz et al.
The non-invasive Berlin brain–computer interface: fast acquisition of effective performance in untrained subjects
NeuroImage
(2007)
A. Delorme et al.
EEGLAB: an open source toolbox for analysis of single-trial EEG dynamics including independent component analysis
J. Neurosci. Methods
(2004)
M.M. Shanechi
Brain–machine interfaces from motor to mood
Nature Neurosci.
(2019)
T.J.C. Limchesing, A.E.C. Chua, C.J.L. Shi, R.G. Baldovino, F.E.T. Munsayac, N.T. Bugtai, A review on recent...
R.P. Rao
Brain-computer interfacing: an introduction
(2013)
H. Cheng, Z. Wang, Z. Wei, L. Ma, X. Liu, On adaptive learning framework for deep weighted sparse autoencoder: A...
Y. Chen et al.
Exponential synchronization for delayed dynamical networks via intermittent control: dealing with actuator saturations
IEEE Trans. Neural Networks Learn. Syst.
(2018)
V.J. Lawhern et al.
EEGNet: a compact convolutional neural network for EEG-based brain–computer interfaces
J. Neural Eng.
(2018)
W. Liu et al.
A PSO-based deep learning approach to classifying patients from emergency departments
Int. J. Mach. Learn. Cybern.
(2021)
D. Wu et al.
Transfer learning for EEG-based brain–computer interfaces: A review of progress made since 2016
IEEE Trans. Cognit. Develop. Syst.
(2020)
P. Zanini et al.
Transfer learning: A Riemannian geometry framework with applications to brain–computer interfaces
IEEE Trans. Biomed. Eng.
(2017)
M. Tiwari et al.
Multi-source based approach for visual domain adaptation
M.J. Sorocky et al.
Experience selection using dynamics similarity for efficient multi-source transfer learning between robots
Z.-G. Liu et al.
Combination of transferable classification with multisource domain adaptation based on evidential reasoning
IEEE Trans. Neural Networks Learn. Syst.
(2020)
S.J. Pan et al.
Domain adaptation via transfer component analysis
IEEE Trans. Neural Networks
(2010)
M. Long et al.
Transfer feature learning with joint distribution adaptation
H. Yan et al.
Mind the class weight bias: Weighted maximum mean discrepancy for unsupervised domain adaptation
W. Zhang et al.
Manifold embedded knowledge transfer for brain-computer interfaces
IEEE Trans. Neural Syst. Rehabil. Eng.
(2020)
B. Gong, Y. Shi, F. Sha, K. Grauman, Geodesic flow kernel for unsupervised domain adaptation, in: 2012 IEEE conference...
S. Lang
(2012)

Ying-Hao Cai was born in Zhejiang Province, China. He received the B.E. degree from Jiangsu University, Zhenjiang, China, in 2020. He is currently pursuing the M.E. degree in control science and engineering with Hangzhou Dianzi University, Hangzhou, China. His research interests include transfer learning and brain-computer interfaces.

Sheng-Zhi Du received the M.S. degree in control theory and control engineering from Tianjin Poly Technology University, Tianjin, China, in 2001 and the Ph.D. degree in control theory and control engineering from Nankai University, Tianjin, China, in 2005. He is currently a professor at the French South Africa Institute of Technology (F’SATI), Tshwane University of Technology, South Africa. His research interests include computer vision, AI, pattern recognition, and human-in-the-loop systems.

Yun Chen received the B.E. degree in 1999 from Central South University of Technology (Central South University), Changsha, China, and the M.E. degree in 2002 and the Ph.D. degree in 2008, both from Zhejiang University, Hangzhou, China. He is currently a Professor with the School of Automation, Hangzhou Dianzi University, Hangzhou, China. His research interests include stochastic systems, robust control and filtering.
