EEG signal classification using universum support vector machine

doi:10.1016/j.eswa.2018.03.053

Expert Systems with Applications

Volume 106, 15 September 2018, Pages 169-182

https://doi.org/10.1016/j.eswa.2018.03.053

Highlights

• Electroencephalogram signal classification is performed using universum learning.
• The support vector machine classifier uses prior information from interictal signals.
• Several feature extraction techniques are used to compare the algorithms.
• Universum support vector machine is used for the first time for seizure classification.

Abstract

Support vector machine (SVM) has been widely used to classify electroencephalogram (EEG) signals for the diagnosis of neurological disorders such as epilepsy and sleep disorders. SVM shows good generalization performance for high dimensional data, which is attributed to its convex optimization problem. Incorporating prior knowledge about the data leads to a better optimized classifier, and different types of EEG signals provide information about the distribution of EEG data. To include this prior information in the classification of EEG signals, we propose a novel machine learning approach based on universum support vector machine (USVM). In our approach, the universum data points are selected from the EEG dataset itself: they are the interictal EEG signals. This removes the effect of outliers on the generation of universum data. Further, to reduce the computation time, we use our approach of universum selection with universum twin support vector machine (UTSVM), which has a lower computational cost than traditional SVM. To check the validity of the proposed methods, we use various feature extraction techniques to build different datasets consisting of healthy and seizure signals. Several numerical experiments are performed on the generated datasets, and the results of the proposed approach are compared with baseline methods. The proposed USVM and the proposed UTSVM show better generalization performance than SVM, USVM, twin SVM (TWSVM) and UTSVM. The proposed UTSVM achieved the highest classification accuracy, 99%, for the healthy and seizure EEG signals.

Introduction

Electroencephalogram (EEG) signal classification is a major challenge in the field of machine learning and signal processing. EEG is a widely used non-invasive technique for the detection of various types of brain disorders such as epileptic seizures and sleep disorders. In epilepsy, the extent of the disease ranges from partial to generalized seizures, which are reflected in the respective EEG. The different types of EEG signals are shown in Fig. 2. Researchers have used several signal processing techniques to improve feature extraction and classification of EEG signals. Among these feature extraction techniques, the wavelet transform is one of the most frequently used. The wavelet transform extracts frequency domain features with good localization in time, in contrast to the Fourier transform, where the signal analysis is done mainly in the frequency domain. In wavelet analysis, the approximation and detail coefficients are used to form the feature vector, as shown in Fig. 3. Different wavelet families are used for specific types of signals to better capture the characteristics of the signal. Adeli, Zhou, and Dadmehr (2003) proposed a computer aided diagnosis (CAD) method for epilepsy using discrete wavelet transform (DWT). They used the Daubechies wavelet with db-4 as the mother wavelet for feature extraction. Rosso et al. (2005) used orthogonal decimated discrete wavelet transform (ODWT) to detect maturational changes associated with childhood absence epilepsy. Ocak (2008) classified EEG signals using wavelet packet analysis and a genetic algorithm. The Daubechies wavelet-2 was used for the classification of five different EEG signals (Guler & Ubeyli, 2005). Subasi and Gursoy (2010) used principal component analysis (PCA), linear discriminant analysis (LDA) and independent component analysis (ICA) for feature extraction, and support vector machine (SVM) for classification.
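The wavelet feature extraction described above can be illustrated with a minimal sketch. It is not the authors' pipeline: for brevity it uses the Haar wavelet instead of the db-4 or db-2 wavelets named in the text, and the per-band summary statistics (mean absolute value and standard deviation) are hypothetical choices made only for illustration. The function names are likewise assumptions.

```python
import math

def haar_dwt_step(signal):
    """One level of the Haar DWT: returns (approximation, detail) coefficients."""
    if len(signal) % 2 != 0:
        raise ValueError("signal length must be even")
    s = math.sqrt(2.0)
    approx = [(signal[i] + signal[i + 1]) / s for i in range(0, len(signal), 2)]
    detail = [(signal[i] - signal[i + 1]) / s for i in range(0, len(signal), 2)]
    return approx, detail

def haar_wavedec(signal, levels):
    """Multi-level decomposition, ordered [cA_n, cD_n, ..., cD_1]."""
    coeffs = []
    approx = list(signal)
    for _ in range(levels):
        approx, detail = haar_dwt_step(approx)
        coeffs.insert(0, detail)   # coarser detail bands go in front
    coeffs.insert(0, approx)       # final approximation band comes first
    return coeffs

def band_features(coeffs):
    """Hypothetical feature vector: mean |c| and standard deviation per band."""
    features = []
    for band in coeffs:
        mean_abs = sum(abs(c) for c in band) / len(band)
        mean = sum(band) / len(band)
        std = math.sqrt(sum((c - mean) ** 2 for c in band) / len(band))
        features.extend([mean_abs, std])
    return features

# Toy 8-sample "signal"; a real EEG segment would be far longer.
signal = [4.0, 6.0, 10.0, 12.0, 8.0, 6.0, 5.0, 5.0]
coeffs = haar_wavedec(signal, levels=2)
features = band_features(coeffs)
```

With two decomposition levels the sketch yields three bands (one approximation, two detail), so the feature vector has six entries. Because the Haar transform is orthonormal, the coefficient energy equals the signal energy, which is a convenient sanity check.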

The proper selection of classification techniques is crucial for the automated diagnosis of patients with neurological diseases. Among the various classification algorithms, support vector machines (SVMs) (Cortes & Vapnik, 1995) have emerged as a powerful classification technique. SVM solves a convex optimization problem, which leads to a globally optimal solution. This is in contrast to the artificial neural network (ANN), which suffers from the problem of local minima. SVM also has a lower VC (Vapnik-Chervonenkis) dimension, which enables it to classify high dimensional data with fewer parameters to optimize. Many researchers have used SVM in the classification of EEG signals (Ma et al., 2016) and for the diagnosis of neurological diseases like epilepsy (Liu et al., 2012, Nicolaou and Georgiou, 2012, Zavar et al., 2011). Guo et al. (2011) classified mental tasks from the analysis of EEG signals using SVM. Least squares support vector machine (LSSVM) (Suykens & Vandewalle, 1999) is used in (Bajaj and Pachori, 2012, Joshi et al., 2014, Li and Wen, 2009, Sharma and Pachori, 2015) for the detection of epilepsy. LSSVM is also used for the classification of EEG signals with a clustering based approach (Li & Wen, 2011). For multiclass classification of EEG signals, Guler and Ubeyli (2007) proposed a support vector machine based model and showed that SVM gives better classification accuracy for EEG signals than the probabilistic neural network (PNN) and the multilayer perceptron neural network (MLPNN).

Weston, Collobert, Sinz, Bottou, and Vapnik (2006) proposed a universum support vector machine (USVM) to give the classifier prior information about the distribution of data. The universum data points do not belong to any of the classes and lie within an ε-insensitive tube between the two classes. This approach is also called ‘learning through contradiction’. Along with the hinge loss, USVM involves an ε-insensitive loss function for the universum points. This universum based approach has been applied to various real world applications. Long, Tang, and Tian (2016) classified investor sentiments using universum support vector machine. Gao, Tian, Shao, and Deng (2008) used universum SVM for the prediction of translation initiation in proteins. They used two approaches for selecting the universum: one based on a uniform distribution of noise and the other on random averaging of the data points. Hao and Zhang (2013) proposed an ensemble universum support vector machine for the detection of Alzheimer's disease from brain imaging data, using patients with mild cognitive impairment (MCI) as the universum. Text classification has also been performed using universum data (Liu, Hsaio, Lee, Chang, & Kuo, 2016).
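To make the combination of the two losses concrete, the following sketch evaluates a linear USVM objective for a fixed classifier. It is an illustration under stated assumptions rather than the formulation used in the paper: the parameter names `C`, `C_u` and `eps`, the function names, and the toy data are all hypothetical, and the sketch only evaluates the objective; it does not solve the optimization problem.

```python
def hinge_loss(margin):
    """Hinge loss on the signed margin y * f(x) of a labelled point."""
    return max(0.0, 1.0 - margin)

def eps_insensitive_loss(value, eps):
    """Zero while |f(x)| <= eps (inside the tube), linear outside it."""
    return max(0.0, abs(value) - eps)

def decision(w, b, x):
    """Linear decision function f(x) = w^t x + b."""
    return sum(wi * xi for wi, xi in zip(w, x)) + b

def usvm_objective(w, b, X, y, U, C, C_u, eps):
    """Regularizer + C * hinge on labelled data + C_u * tube loss on universum."""
    reg = 0.5 * sum(wi * wi for wi in w)
    labelled = sum(hinge_loss(yi * decision(w, b, xi)) for xi, yi in zip(X, y))
    universum = sum(eps_insensitive_loss(decision(w, b, u), eps) for u in U)
    return reg + C * labelled + C_u * universum

# Toy data (assumed): two labelled points and two universum points.
w, b = [1.0, 0.0], 0.0
X = [[2.0, 0.0], [-0.5, 0.0]]
y = [1, -1]
U = [[0.1, 0.0], [0.5, 0.0]]
objective = usvm_objective(w, b, X, y, U, C=1.0, C_u=1.0, eps=0.2)
```

In this toy case the first universum point lies inside the tube and contributes nothing, while the second lies outside and is penalized. This is how the universum term pushes the decision boundary toward the region where universum points sit.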

The major challenge with the universum based approach is the proper selection of universum data points. In Weston et al. (2006), the universum data is selected based on the similarity of digits in digit classification. For example, digit ‘3’ is chosen as the universum for classifying ‘5’ and ‘8’, since its shape is similar to both ‘5’ and ‘8’. Chapelle, Agarwal, Sinz, and Schölkopf (2008) presented an analysis for the selection of proper universum data. In (Bai & Cherkassky, 2008), universum samples are generated for the classification of faces using the random averaging approach, where the average of the pixels of two faces is used as the universum. In (Chen & Zhang, 2009), an in-between-universum (IBU) approach is proposed for the proper selection of the universum. The practical conditions for choosing the universum data are given in (Cherkassky and Dai, 2009, Cherkassky et al., 2011). In the recent decade, some nonparallel SVMs such as generalized eigenvalue proximal support vector machine (GEPSVM) (Mangasarian & Wild, 2006) and twin support vector machine (TWSVM) (Jayadeva, Khemchandani, & Chandra, 2007) have been proposed to reduce the computational complexity of standard SVM. Inspired by TWSVM, some scholars proposed variants of TWSVM (Khemchandani et al., 2016, Kumar and Gopal, 2009, Qi et al., 2013, Shao et al., 2011, Tanveer, 2015a, Tanveer, 2015b, Tanveer et al., 2016, Wang et al., 2015, Xu et al., 2017) to improve its performance and reduce its computational complexity. TWSVM is used for the first time in this work for the classification of seizure EEG signals. Qi, Tian, and Shi (2012) proposed a universum twin support vector machine (UTSVM) to reduce the computational complexity of USVM and used the random averaging approach for universum selection. Xu, Chen, and Li (2016) also used the random averaging scheme for selecting the universum data. Since the random averaging approach suffers from the effect of outliers, the method of generating universum data depends solely on the type of application and is currently an area of research.
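The random averaging scheme mentioned above can be sketched as follows. This is a hedged illustration, not code from any of the cited works: it assumes each universum point is the element-wise mean of one randomly chosen sample from each class, and the function name, `seed` parameter and toy data are hypothetical.

```python
import random

def random_averaging_universum(class_pos, class_neg, n_universum, seed=0):
    """Build universum points by averaging random cross-class pairs."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    universum = []
    for _ in range(n_universum):
        p = rng.choice(class_pos)
        q = rng.choice(class_neg)
        # Element-wise midpoint of the two feature vectors.
        universum.append([(a + b) / 2.0 for a, b in zip(p, q)])
    return universum

# Toy feature vectors (assumed); the last positive sample is an outlier.
class_pos = [[1.0, 1.0], [1.2, 0.8], [50.0, 50.0]]
class_neg = [[-1.0, -1.0], [-0.8, -1.2]]
universum = random_averaging_universum(class_pos, class_neg, n_universum=5)
```

Whenever the outlier is drawn, the resulting midpoint lands far from the region between the two classes, which illustrates why averaged universum points are sensitive to outliers. Selecting the universum directly from measured data avoids this averaging step.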

Motivated by the work on universum support vector machine in (Gao et al., 2008, Hao and Zhang, 2013, Long et al., 2016), we propose a novel approach to selecting the universum in the classification of EEG signals for seizure detection. Since universum based support vector machines have not previously been used for the classification of EEG signals, we also present an application of USVM and UTSVM to EEG signals. To classify EEG signals into the healthy and seizure (ictal) classes, the interictal EEG signals are chosen as the universum; these correspond to the EEG recorded in the time period between seizures in a patient with epilepsy. Our approach to EEG classification is tested on different datasets generated using various feature extraction techniques, and the results are compared with other existing methods.

In this work, all vectors are taken as column vectors. The inner product of two vectors is represented by a^t b, where a and b are vectors in the n-dimensional real space Rⁿ and a^t is the transpose of a. ||a|| and ||G|| represent the 2-norm of a vector a and a matrix G, respectively. e denotes the vector of ones of dimension m. I represents the identity matrix of appropriate size.

The rest of this paper is organized as follows: Section 2 discusses the formulations of USVM and UTSVM. Section 3 elaborates our proposed approach of USVM and UTSVM. Section 4 reports several numerical experiments, for both the discussed and the proposed approaches, on datasets generated from EEG signals using different feature extraction techniques. Finally, Section 5 gives the conclusions and possible future directions.

Section snippets

Related work

In this section, we briefly review USVM and UTSVM. For a detailed description, interested readers are referred to (Qi et al., 2012, Weston et al., 2006).

Proposed approach

Many classification approaches for EEG signals do not use prior information about the distribution of EEG data. As a result, these techniques may not achieve better generalization performance even when the most efficient feature extraction technique is used. The universum based approach supplies such prior information during the construction of the classifier. We therefore use a universum based approach with support vector machine to classify the EEG signals.

Numerical experiments

In this section, numerical experiments are performed for the classification of EEG signals of healthy state and seizure. The EEG dataset is taken from (Andrzejak et al., 2001) which is available online. The dataset consists of five sets viz. Z, O, N, F and S. Each set contains 100 single-channel EEG signals sampled at a sampling rate of 173.61 Hz and of 23.6 s duration. The sets Z and O are surface EEG recordings of five healthy volunteers with eyes open and closed respectively. The sets N and

Conclusions

On the basis of the experimental results, it can be stated that our universum based approach gives better generalization performance for the classification of EEG signals as compared to the existing approaches. Our method of selection of universum points has proved to be a promising approach for the classification of healthy and seizure EEG signals. Also, the effect of outliers on the universum is reduced by using the universum from the EEG dataset itself i.e., the seizure free EEG signal. The

Acknowledgements

This work was supported by Science and Engineering Research Board (SERB) as Early Career Research Award grant no. ECR/2017/000053 and Department of Science and Technology as Ramanujan fellowship grant no. SB/S2/RJN-001/2016. We gratefully acknowledge the Indian Institute of Technology Indore for providing facilities and support. We are thankful to the Ministry of Human Resource Development (MHRD), Govt. of India for providing Teaching Assistantship (TA) fellowship to Mr. Bharat Richhariya.

References (47)

H. Adeli et al. Analysis of EEG records in an epileptic patient using wavelet transform. Journal of Neuroscience Methods (2003)
I. Guler et al. Adaptive neuro-fuzzy inference system for classification of EEG signals using wavelet coefficients. Journal of Neuroscience Methods (2005)
V. Joshi et al. Classification of ictal and seizure-free EEG signals using fractional linear prediction. Biomedical Signal Processing and Control (2014)
R. Khemchandani et al. TWSVR: Regression via twin support vector machine. Neural Networks (2016)
N. Nicolaou et al. Detection of epileptic electroencephalogram based on permutation entropy and support vector machines. Expert Systems with Applications (2012)
H. Ocak. Optimal classification of epileptic seizures in EEG using wavelet analysis and genetic algorithm. Signal Processing (2008)
Z. Qi et al. Robust twin support vector machine for pattern classification. Pattern Recognition (2013)
Z. Qi et al. Twin support vector machine with universum data. Neural Networks (2012)
O.A. Rosso et al. Quantitative EEG analysis of the maturational changes associated with childhood absence epilepsy. Physica A: Statistical Mechanics and its Applications (2005)
R. Sharma et al. Classification of epileptic seizures in EEG signals based on phase space representation of intrinsic mode functions. Expert Systems with Applications (2015)
A. Subasi et al. EEG signal classification using PCA, ICA, LDA and support vector machines. Expert Systems with Applications (2010)
M. Zavar et al. Evolutionary model selection in a wavelet-based support vector machine for automated seizure detection. Expert Systems with Applications (2011)
R.G. Andrzejak et al. Indications of nonlinear deterministic and finite-dimensional structures in time series of brain electrical activity: Dependence on recording region and brain state. Physical Review E (2001)
X. Bai et al. Gender classification of human faces using inference through contradictions.
V. Bajaj et al. Classification of seizure and nonseizure EEG signals using empirical mode decomposition. IEEE Transactions on Information Technology in Biomedicine (2012)
M.S. Bartlett et al. Face recognition by independent component analysis. IEEE Transactions on Neural Networks (2002)
O. Chapelle et al. An analysis of inference with the universum.
S. Chen et al. Selecting informative Universum sample for semi-supervised learning.
V. Cherkassky et al. Empirical study of the Universum SVM learning for high-dimensional data.
V. Cherkassky et al. Practical conditions for effectiveness of the universum learning. IEEE Transactions on Neural Networks (2011)
C. Cortes et al. Support-vector networks. Machine Learning (1995)
J. Demšar. Statistical comparisons of classifiers over multiple data sets. Journal of Machine Learning Research (2006)
T. Gao et al. Accurate prediction of translation initiation sites by Universum SVM.
