Classification of human hand movements based on EMG signals using nonlinear dimensionality reduction and data fusion techniques
Introduction
Machine learning is integrated nowadays in many data driven application, such as signal and image processing tasks, due to its generic and flexible methodologies. The rapid development of advanced and sophisticated signal acquisition tools, which results with large amounts of complex data, gave rise to new branches in machine learning that deal with compact modeling of these data sets. Coupling dimensional reduction methods with time series processing is a natural match and recent work in diverse fields (Dov, Talmon, & Cohen, 2016; Dsilva, Talmon, Coifman, & Kevrekidis, 2018; Duncan, Talmon, Zaveri, & Coifman, 2013; Rabin, Bregman, Lindenbaum, Ben-Horin, & Averbuch, 2016; Talmon, Cohen, Gannot, & Coifman, 2013) justify this claim. Algorithms that utilize machine learning for signal processing typically include a number of consecutive steps. First, features are extracted from the time series signal. Next, when the dimension of the feature vector is large, a dimensionality reduction method is used to construct a compact representation of the dataset. Last, a supervised or an unsupervised method is applied to predict the class of a new recorded signal.
The use of electromyography (EMG) signal allows convenience and non-invasive access signal acquisition resulting in new developed methods for classification and feature extraction. Identification of hand movements based on EMG measurements have been largely used in the field of computer and automatic video games, robotic exoskeleton, operative devices and for power prostheses (Batzianoulis et al., 2017; Gailey, Artemiadis, & Santello, 2017; Na, Kim, Jo, & Kim, 2017) and thus, has been the subject of many studies over the past few years.
A large number of these studies focus in features selection for EMG movement classifications and included in their methods is a dimensionality reduction step followed by a machine learning based classifier. The dimensionality reduction step is typically performed by a linear dimensionality reduction technique such as Principal Component Analysis (PCA), which is limited in representing complex data that has in it nonlinear relationships. Nonlinear dimensionality techniques enable to preserve the local structures in the original feature space and by that overcome some of PCA's limitations, due to the method's global and linear nature. However, while the extension to new data points in the linear dimensionality reduction methods is straightforward, methods based on the Nyström technique (Coifman and Lafon, 2006b) are used for the extension of nonlinear models.
Selection of significant discriminative features for EMG hand movement classification is discussed in several recent papers. Phinyomark et al. (2013) computed 50 features from EMG data that was measured in a period of 21 days from a single male subject, to identify a feature set that is stable over a long period of time. Using a linear discriminate analysis classification algorithm it was found that the sample entropy was the most stable feature. In another study (Tsai, Luh, & Lin, 2015) it was found that the Short time Fourier transform (STFT) ranking feature gives higher classification rates (93.9%) compared with conventional EMG features as mean absolute value, zero crossing, slope sign change, waveform length, auto regression, median and mean spectrum frequency (33–90.8%) for motion pattern recognition from multi-channel EMG signals of 6 muscles. The linear PCA method was applied in this work for dimensionality reduction and the Support Vector Machines (SVM) for the classification. Other studies investigate the best features selection for pattern recognition classification of myoelectric limb prostheses (Al-Angari, Kanitz, Tarantino, & Cipriani, 2016; Khushaba, Al-Timemy, Kodagoda, & Nazarpour, 2016). Khushaba et al. (2016) studied the effect of both forearm orientation and muscle contraction on the classification accuracy using wavelet transform based features and time and frequency domain features. Al-Angari et al. (2016) investigate the best pairs of features and channels for the classification of 5 hand postures at 9 different arm positions. Using 10 features from the time and frequency domain two methods for feature-channels pairs selection were compared, distance based and correlation-based feature selection. In both studies the results were evaluated for each subject separately, based only on his/her train data. The performance of a convolutional neural network classifier was compared with two other classifiers (linear discriminant analysis and stacked sparse autoencoders) for the classification of eight hand motions that were repeatedly performed by each of seven subjects. Features from the time domain were computed and fed as inputs to the classifiers and the results, in line with the previous all above works were evaluated for each subject separately.
Several studies have also been conducted over the years to investigate the ability to identify hand movements using different time and frequency domain EMG features. These EMG features were used in a study aimed to identify six daily hand movements (Sapsanis, Georgoulas, & Tzes, 2013). Two dimensionality reduction techniques, PCA (Pearson, 1901) and RELIEF (Kira et al., 1992) were used in this study to reduce the dimension of the feature space. Then, the classification was carried out on each subject separately, where 50%–70% of the data were used as train points. The reduced input was then inserted into a linear classifier and resulted with an average of 85% success classification rate without significant difference between the two methods. The dataset used by Sapsanis et al. (2013), which is publicly available in the UCI repository (UCI datasets), was further tested in several recent studies (Gu, Zhang, Zhao, & Luo, 2017; Ramírez-Martínez, Alfaro-Ponce, Pogrebnyak, Aldape-Pérez, & Argüelles-Cruz, 2019; Ruangpaisarn & Jaiyen, 2015). Gu et al. (2017) used the above dataset to compare between several machines learning algorithms for classification after calculating featured with Empirical Mode Decomposition (EDM). The compared classification methods were Neural Network, Adaptive Boosting, Linear Discriminant Analysis, Random Forest and Random Forest with PCA. Using 80% of the data from all the subjects as the training set, the Neural Network classifier resulted in accuracy of 85% while the Adaptive Boosting and the Linear Discriminant Analysis achieved only a 55% and 65% correct classification rate, respectively. An increase in the accuracy rates to 91% and 94% was achieved with the Random Forest and Random Forest with PCA, correspondingly.
In Ruangpaisarn and Jaiyen (2015), where Singular Value Decomposition (SVD) was applied for feature extraction, Naive Bayes, Radial Basis Function Network, k-nearest neighbors (KNN) and SVM trained by Sequential minimal optimization (SMO) (Platt, 1999), were compared for classification. The classification step was performed on each subject separately achieving accuracies of 91.66%, 94%, 94.77% and 98.22%, correspondingly. More recently, Ramírez-Martínez et al. (2019) examined the use of Burg reflection coefficients and tested many combinations of different feature-based datasets with machine learning algorithms for classification of the UCI hands movement dataset. A ten-fold cross-validation procedure was applied to the full dataset that held all of the subject's data. Results for a specific setting of an instance based classifier reached the accuracy of 100%, using 60 features. The results degraded a bit when the number of features was reduced.
Several other public and not-public dataset were tested in several papers, with the task being gesture or hand movement classification. In (Tang, Liu, Lv, & Sun, 2012), the authors collected EMG signals from multiple channels in order to discriminate between eleven hand gestures. In this study the dimensionality reduction step was bypassed by extracting only a small number of informative EMG features such as energy ratios and concordance correlation coefficient. A cascaded-classifier that was based on 25 train movements of each type was applied on each subject separately to the feature set and resulted in an accuracy rate of 89%. A real-time EMG recognition algorithm for the control of multifunction myoelectric hand was developed based on wavelets for feature extraction (Chu, Moon, & Mun, 2006). PCA followed by Self Organization Maps and a multi-layer neural network was used for dimensionality reduction while using data from the same subject for the train and for the test. Discrete Wavelet feature along with Artificial Neural Network classifier was applied in Mane, Kambli, Kazi, & Singh, 2015, and was optimized to each of four subjects separately that performed three basic hand movements, with an overall correct classification rate of 93%. A PCA based algorithm was also used to drive an under actuated prosthetic hand prototype having a two dimensional control input (Matrone, Cipriani, Secco, Magenes, & Carrozza, 2010) and for classification of 52 hand movements using an individual classifier that was constructed for each subject (Isaković, Miljković, & Popović, 2014).
In all of the above previous studies the computed classification algorithms, as far as we know, were tested on each subject separately, and were usually based on a relatively large number of training samples. The presented classification results in these works represent the average correct classification rates from all of the subjects. In other words, data fusion for merging between several subjects, is not performed. In order to achieve high classification rates for a single subject classification algorithm, one is required to record a large amount of training examples. In real life application, this request is not practical. Constructing a fused multi-subject representation is challenging. The variance between the signals recorded from two subjects that perform the same set of hand movements is large due to the different physical characteristics of the subjects. Thus, a simple step of merging between training data of different subjects is bound to fail. Recently, deep-learning methods were applied for creating multi-subject models.
In Côté-Allard et al. (2019), a deep-learning algorithm, which uses transfer learning, was applied for automatic hand movement classification. Time and frequency information was computed in the features extraction step. Linear Discriminant Analysis was tested for dimensionality reduction. Two datasets, which comprise of 19 and 17 participants were used as input for the deep learning method. Transfer learning implemented the data fusion between different subjects.
Other deep learning based methods were proposed for multi-subject classification (Phinyomark & Scheme, 2018; Tsinganos, Cornelis, Cornelis, Jansen, & Skodras, 2018). These methods typically compute STFT features (similarly to what is proposed in this paper), and treat them as input images for Convolutional Neural Networks.
Although deep learning based models, and in particular those which include transfer learning, are suitable for fusing samples recorded from different subjects, large amounts of data is required for the model construction. This study proposes a different approach, which is also suited for training data of limited size, and is easy to implement (when compared to deep learning), for constructing unified, multi-subject models. Implementation of the proposed model is done by solving a least square optimization problem, which is a common computational task in many numerical analysis and machine learning algorithms.
The goal of this paper was, therefore, twofold; to compare between the performances of nonlinear dimensionality techniques to standard linear dimensionality methods for single subject EMG hand movement classification. In particular, to examine the classification rates of a PCA and a diffusion maps (DM) based algorithm as the size of the training set decreases. Second, to propose an algorithm, which utilized data alignment, for classification of one subject's hand movements based on other subjects’ hand movements based on limited size of training data. This fusion procedure aimed to bypass the variance between subjects, has rarely been studied or addressed in most of the papers in this field.
Section snippets
Mathematical background
This section provides the essential mathematical background for the dimensionality reduction and data alignment techniques, which were utilized in this work.
STFT
All EMG signals that were acquired during 6 hand motions, were padded with zeros according to the signal with the maximal length in the dataset. STFT algorithm was then applied to all EMG signals using a window size of 256 with an overlap of 50% to yield the changes in the frequency distribution of the signals over time. An example of the STFT of 6 EMG signals, each represents one of the 6 hand motions is described in Fig. 4. The colors in the Fig represent the intensity (amplitude) of the
Discussion and conclusion
The research aimed to compare between linear and non-linear dimensionality reduction techniques and to introduce an alignment algorithm for fusion multi-subject data set. These methods were implemented for classification of human hand movements based on EMG signals. A data set of EMG signals recorded from five healthy subjects was taken from an open public source. EMG features were extracted by application of the STFT. PCA and DM methods were used for dimension reduction and their performances
CRediT authorship contribution statement
Neta Rabin: Conceptualization, Methodology, Writing - original draft. Maayan Kahlon: Data curation. Sarit Malayev: Data curation. Anat Ratnovsky: Conceptualization, Methodology, Writing - original draft.
Declaration of Competing Interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
References (32)
- et al.
Distance and mutual information methods for EMG feature and channel subset selection for classification of hand movements
Biomedical Signal Processing and Control
(2016) - et al.
EMG-based decoding of grasp gestures in reaching-to-grasping motions
Robotics and Autonomous Systems
(2017) - et al.
Diffusion maps
Applied and Computational Harmonic Analysis
(2006) - et al.
Geometric harmonics: A novel tool for multiscale out-of-sample extension of empirical functions
Applied and Computational Harmonic Analysis
(2006) - et al.
Parsimonious representation of nonlinear dynamical systems through manifold learning: a chemotaxis case study
Applied and Computational Harmonic Analysis
(2018) - et al.
Combined influence of forearm orientation and muscular contraction on EMG pattern recognition
Expert Systems with Applications
(2016) - et al.
Hand motion recognition from single channel surface EMG using wavelet & artificial neural network
Procedia Computer Science
(2015) - et al.
EMG feature evaluation for improving myoelectric pattern recognition robustness
Expert Systems with Applications
(2013) - et al.
A novel STFT-ranking feature of multi-channel EMG for motion pattern recognition
Expert Systems with Applications
(2015) - et al.
A real-time EMG pattern recognition system based on linear-nonlinear feature projection for a multifunction myoelectric hand
IEEE Transactions on biomedical engineering
(2006)
Deep learning for electromyographic hand gesture signal classification using transfer learning
IEEE Transactions on Neural Systems and Rehabilitation Engineering
Kernel-based sensor fusion with application to audio-visual voice activity detection
IEEE Transactions on Signal Processing
Identifying preseizure state in intracranial EEG data using diffusion kernels
Mathematical Biosciences and Engineering
Proof of concept of an online EMG-based decoding of hand postures and individual digit forces for prosthetic hand control
Frontiers in Neurology
Numerical linear algebra for applications in statistics
Cited by (67)
Using machine learning algorithms for grasp strength recognition in rehabilitation planning
2024, Results in EngineeringApplication of context-dependent interpretation of biosignals recognition to control a bionic multifunctional hand prosthesis
2024, Biocybernetics and Biomedical EngineeringDifference analysis of musculation and estimation of sEMG-to-force in process of increasing force and decreasing force
2023, Expert Systems with ApplicationsClassification of tracheal stenosis with asymmetric misclassification errors from EMG signals using an adaptive cost-sensitive learning method
2023, Biomedical Signal Processing and ControlA remote and personalised novel approach for monitoring asthma severity levels from EEG signals utilizing classification algorithms
2023, Expert Systems with ApplicationssEMG time–frequency features for hand movements classification
2022, Expert Systems with ApplicationsCitation Excerpt :Dimension reduction has been notably generalized since the introduction of non-linear techniques, which do not require the low-dimensional space to be Euclidean. A study (Rabin et al., 2020) compared PCA and Diffusion Maps (DM), when applied on STFT features. This study concluded that DM outperforms PCA when less training data is available.