Unsupervised automatic online spike sorting using reward-based online clustering

doi:10.1016/j.bspc.2019.101701

Biomedical Signal Processing and Control

Volume 56, February 2020, 101701

https://doi.org/10.1016/j.bspc.2019.101701

Highlights

• The novel method is Reward-Based Online Clustering (RBOC).
• A modified feature extraction method, Aligned First Derivative (AFD), which uses just two features, is proposed.
• Graphical results on experimental data recorded from the rat brain confirm the efficiency of the proposed method.
• Compared with similar works, the proposed method gives more accurate clustering results.

Abstract

Brain-machine interfaces (BMIs) can enable paralyzed people to regain mobility. In these interfaces, several types of signals can be obtained from the brain, one of which is the action potential waveform (spike). When spikes are used, sorting the recorded signals and isolating the contributions of individual neurons can lead to greater efficiency. Also, because of the nature of BMIs, real-time spike sorting is necessary. In many spike sorting approaches, the main outline consists of the following steps: spike detection, feature extraction, and clustering. In this study, a novel method for clustering is presented. This method, referred to as Reward-Based Online Clustering (RBOC), is based on the reinforcement learning algorithm. The significant property of the proposed technique is its capability for real-time implementation, which is required by BMIs. The method can automatically detect the clusters without prior knowledge of the number of clusters. Its performance is demonstrated through both simulation and experimental study. Evaluation with artificially simulated (ground truth) data shows that, on average, the accuracy of categorizing spikes from the same origin is above 94 percent. Moreover, applying the method to experimental data obtained from the rat brain yields convincing sorting results. Notably, in most cases, the new method outperforms similar previous works.

Introduction

One of the basic approaches to understanding brain mechanisms is monitoring the electrical activity of neurons [1], which helps decode the brain information that is vital for BMIs. These neural electrical signals are usually recorded with one or more electrodes, but each electrode records a superposition of the activity of several nearby and more distant neurons. The neurons near the electrode produce the spike train, while the more distant neurons contribute to the background noise [1]. Hence, spike sorting is required to separate the components of the recorded signal and obtain information from the brain [2]. In addition, it has been shown that even basic spike sorting is beneficial for BMI signal decoding [3].

There is a wide variety of spike sorting approaches to neuronal pattern recognition [4]. Most spike sorting algorithms comprise three major steps: spike detection, feature extraction, and clustering [5].

In spike detection, the common method is to apply a simple threshold to extract the actual spike waveforms from the raw data [4]. More complicated techniques have also been introduced to reduce harmful effects on the spike shape and to improve detection; these are based on wavelet methods, such as the wavelet transform (WT) [6] or wavelet packet decomposition (WPD) [7], and on the signal energy [8,9]. In [10], sparse coding and compressive sensing methods are introduced to solve the problem of overlapping spikes. Moreover, to simplify the spike detection step, some studies apply a mathematical transformation to the raw signal to obtain a new signal in which spikes can be identified in a simple manner [11,12].
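The simple-threshold detection described above can be sketched as follows. This is only an illustrative sketch, not the detector used in this study: the function name `detect_spikes`, the noise-scaled threshold (`k` times the median absolute value divided by 0.6745), and the `refractory` gap are all assumptions introduced for the example.

```python
import numpy as np

def detect_spikes(signal, k=4.0, refractory=30):
    """Return (spike_indices, threshold) for a simple amplitude threshold.

    Assumptions (not from the paper): the threshold is k times a robust
    noise estimate, and detections closer than `refractory` samples to the
    previous detection are ignored.
    """
    signal = np.asarray(signal, dtype=float)
    # Robust noise estimate: median absolute value scaled to a Gaussian sigma.
    sigma = np.median(np.abs(signal)) / 0.6745
    threshold = k * sigma
    above = np.abs(signal) > threshold
    # Rising edges: samples above threshold whose predecessor was below it.
    crossings = np.flatnonzero(above[1:] & ~above[:-1]) + 1
    spikes = []
    last = -refractory
    for idx in crossings:
        if idx - last >= refractory:
            spikes.append(int(idx))
            last = idx
    return spikes, threshold
```

Using the absolute value lets the same threshold catch both positive-going and negative-going spikes; a detector with separate positive and negative thresholds would replace the single comparison with two.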

Feature extraction is a mapping from the original data space to another space, and features are the properties of a cluster that distinguish it from the other clusters. Feature extraction can also map the data into a lower-dimensional space and thereby reduce the computational complexity. Another desirable property is that it makes clusters more distinct, which facilitates the clustering process [13]. The most common method for feature extraction and dimensionality reduction in the spike sorting literature is Principal Component Analysis (PCA) [1], which is mostly used in offline clustering methods. Wavelet coefficients are also suitable features for discriminating clusters, but they are complex [6,7]. Other features include the first and second derivative extrema (FSDE), which are simple shape-based features [13]. For offline applications, there are also Diffusion Map features, which use a graph of the data for feature extraction [14], and Locality Preserving Projection (LPP), which constructs a graph incorporating neighborhood information of the dataset [15]. In contrast, the FSDE features can be used in online applications. Recently, a feature extraction approach was introduced that uses shape, phase, and distribution together [16]. It seems that shape-based feature extraction is more appropriate for online applications than mapping-based methods.
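To illustrate why shape-based features suit online use, the sketch below computes a few derivative extrema from a single spike waveform. It is a hedged illustration in the spirit of FSDE [13], not a reproduction of that method or of the AFD features proposed in this paper: the function name and the particular choice of three extrema are assumptions.

```python
import numpy as np

def derivative_extrema_features(waveform):
    """Map one spike waveform to three shape-based features.

    Assumed feature set (illustrative only): maximum and minimum of the
    first discrete derivative, and maximum of the second discrete derivative.
    """
    w = np.asarray(waveform, dtype=float)
    d1 = np.diff(w)        # first discrete derivative
    d2 = np.diff(w, n=2)   # second discrete derivative
    return np.array([d1.max(), d1.min(), d2.max()])
```

Each feature needs only differences and comparisons on the samples of one spike, so a waveform is reduced to a low-dimensional point without any statistics gathered over the whole recording, which is what a mapping such as PCA would require.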

The main part of most spike sorting methods is the clustering technique. Different clustering approaches have been presented in the literature, such as k-means [7,13,14,17], expectation maximization [18,19], superparamagnetic clustering [6], landmark-based spectral clustering [15], hierarchical adaptive means (HAM) clustering [20], neural networks [8], and support vector machines (SVMs) [21]. However, none of these approaches provides generally optimal results in sorting spikes. In addition, because of the nature of BMIs, real-time spike sorting is necessary, and none of the above-mentioned clustering methods satisfies this requirement. Hence, in this study, a new method for online clustering is proposed: Reward-Based Online Clustering (RBOC), which is based on the reinforcement learning algorithm. The significant property of the proposed technique is its capability for real-time implementation, which is required by BMIs. Moreover, the method can automatically detect the clusters without prior knowledge of the number of clusters.
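What it means to cluster online without knowing the number of clusters can be shown with a generic distance-threshold ("leader") scheme. This is explicitly not the RBOC algorithm, whose update rules are not given in this excerpt; the class name, the `radius` parameter, and the running-mean centroid update are assumptions chosen only to illustrate the setting.

```python
import numpy as np

class OnlineLeaderClusterer:
    """Generic one-pass clustering; a stand-in for illustration, not RBOC."""

    def __init__(self, radius):
        self.radius = radius   # assumed maximum distance to join a cluster
        self.centroids = []    # one running-mean centroid per cluster
        self.counts = []       # number of points assigned to each cluster

    def update(self, x):
        """Assign one feature vector to a cluster and return its label."""
        x = np.asarray(x, dtype=float)
        if self.centroids:
            dists = [np.linalg.norm(x - c) for c in self.centroids]
            j = int(np.argmin(dists))
            if dists[j] <= self.radius:
                # Incremental mean update of the nearest centroid.
                self.counts[j] += 1
                self.centroids[j] += (x - self.centroids[j]) / self.counts[j]
                return j
        # No cluster is close enough: open a new one.
        self.centroids.append(x.copy())
        self.counts.append(1)
        return len(self.centroids) - 1
```

Each spike is labeled as soon as its features arrive, and new clusters appear on demand, so neither a second pass over the data nor a preset cluster count is needed.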

Reinforcement learning algorithms such as Q-learning are built on interaction with the environment and on receiving a reward or punishment [22]. The reward or punishment signal can be a physical signal, such as a banana given to a monkey for accomplishing a task [23]. It can also be a purely mathematical definition, for example the distance of an agent from its target. Such algorithms act in real time: once the algorithm starts, each experience is completed by receiving the reward or punishment signal. Because the experience can take place without a supervisor, the algorithm can run unsupervised; only the definition of the reward or punishment signal is required. This algorithm can therefore be used as an unsupervised, automatic, real-time clustering method.
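For readers unfamiliar with Q-learning, the standard tabular update can be sketched as below, together with a purely mathematical reward defined from the distance to a target. The function names, the learning rate `alpha`, and the discount factor `gamma` are illustrative assumptions; how RBOC defines its states, actions, and reward is not specified in this excerpt.

```python
import numpy as np

def distance_reward(position, target):
    """Illustrative reward: the closer to the target, the higher the reward."""
    diff = np.asarray(position, dtype=float) - np.asarray(target, dtype=float)
    return -float(np.linalg.norm(diff))

def q_learning_update(Q, s, a, r, s_next, alpha=0.1, gamma=0.9):
    """Apply one tabular Q-learning step to the table Q in place."""
    # Target: immediate reward plus discounted best value of the next state.
    td_target = r + gamma * np.max(Q[s_next])
    Q[s, a] += alpha * (td_target - Q[s, a])
    return Q
```

The update uses only the most recent transition and its reward, which is why no labeled examples or supervisor are needed.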

Each experience comes at a cost and takes considerable time. Therefore, the concept of planning helps a reinforcement learning algorithm to be faster and cheaper. In a planning algorithm, a model of the system is built alongside the experience. The simulated model makes it possible to gain virtual experiences quickly, without real interaction with the environment, which makes the algorithm faster [24].
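The idea of learning from real experience while replaying a learned model can be sketched in the style of Dyna-Q. Whether [24] refers to this particular scheme is an assumption, as are the function name, the deterministic dictionary model, and the parameter `n_planning`.

```python
import random
import numpy as np

def dyna_q_step(Q, model, s, a, r, s_next,
                n_planning=5, alpha=0.1, gamma=0.9, rng=None):
    """One real update followed by n_planning simulated updates (in place)."""
    rng = rng if rng is not None else random
    # 1) Direct learning from the real transition.
    Q[s, a] += alpha * (r + gamma * np.max(Q[s_next]) - Q[s, a])
    # 2) Model learning: remember what this state-action pair produced
    #    (assumes a deterministic environment).
    model[(s, a)] = (r, s_next)
    # 3) Planning: replay remembered transitions as virtual experience.
    for _ in range(n_planning):
        (ps, pa), (pr, pn) = rng.choice(list(model.items()))
        Q[ps, pa] += alpha * (pr + gamma * np.max(Q[pn]) - Q[ps, pa])
    return Q
```

Each real interaction is reused several times through the model, so the value estimates improve faster than the rate at which real experience arrives.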

In this study, the procedure consists of three parts: detection using a simple threshold, feature extraction with the Aligned First Derivative (AFD) method, and the new main part, an online unsupervised clustering method based on the Reward-Based Online Clustering (RBOC) approach.

The evaluation is performed with a set of simulated datasets. Each dataset simulates one trial of recording the brain signal using a single electrode. The datasets are classified into four categories based on their degree of complexity, that is, the similarity of the spikes in the spike train. Each category contains datasets at four different noise levels. After evaluation, the proposed method is applied to experimental data obtained from the rat brain using a single electrode.

Section snippets

Simulated data

The simulated datasets were first introduced in [7] and have been used frequently in previous similar works. The simulated data used in this work consist of 16 different datasets. To construct these datasets, a database of 594 different spike waveforms, created by averaging real recordings from the neocortex and basal ganglia, was used. Initially, the background noise is generated from several randomly selected spikes from the database with different random amplitudes. This noise is

Results

This section consists of two subsections. First, simulated (ground truth) datasets are used for a comparative study and to validate the proposed method. Then, the method is applied to an experimental dataset obtained from the rat brain.

Discussion

The method presented in this paper consists of three parts: spike detection, feature extraction, and clustering. We evaluated the method with a ground truth dataset that is used frequently in the spike sorting literature. The evaluation was performed in three stages.

First, only the detection part is examined. The results show that the detection part had, on average, about 2.5% false negatives and under 2% false positives. This means the detection part accomplished its task, but there were some

Conclusion

This paper presented a new method for sorting the spikes. It consists of three parts: detection and alignment of spikes, feature extraction, and automatic online clustering.

In the detection and alignment of spikes, a register for storing a set of real-time sampled data was used. Moreover, spikes were detected using two simple thresholds and aligned with their peaks or valleys.
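The combination of a sample register, two thresholds, and peak or valley alignment can be sketched as a streaming generator. This is a minimal sketch under stated assumptions rather than the authors' implementation: the register is modeled as a fixed-length `deque`, the names `pos_thr`, `neg_thr`, and `window` are hypothetical, and `window` is assumed to be odd so that the register has a center sample.

```python
from collections import deque
import numpy as np

def stream_aligned_spikes(samples, pos_thr, neg_thr, window=9):
    """Yield (center_index, waveform) with the extremum at the window center.

    A window is emitted only when its center sample exceeds one of the two
    thresholds AND is the extreme value of the register, so every emitted
    waveform is already aligned to its peak or valley.
    """
    half = window // 2
    register = deque(maxlen=window)   # holds the most recent samples
    for i, x in enumerate(samples):
        register.append(float(x))
        if len(register) < window:
            continue                  # register not yet full
        w = np.array(register)
        center = w[half]
        is_peak = center > pos_thr and center == w.max()
        is_valley = center < neg_thr and center == w.min()
        if is_peak or is_valley:
            yield i - half, w
```

Because the decision is taken when the extremum reaches the center of the register, detection and alignment happen in one pass, at the cost of a fixed delay of half a window.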

The feature extraction consists of two main parts. First, finding the CPs and calculating the maximum slope on both

Ethical standards

• The authors declare that all experiments on human subjects were conducted in accordance with the Declaration of Helsinki http://www.wma.net and that all procedures were carried out with the adequate understanding and written consent of the subjects.
• The authors also certify that formal approval to conduct the experiments described has been obtained from the human subjects review board of their institution and could be provided upon request.
• If the studies deal with animal experiments, the authors

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

References (26)

H.G. Rey et al., Past, present and future of spike sorting techniques, Brain Res. Bull. (2015)
R. Bestel et al., A novel automated spike sorting algorithm with adaptable feature extraction, J. Neurosci. Methods (2012)
E. Hulata et al., A method for spike sorting and detection based on wavelet packets and Shannon’s mutual information, J. Neurosci. Methods (2002)
S.E. Paraskevopoulou et al., Feature extraction using first and second derivative extrema (FSDE) for real-time and hardware-efficient spike sorting, J. Neurosci. Methods (2013)
T. Nguyen et al., Automatic spike sorting by unsupervised clustering with diffusion maps and silhouettes, Neurocomputing (2015)
T. Nguyen et al., Spike sorting using locality preserving projection with gap statistics and landmark-based spectral clustering, J. Neurosci. Methods (2014)
S.E. Paraskevopoulou et al., Hierarchical Adaptive Means (HAM) clustering for hardware-efficient, unsupervised and real-time spike sorting, J. Neurosci. Methods (2014)
K.R. Vanga et al., A Review on Recording of Neural Signal to Monitor Brain Activity (2019)
S. Todorova et al., To sort or not to sort: the impact of spike-sorting on neural decoding performance, J. Neural Eng. (2014)
E.N. Brown et al., Multiple neural spike train data analysis: state-of-the-art and future challenges, Nat. Neurosci. (2004)
R.Q. Quiroga et al., Unsupervised spike detection and sorting with wavelets and superparamagnetic clustering, Neural Comput. (2004)
K.H. Kim et al., Neural spike sorting under nearly 0-dB signal-to-noise ratio using nonlinear energy operator and artificial neural-network classifier, IEEE Trans. Biomed. Eng. (2000)
J.H. Choi et al., A new action potential detector using the MTEO and its effects on spike sorting systems at low signal-to-noise ratios, IEEE Trans. Biomed. Eng. (2006)
