Multivariate denoising using wavelets and principal component analysis

doi:10.1016/j.csda.2004.12.010

Computational Statistics & Data Analysis

Volume 50, Issue 9, 1 May 2006, Pages 2381-2398

https://doi.org/10.1016/j.csda.2004.12.010 Get rights and content

Abstract

A multivariate extension of the well known wavelet denoising procedure widely examined for scalar valued signals, is proposed. It combines a straightforward multivariate generalization of a classical one and principal component analysis. This new procedure exhibits promising behavior on classical bench signals and the associated estimator is found to be near minimax in the one-dimensional sense, for Besov balls. The method is finally illustrated by an application to multichannel neural recordings.

Introduction

On one hand, denoising algorithms based on wavelet decompositions are a popular method for one-dimensional statistical signal extraction and filtering. On the other hand, principal component analysis (PCA) is among the most notorious data-analysis tools designed to simplify multidimensional data by tracking new factors supposed to capture the main features.

This paper proposes a multivariate extension of wavelet denoising procedures, combining a straightforward multivariate generalization of the classical one for scalar valued signals and principal component analysis. This proposal takes place among the various recent approaches combining wavelet strategies and data analytic tools to cope with the problem of feature extraction in regression models. Numerous applied situations strongly motivate this interest. Let us mention some of them together with some references focusing on a wavelet-data analysis approach: spectral calibration problems (Vannucci et al., 2003), multivariate statistical process control (see Bakshi, 1998; Teppola and Minkkinen, 2000), blind source separation (Roberts et al., 2004), functional magnetic resonance imaging (fMRI) analysis (Meyer and Chinrungrueng, 2003), spike detection and sorting (Oweiss and Anderson, 2001).

This paper focuses on multivariate wavelet denoising and deals with regression models of the form $X (t) = F (t) + ε (t)$ , where the observation X is p-dimensional, F is deterministic and is the signal to be recovered and $ε$ is a spatially correlated noise. This kind of model is well suited for situations for which such an additive spatially correlated noise is realistic. For example, a longitudinal study on p subjects, the analysis of a part a fMRI region (involving p voxels) or the noise reduction in multichannel neural recordings (using p channels). Let us be a little bit more precise on this last example which is chosen to illustrate the behavior of the proposed procedure on a real world data set, at the end of the paper. Following Oweiss and Anderson (2001), extra-cellular neural recordings can be modeled as an invariant deterministic signal and an additive noise which obscures neural discharges from cells of interest. This noise contains a component exhibiting spatial correlation coming from background activity caused by neural cells.

To close this introduction, let us recall some facts about classical univariate wavelet denoising dealing both with signal processing and functional estimation in statistics and which is of interest in various applied fields. Valuable references are the books (Mallat, 1998, Percival and Walden, 2000, Vidakovic, 1999) and the survey paper (Antoniadis, 1997). For basics on wavelets, we refer the reader to Mallat (1998) or Misiti et al. (2003) for example.

The simplest considered model is of the following form: $X (t) = f (t) + ε (t), t = 1, \dots, n,$ where $(X (t))_{1 ⩽ t ⩽ n}$ is observed, $(ε (t))_{1 ⩽ t ⩽ n}$ is a centered Gaussian white noise of unknown variance $σ^{2}$ and f is an unknown function to be recovered through the observations.

For a given orthogonal wavelet basis denoted by $({(φ_{J, k})}_{k \in Z}, {(ψ_{j, k})}_{1 ⩽ j ⩽ J, k \in Z})$ where $ψ$ is a wavelet, $φ$ the associated scaling function, J a suitably chosen decomposition level and where $g_{j, k} (x) = 2^{- j / 2} g (2^{- j} x - k)$ , wavelet denoising proceeds in three steps:

$•$
Step 1: Compute the wavelet decomposition of the observed signal up to level J;
$•$
Step 2: Threshold conveniently the wavelet detail coefficients;
$•$
Step 3: Reconstruct a denoised version of the original signal, from the thresholded detail coefficients and the approximation coefficients, using the inverse wavelet transform.

Various strategies are available (see the survey paper Antoniadis et al., 2001) to perform this task and the asymptotic performance of the associated estimators is the minimax one up to a logarithmic factor, for large classes of functions simultaneously (let us mention that block thresholding, see Hall et al., 1999, could be used to remove the logarithmic factor). For simplicity and since only relative performance between the proposed multivariate procedures are of interest, we restrict our attention to the so-called universal threshold (introduced by Donoho and Johnstone, 1994) which is of the form

\hat{σ} \sqrt{2 \log (n)}

, where

\hat{σ}

is an estimator of

σ

based on the detail coefficients at level 1 (the finest one). Such methods are effective because functions f belonging to various general classes are such that they admit a sparse wavelet representation (Kerkyacharian and Picard, 2000). So the energy of f is mainly concentrated in a few large wavelet coefficients which are adaptively selected by this procedure since the coefficients below the threshold are attributable to the additive noise.

This paper dedicated to a multivariate denoising procedure that takes into account the correlation structure of the noise, is organized in two main sections. Section 2 proposes a first denoising procedure which is a direct generalization of the one-dimensional strategy. The method is based on a change of basis followed by a classical one-dimensional soft-thresholding. This new procedure exhibits promising behavior on classical test signals and the associated estimator is found to be near minimax in the one-dimensional sense, for Besov balls. The change of basis is obtained from the diagonalization of a robust estimate of the noise covariance matrix given by the minimum covariance determinant estimator based on the matrix of finest details.

Section 3 first recalls the multiscale PCA proposed by Bakshi (1998) for statistical process control purposes. This scheme is discussed and then a second denoising procedure combining wavelets and PCA is proposed. The introduction of a PCA step try to take advantage of the deterministic relationships between the signals, leading to an additional denoising effect. It is then illustrated by some simulation examples and by an application to multichannel neural recordings.

Section snippets

Procedure

Let us consider the following p-dimensional model: $X (t) = f (t) + ε (t), t = 1, \dots, n,$ where $X (t), f (t), ε (t)$ are of size $1 \times p$ and $(ε (t))_{1 ⩽ t ⩽ n}$ is a centered Gaussian white noise with unknown covariance matrix $E (ε (t)^{T} ε (t)) = Σ_{ε}$ . Each component of $X (t)$ is of the previous form (1), for $1 ⩽ i ⩽ p$ : $X^{i} (t) = f^{i} (t) + ε^{i} (t), t = 1, \dots, n,$ where $f^{i}$ belongs to some functional space (typically $L^{2}$ or Besov spaces).

The covariance matrix $Σ_{ε}$ , supposed to be positive definite, captures the stochastic link between the components of $X (t)$ and models

Multivariate wavelet denoising using PCA

The multivariate procedure previously examined can be generalized by looking at the deterministic relationships between the p signals. The idea is to use principal component analysis, not to discover new variables which could be of interest, but to kill unsignificant principal components to obtain an additional denoising effect.

In this section, we first recall the multiscale PCA proposed by Bakshi (1998) in another context and we discuss it from the denoising perspective. Next, a second

Conclusion

We have proposed a multivariate denoising procedure combining wavelets and PCA, that takes into account the correlation structure of the noise. This new procedure exhibits promising behavior on classical bench signals and seems to perform well when it is applied to multichannel neural recordings, the real world example which illustrates the method.

This work could be extended in various directions, let us mention some of them for future work. First, the way to select the parameters of the

Acknowledgements

The authors thank Anestis Antoniadis for valuable discussions, Karim Oweiss and Yasir Suhail for making available to us the multichannel neural recordings that we used to illustrate our method here and the three anonymous referees for helpful comments and suggestions.

References (22)

S. Bierer et al.
Multi-channel spike detection and sorting using an array processing technique
Neurocomputing
(1999)
K. Oweiss et al.
Noise reduction in multichannel neural recordings using a new array wavelet denoising algorithm
Neurocomputing
(2001)
S. Roberts et al.
Hierarchy, priors and waveletsstructure and signal modelling using ICA
Signal Process.
(2004)
M. Vannucci et al.
A decision theoretical approach to wavelet regression on curves with a high number of regressors
J. Statist. Plann. Inference
(2003)
A. Antoniadis
Wavelet in statisticsa review
J. Ital. Statist. Soc.
(1997)
A. Antoniadis et al.
Wavelet estimators in nonparametric regressiona comparative simulation study
J. Statist. Software
(2001)
B. Bakshi
Multiscale PCA with application to MSPC monitoring
AIChE J.
(1998)
I. Daubechies
Ten Lectures on Wavelets
(1992)
R. DeVore et al.
Constructive Approximation
(1993)
D. Donoho
De-noising by soft-thresholding
IEEE Trans. Inform. Theory
(1995)

D. Donoho et al.

Ideal spatial adaptation by wavelet shrinkage

Biometrika

(1994)

Cited by (169)

The Msegram: A useful multichannel feature synchronous extraction tool for detecting rolling bearing faults
2023, Mechanical Systems and Signal Processing
Citation Excerpt :
However, with the continuous growth of big data, an urgent task turns out to be the powerful methods to multichannel signal simultaneous processing for the high efficiency and effectiveness of data cleaning and digging. Hereinto, Aminghafari studied multivariate wavelet denoising (MWD) with principal component analysis for multichannel denoising [19]. Rehman proposed multivariate empirical mode decomposition (MEMD) for synchronized processing of multichannel signals and adaptive decompositions [20].
Multichannel signals collected by multiple sensors contain richer condition information of equipment than single-channel signals. However, such issues as simultaneous denoising, adaptive decomposition and synchronous extraction are still challenging for multichannel signals, which are beneficial to accurate fault diagnosis. Thus, a useful multichannel feature synchronous extraction tool is proposed for detecting rolling bearing faults, named as Msegram. First, a tensor synchronization denoising method based on high order singular value decomposition (HOSVD) is proposed for multichannel signal preprocessing. Original multichannel signals of testing bearings are constructed to be a third-order tensor by phase space reconstruction. Hereinto, a singular entropy increment is adopted to determine a reasonable singular order for each unfolding, and an optimal core tensor is obtained for local reconstruction analysis. Second, multi-layer K-value multivariate variational mode decomposition (MVMD) is designed after the multichannel noise reduction to realize synchronous adaptive filtering and decomposition for the multichannel signals. Third, inspired by the idea of the spectral kurtosis, a tower-shaped crest factor of envelope spectrum (EC) diagram similar to Fast Kurtogram (FK) is proposed to visualize the output of multichannel bearing fault feature results. According to the tower-shaped EC diagram with the maximum fault crest factor, the optimal analytic results of multichannel signals are selected and output to synchronously extract bearing fault features. Finally, repeatable simulations and two experimental fault cases of rolling bearings are implemented to demonstrate the practicability and effectiveness of the proposed method. The results show that the proposed method can successfully reveal the compound faults from experimental bearing and effectively identify the compound faults from locomotive wheelset bearing.
Recognition of lower limb movements using empirical mode decomposition and k-nearest neighbor entropy estimator with surface electromyogram signals
2022, Biomedical Signal Processing and Control
Lower limb movement recognition is critical to the daily care of the elderly, the weak, and the disabled. Surface electromyogram (sEMG) signals reflect the intention of human movements and can be used as the source of lower limb movement recognition. However, sEMG signals exhibit low stability due to electrode displacement, muscle structure differences, and muscle contraction strength. The effective extraction of features from sEMG signals is considered a difficult problem in the studies on sEMG signals-based lower limb movement recognition. In this work, we proposed a novel method of lower limb motion recognition based on empirical mode decomposition (EMD) and k-Nearest Neighbor entropy (KNN-En) estimator. First, the sEMG signals of four lower limb movements from twenty subjects were recorded with seven wearable sEMG signal sensors, and the sEMG signals were denoised through the multi-scale principal component analysis (MSPCA). Then, the sEMG signals were decomposed by EMD into multiple intrinsic mode functions (IMF), and the KNN-En estimator features were extracted from the IMF. Next, the KNN-En estimator features were projected into low-dimensional spaces by three feature projection techniques, namely principal component analysis (PCA), isometric mapping (Isomap), and diffusion mapping (DM). Finally, the four lower limb movements were recognized by three machine learning classifiers, namely support vector machine (SVM), k-nearest neighbor (KNN), and Bagging. The experimental results showed that the combination of the SVM classifier and the DM method exhibited excellent recognition performance and an accuracy of 99.63%, thereby proving the feasibility of the proposed method in lower limb motion recognition.
Automated interface detection in liquid-liquid systems using self-calibrating ultrasonic sensor
2021, Chemical Engineering Science
Detection of liquid-liquid interface is critical in solvent extraction equipment/contactors typically used in petroleum and hydrometallurgical industries. An instrumentation system, comprising of a self-calibrating ultrasonic sensor and a novel signal processing algorithm is reported. It can be used to estimate location of liquid-liquid interface along with speed of sound in both the liquids without any manual/user intervention. The algorithm involves use of discrete wavelet transform and matched filter. The system has been tested at laboratory scale under wide range of industrially relevant conditions. The system works for any pair of liquid without apriori knowledge of their acoustic characteristics, is insensitive to changes in temperature and/or compositions. It also works for dirty interfaces contaminated with particulate matter or emulsion. The above mentioned features truly impart the sensor system “deploy-and-forget” capabilities. The range, accuracy and resolution of the system, with respect to interface location, are 40–200 mm, 0.45% FS (Full-scale) and less than 1 mm, respectively.
Meander Statistics Toolbox (MStaT): A toolbox for geometry characterization of bends in large meandering channels
2021, SoftwareX
This contribution presents MStaT, a wavelet-based open-source software developed to provide a detailed characterization of large meandering river morphodynamics. MStaT integrates three independent modules: (i) meandering morphometrics module; (ii) migration module; and (iii) confluence module. MStaT delivers a short and medium-term framework to analyze the river centerline and valley-meandering channel interrelationship at low computational cost. It provides quantitative information on the spatial distribution of the arc-wavelength, migration rates, cutoffs events, and tributary channels influences. Data are presented through a user-friendly graphical user interface that makes the output interpretation easier, and that is freely available to the communities of river morphodynamics scientists and engineers.
Fully Multivariate Detrended Fluctuation Analysis Using Mahalanobis Norm with Application to Multivariate Signal Denoising
2023, SSRN
Gaussian process-based quasi-coherent noise suppression in magnetic confinement devices with superconductors
2023, Nuclear Fusion

View all citing articles on Scopus

View full text

Multivariate denoising using wavelets and principal component analysis

Abstract

Introduction

Section snippets

Procedure

Multivariate wavelet denoising using PCA

Conclusion

Acknowledgements

Neurocomputing

Neurocomputing

Signal Process.

J. Statist. Plann. Inference

Wavelet in statisticsa review

J. Ital. Statist. Soc.

Wavelet estimators in nonparametric regressiona comparative simulation study

J. Statist. Software

Multiscale PCA with application to MSPC monitoring

AIChE J.

Ten Lectures on Wavelets

Constructive Approximation

De-noising by soft-thresholding

IEEE Trans. Inform. Theory

Ideal spatial adaptation by wavelet shrinkage

Biometrika