An adaptive thresholding method for the wavelet based denoising of phonocardiogram signal

doi:10.1016/j.bspc.2017.07.002

Biomedical Signal Processing and Control

Volume 38, September 2017, Pages 388-399

https://doi.org/10.1016/j.bspc.2017.07.002 Get rights and content

Highlights

•
An adaptive threshold estimation method for wavelet based denoising of phonocardiography signal.
•
The proposed method is adaptive to the level of noise present in the signal.
•
New statistical parameter is proposed based on domain knowledge about the PCG signal.
•
A non-linear mid threshold function is also proposed, and its parameters are optimized.
•
The proposed method efficiently suppresses various types of noise and murmur sounds.

Abstract

Segmentation of the phonocardiography (PCG) signal into cardiac cycles is a primary task for the diagnosis of cardiovascular diseases. However, PCG is highly susceptible to noise, and extra sound called murmur may also be present in the PCG signal due to pathology. These components cause difficulties in the segmentation and therefore, segmentation is often preceded by the denoising of the PCG signal to emphasize the fundamental heart sounds S1 and S2, by removing these unwanted components. For the denoising of the PCG signal, discrete wavelet transform (DWT) based algorithms have shown good performance because such algorithms suppress in-band noise besides the out-of-band noise. Selection of threshold value and threshold function significantly affects the performance of these algorithms. In this paper, for threshold value estimation, an adaptive method based on statistical parameters of the given PCG signal is proposed. The statistical parameters are found to be highly effective for this purpose. We also propose a new threshold function, non-linear mid function, to address the issues of SNR and transients in the existing threshold functions, soft and hard. The proposed method is applied on a large number of PCG signals with additive white Gaussian noise, red noise, and pink noise. The Performance of the proposed method is also evaluated on the PCG signals recorded in real-life noisy scenarios and signals with murmur sound. The obtained results show that the proposed method is significantly superior to the competitive algorithms.

Introduction

Heart valvular diseases are manifested into heart sound signal much earlier than they cause serious damage to the heart [1]. It is because the genesis of the heart sound components is related to the movement of the heart valves [2]. Phonocardiography is the most widely used technique for the heart sound signal analysis because of its user-friendly features such as easy to use, timeless set-up process, and cost effectiveness [3], [4], [5]. It uses a sensor called electronic stethoscope, which acquires the heart sound signal and converts it into an electrical form using a microphone. Thus, analysis of the PCG signal is of paramount importance to diagnose heart valvular diseases at an early stage. However, availability of only a small number of experts in this field motivated researchers to develop algorithms for automatic analysis of the PCG signal.

Automatic analysis of the PCG signal is generally performed in two steps; segmentation and classification [6], [7]. In the segmentation step, fundamental heart sounds (FHS), S1 and S2, are identified and then the signal is segmented into systole and diastole periods [7]. Systole is the time duration from S1 to S2 and the diastole is the time duration from S2 to next S1 [6]. In the classification step, the PCG signal is classified as normal or abnormal having a particular disease. Thus, the primary task in the PCG signal analysis is its segmentation [7]. However, PCG signal is highly susceptible to various noises generated due to the motion of subject, subject's own speech and other's speech, movement of the stethoscope used to acquire the PCG signal, lung sounds, and ambient sources [4], [7], [8]. Furthermore, pathologies may cause the presence of extra sound in the PCG signal, called as murmur [6]. The presence of these components makes the segmentation task difficult. Therefore, to emphasize the S1 and S2, segmentation is often preceded by denoising to minimize the contamination level of noise and to remove the murmurs from the signal [9], [10]. However, the presence of murmurs in systole or diastole period and their time-frequency characteristics leads to a diagnosis of heart valvular diseases [11], [12]. For example, murmur due to aortic stenosis, mitral regurgitation and pulmonary stenosis occur in systole period and aortic regurgitation and mitral stenosis occur in diastolic period [6]. The features related to the murmurs can be extracted from the segmented cardiac cycles more efficiently as compared to the signal without segmented cardiac cycles [10]. Therefore, once the PCG signal is segmented into cardiac cycles, pathological features, if present, are extracted from these cycles [10].

Various denoising algorithms for the PCG signal in time domain and frequency domain have been proposed [3]. In the time domain, denoising algorithms have been proposed based on conventional filters such as Chebyshev IIR filter [13], adaptive noise canceller (ANC), and autocorrelation method [14]. The conventional filters are limited to suppress the noise which is out of the frequency band of the signal components. On the other hand, ANC based algorithms such as least mean square (LMS) [15], [16], suppress the noise in an adaptive manner and, hence, suppress in-band noise as well. The major drawback of the ANC algorithms is that they need reference (noise source) signal, which is not available in most cases of the real-life scenarios. Manikandan and Soman [14], proposed a computationally efficient denoising algorithm based on lag-1 autocorrelation method [14]. However, performance of these algorithms significantly degrades as the level of noise increases.

In the frequency domain based denoising algorithms, the time domain signal is first transformed into the frequency domain using a specific transform function such as Fourier transform and wavelet transform (WT), and then the transformed signal is processed. Analysis of the PCG signal in frequency domain provides the information about spectral characteristics of the components presented in the signal and, hence, more efficiency in noise removal can be obtained as compared to the time domain. Sanei et. al. proposed an approach to separate the murmurs from the PCG signal using singular spectrum analysis [11]. In another approach, murmurs were removed from the PCG signal using constrained tunable-Q wavelet transform [17]. However, both the algorithms require high computational time.

The most widely used method for the denoising of the PCG signal is based on the DWT [8], [18], [19] due to the fact that the DWT coefficients of the PCG signal components will be large and they will be confined to specific frequency band, while the coefficients for the noise components will have small amplitude and scattered in different frequency bands [20]. Thus, denoising can be achieved by suppressing the small coefficients. However, the performance of the DWT based denoising algorithm significantly depends on the choice of the parameters: (1) Mother wavelet, (2) Number of decomposition levels and the levels to be processed, (3) Threshold value, and (4) Threshold function [8].

For the denoising, mother wavelet should be orthogonal, which allows perfect reconstruction of the signal [5]. For the PCG signal denoising, various orthogonal wavelets have been suggested such as Coiflet [18], [21], Symlet [12] and Daubechies [2], [22]. In [4], Chourasia et al. developed a new wavelet for PCG signal of a foetus.

The second parameter is the number of decomposition levels. The number of decomposition levels should be selected precisely such that the useful signal components and unwanted components lie in different levels. The frequency range of each level depends on the sampling frequency of the signal. As the sampling frequency increases the frequency range of particular level increases [23]. Therefore, different choices for a number of decomposition levels are reported in the literature [5], [8]. After the decomposition, levels to be processed should be chosen appropriately. In literature, most of the algorithms processed all the decomposed levels [5], [8], [21], [24], which requires unnecessary high computation. In other approaches, the signal is reconstructed using coefficients at a few selected levels, while discarding others [2], [12], [22]. These approaches remove only out-of-band noise. To suppress the in-band noise, the levels associated with the signal components should also be processed [18]. Researchers have also proposed algorithms for the appropriate selection of the levels based on the energy and frequency range of the PCG signal [25], [26].

The third parameter, threshold value, plays a crucial role in DWT based denoising. A large value of threshold affects the useful signal components, while a low threshold value will be ineffective to suppress the unwanted signal components [9]. For the PCG signal, mostly used threshold estimation methods are ‘rigrsure’ [4], [18], [27], ‘heursure’ [5], [8], ‘sqtwolog’ [24], and ‘minimaxi’ [8], [21]. The ‘sqtwolog’ is a fixed form method and does not take into account the content of the signal, but only depends on the length of the signal [4]. It provides a threshold value larger than other methods and hence it may result into over thresholding. ‘minimaxi’ is also a fixed form threshold method, in which the threshold value is estimated such that the maximum risk of estimation error is minimized [5]. ‘rigrsure’ method determines a threshold value to minimize the Steins unbiased risk estimation (SURE). ‘rigrsure’ and ‘minimaxi’ methods estimates threshold value to minimize the risk estimation and results into low threshold value [28]. ‘heursure’ method selects one of the methods from the ‘sqtwolog’ and ‘rigrsure’ methods, based on the comparison between the SURE estimation and SNR [5]. Naseri and Homaeinezhad [22] devised a threshold estimation method based on the weighted variance of the noise while Kumar and Saha [29] calculated the threshold as the 20% of the weighted maximum energy of the coefficient vector. However, values of parameters used in these methods were obtained heuristically.

The fourth parameter is the threshold function, which defines the way to treat the wavelet coefficients using the estimated threshold value. Soft and hard are two existing threshold functions used extensively for the denoising of PCG signals [4], [18], [21]. In soft threshold function, the coefficients lower than the threshold are replaced by zeros while other coefficients get shrank by the threshold value [4]. In hard threshold function, the coefficients lower than the threshold are replaced by zeros, as in soft threshold, while larger coefficients remain unchanged. Hard threshold function may cause discontinuities in the reconstructed signal and make it oscillating [5]. In soft threshold function, shrinkage of the wavelet coefficients by threshold reduces the effect of singularities and transients that cannot be addressed by the hard threshold function [30]. However, hard threshold function produces larger SNR value than the soft threshold function. Zhao [27] proposed a generalized threshold function although it needs a selection of parameter, which controls the performance of the algorithm.

To address the issues discussed above related to threshold estimation and threshold function, we propose a new DWT based denoising algorithm for the PCG signals. We used ‘Coif-5’ wavelet as mother wavelet and performed five levels of decomposition of the PCG signal sampled at 2 kHz. When a signal with 2 kHz sampling frequency is decomposed, the 4th and 5th levels cover the frequency range 31–125 Hz, approximately [31], and hence cover most of the frequency range of the S1 and S2, which is 25–120 Hz [31]. Therefore, in the proposed algorithm, only these two levels are processed. Removal of the coefficients of lower detailed levels also removes the out-of-band noise. To further improve the performance of the denoising, we propose a novel adaptive threshold estimation method using statistical properties of the DWT coefficients. The proposed method uses the domain knowledge that the sum of the length of the S1 and S2 remains less than 25% of the length of a cardiac cycle [32], [33]. Therefore, a new parameter, med₇₅ is calculated instead of traditional median value. The med₇₅ represents the 75th percentile value in the sorted absolute values of a coefficient vector in ascending order. Further, to address the issue of threshold function, we also propose a new method called as ‘non-linear mid’ function for the PCG signal. Furthermore, its parameters are optimized using the genetic algorithm, to improve its performance for the PCG signals. Thus, the proposed algorithm adaptively shrinks the wavelet coefficients of the PCG signal.

Rest of the paper is organised as follow; The proposed denoising algorithm is described in Section 2. The obtained results of the proposed method and its comparison with state-of-the-art methods are discussed in Section 3. Section 4 concludes the result of the paper.

Section snippets

Methodology

The proposed algorithm calculates a threshold value adaptively and the same is applied to the wavelet coefficients of selected levels. The proposed algorithm performs in three steps as described below.

Results and discussion

Experiments are performed on the PCG signal contaminated with simulated white Gaussian noise, and presented in Section 3.1. For the noise simulation, pink and red noise models are also considered because Gradolewski and Redlarski [8] observed that the characteristics of real-life noise are similar to pink and red noises. Obtained results for these noises are presented in Section 3.2. Furthermore, the performance of the proposed method is also analysed on the PCG signal acquired in real-life

Conclusions

In this paper, we have proposed a new threshold estimation method for the DWT based denoising of the heart sound signal. For the estimation of the threshold value, the proposed method uses the domain knowledge about the heart sound signal that the sum of the length of the S1 and S2 remains less than 25% of the length of a cardiac cycle. We also proposed non-linear mid threshold function and optimised its parameter for the PCG signal. The performance of the proposed method is analysed for

References (43)

L.H. Cherif et al.
Choice of the wavelet analyzing in the phonocardiogram signal analysis using the discrete and the packet wavelet transform
Expert Syst. Appl.
(2010)
V.S. Chourasia et al.
A novel approach for phonocardiographic signals processing to make possible fetal heart rate evaluations
Digit. Signal Process.
(2014)
S.R. Messer et al.
Optimal wavelet denoising for phonocardiograms
Microelectron. J.
(2001)
P.K. Jain et al.
Heart monitoring systems – a review
Comput. Biol. Med.
(2014)
A. Moukadem et al.
A robust heart sounds segmentation module based on s-transform
Biomed. Signal Process. Control
(2013)
D. Gradolewski et al.
Wavelet-based denoising method for real phonocardiography signal recorded by mobile devices in noisy environment
Comput. Biol. Med.
(2014)
H. Naseri et al.
Noise/spike detection in phonocardiogram signal as a cyclic random process with non-stationary period interval
Comput. Biol. Med.
(2013)
S. Patidar et al.
Classification of cardiac sound signals using constrained tunable-q wavelet transform
Expert Syst. Appl.
(2014)
S. Sanei et al.
An adaptive singular spectrum analysis approach to murmur detection from heart sounds
Med. Eng. Phys.
(2011)
Z. Dokur et al.
Heart sound classification using wavelet transform and incremental self-organizing map
Digit. Signal Process.
(2008)

S. Patidar et al.

Segmentation of cardiac sound signals by removing murmurs using constrained tunable-q wavelet transform

Biomed. Signal Process. Control

(2013)

S. Vaisman et al.

Passive fetal monitoring by adaptive wavelet denoising method

Comput. Biol. Med.

(2012)

F. Liu et al.

Research and implementation of heart sound denoising

Phys. Proc.

(2012)

F. Safara et al.

Multi-level basis selection of wavelet packet decomposition tree for heart sound classification

Comput. Biol. Med.

(2013)

S. Choi

Detection of valvular heart disorders using wavelet packet decomposition and support vector machine

Expert Syst. Appl.

(2008)

S. Debbal et al.

Computerized heart sounds analysis

Comput. Biol. Med.

(2008)

J. Yuan et al.

Gear fault detection using customized multiwavelet lifting schemes

Mech. Syst. Signal Process.

(2010)

C. Ahlstrom et al.

Feature extraction for systolic heart murmur classification

Ann. Biomed. Eng.

(2006)

S. Leng et al.

The electronic stethoscope

BioMed. Eng. Online

(2015)

Y.-W. Bai et al.

The embedded digital stethoscope uses the adaptive noise cancellation filter and the type i chebyshev iir bandpass filter to reduce the noise of the heart sound

M. Sabarimalai Manikandan et al.

Robust heart sound activity detection in noisy environments

Electron. Lett.

(2010)

Cited by (70)

Automated valvular heart disease detection using heart sound with a deep learning algorithm
2024, IJC Heart and Vasculature
Insufficient clinicians' auscultation ability delays the diagnosis and treatment of valvular heart disease (VHD); artificial intelligence provides a solution to compensate for the insufficiency in auscultation ability by distinguishing between heart murmurs and normal heart sounds. However, whether artificial intelligence can automatically diagnose VHD remains unknown. Our objective was to use deep learning to process and compare raw heart sound data to identify patients with VHD requiring intervention.
Heart sounds from patients with VHD and healthy controls were collected using an electronic stethoscope. Echocardiographic findings were used as the gold standard for this study. According to the chronological order of enrollment, the early-enrolled samples were used to train the deep learning model, and the late-enrollment samples were used to validate the results.
The final study population comprised 499 patients (354 in the algorithm training group and 145 in the result validation group). The sensitivity, specificity, and accuracy of the deep-learning model for identifying various VHDs ranged from 71.4 to 100.0%, 83.5–100.0%, and 84.1–100.0%, respectively; the best diagnostic performance was observed for mitral stenosis, with a sensitivity of 100.0% (31.0–100.0%), a specificity of 100% (96.7–100.0%), and an accuracy of 100% (97.5–100.0%).
Based on raw heart sound data, the deep learning model effectively identifies patients with various types of VHD who require intervention and assists in the screening, diagnosis, and follow-up of VHD.
A denoising method of ECG signal based on variational autoencoder and masked convolution
2023, Journal of Electrocardiology
Wearable electrocardiogram (ECG) equipment can realize continuous monitoring of cardiovascular diseases, but these devices are more susceptible to interference from various noises, which will seriously reduce the diagnostic correctness. In this work, a novel noise reduction model for ECG signals is proposed based on variational autoencoder and masked convolution. The variational Bayesian inference is conducted to capture the global features of the ECG signals by encouraging the approximate posterior of the latent variables to fit the prior distribution, and we use the skip connection and feature concatenation to realize the information interaction across the channels. To strengthen the connection of local features of the ECG signals, the masked convolution module is used to extract local feature information, which supplement the global features and the noise reduction performance of whole model can be greatly improved. Experiments are carried out on the MIT-BIH arrythmia database, and the results display that the performance metrics of signal-to-noise ratio (SNR) and root mean square error (RMSE) are significantly improved compared with other approaches while causing less signal distortion.
Exploring the impact of noise and degradations on heart sound classification models
2023, Biomedical Signal Processing and Control
The development of data-driven heart sound classification models has been an active area of research in recent years. To develop such data-driven models in the first place, heart sound signals need to be captured using a signal acquisition device. However, it is almost impossible to capture noise-free heart sound signals due to the presence of internal and external noises in most situations. Such noises and degradations in heart sound signals can potentially reduce the accuracy of data-driven classification models. Although different techniques have been proposed in the literature to address the noise issue, how and to what extent different noise and degradations in heart sound signals impact the accuracy of data-driven classification models remains unexplored. To answer this question, we produced a synthetic heart sound dataset including normal and abnormal heart sounds contaminated with a variety of noise and degradations. We used this dataset to investigate the impact of noise and degradation in heart sound recordings on the performance of different classification models. The results show different noises and degradations affect the performance of heart sound classification models to a different extent; some are more problematic for classification models, and others are less destructive. Comparing the findings of this study with the results of a survey we previously carried out with a group of clinicians shows noise and degradations that are more detrimental to classification models are also more disruptive to accurate auscultation. The findings of this study can be leveraged to develop targeted heart sound quality enhancement approaches — which adapt the type and aggressiveness of quality enhancement based on the characteristics of noise and degradation in heart sound signals.
Wavelet-domain group-sparse denoising method for ECG signals
2023, Biomedical Signal Processing and Control
In this paper, a group-sparse signal denoising approach is proposed, and both non-convex regularization and sparsity characteristics in wavelet domain are incorporated to estimate the electrocardiogram (ECG) signals with noise. To strongly promote wavelet sparsity, a parameterized non-convex penalty function is introduced, and the interval for the parameter is identified to guarantee the strict convexity of the total cost function. To retain the details of ECG signals, all the wavelet coefficients are estimated by minimizing certain single objective function, and thus the insignificant coefficients that do not survive wavelet thresholding can be maintained. The algorithm is solved based on the majorization–minimization optimization method, the alternating direction method of multipliers, and proximal method. The real collected ECG signals and MIT-BIH arrhythmia database are used to evaluate the effectiveness of the proposed wavelet-domain group-sparse method (WDGS) for ECG signal enhancement. By qualitative and quantitative analysis, it shows that our method can effectively suppress the undesired noise and keep the important morphology of ECG signals.
Improving purity of blasting vibration signals using advanced Empirical Mode Decomposition and Wavelet packet technique
2022, Applied Acoustics
Citation Excerpt :
Wavelet packet threshold denoising (WPTD) has been developed to reduce the noise components for undecomposed and decomposed acoustic signals. Compared to the widely used Wavelet threshold denoising [13,28], the technique allows a higher frequency resolution for acoustic signal and thus a better denoising capability [29]. where |·| is the absolute value, x is the decomposition factor and T is the thresholding value.
The study proposes a denoising method based on Complete Ensemble Empirical Mode Decomposition with Adaptive Noise (CEEMDAN) and Wavelet packet threshold denoising (WPTD) to reduce the noise of blasting vibration signals. CEEMDAN is first employed to decompose noisy blasting vibration signals into different intrinsic mode functions (IMFs), i.e., different frequencies of subcomponents. By using correlation coefficients, all of IMFs are automatically categorized as the dominant IMFs with higher portions of vibration components and the secondary IMFs with less vibration contents. Then, with further decomposing two groups of IMFs into different scales of Wavelet packet coefficient series, WPTD based on the fixed-threshold technique and the soft threshold function method is applied to minimize the level of noise in two groups of IMFs. Two groups of denoised IMFs are finally reconstructed as the pure blasting vibration waveform. Numerical waveforms with different types and intensities of noise and measured blasting vibration signals with different main frequency bands are used to verify the accuracy and capability of the proposed method. The numerical and field results show that the proposed method can not only effectively reduce noise components, but also well remain components of blasting vibrations.
Contactless inspection of CFRP artificial disbonds using combined laser thermography and laser ultrasonics with optical microphone
2022, Composite Structures
Citation Excerpt :
Wavelet transform, which has proven a high effective denoising method in terms of time and quality concerning denoising methods, is widely used in the realm of signal denoising. To recover the LU signals, three steps are always needed in terms of wavelet denoising [36], which are shown in Fig. 6. A significant part of wavelet denoising is finding a critical thresholding value due that noise manifests through the decomposition coefficients.
Precise and efficient inspection of disband defect in sub-mm layers of composite structures is a big challenge for the non-destructive and testing industry. In this work, 16 artificial disbands (flat bottom holes), with different sizes and depths, in carbon fiber reinforced polymer (CFRP) composites were contactless detected using combined laser thermography and laser ultrasonics with optical microphone. The presented laser ultrasonic technique employed a membrane free optical microphone is a contactless and nondestructive technique and shows high detectability for flat bottom holes with deep depth in this preliminary investigation. Sizes and depths of the flat-bottom holes were determined by the presented laser ultrasonics technique. This investigation combined the high detection efficiency of laser thermography and high detection sensitivity of laser ultrasonics with the optical microphone together for the contactless inspection of artificial disbonds.

View all citing articles on Scopus

View full text

An adaptive thresholding method for the wavelet based denoising of phonocardiogram signal

Highlights

Abstract

Introduction

Section snippets

Methodology

Results and discussion

Conclusions

Expert Syst. Appl.

Digit. Signal Process.

Microelectron. J.

Comput. Biol. Med.

Biomed. Signal Process. Control

Comput. Biol. Med.

Comput. Biol. Med.

Expert Syst. Appl.

Med. Eng. Phys.

Digit. Signal Process.

Biomed. Signal Process. Control

Comput. Biol. Med.

Phys. Proc.

Comput. Biol. Med.

Expert Syst. Appl.

Comput. Biol. Med.

Mech. Syst. Signal Process.

Feature extraction for systolic heart murmur classification

Ann. Biomed. Eng.

The electronic stethoscope

BioMed. Eng. Online

The embedded digital stethoscope uses the adaptive noise cancellation filter and the type i chebyshev iir bandpass filter to reduce the noise of the heart sound

Robust heart sound activity detection in noisy environments

Electron. Lett.