Heart sound classification based on scaled spectrogram and partial least squares regression

doi:10.1016/j.bspc.2016.10.004

Biomedical Signal Processing and Control

Volume 32, February 2017, Pages 20-28

https://doi.org/10.1016/j.bspc.2016.10.004

Highlights

• First, the heart cycles are estimated and aligned by detecting S1.
• Second, the spectrograms of heart cycles are scaled for comparison.
• Third, the category information is utilized using partial least squares regression.

Abstract

Phonocardiogram (PCG) signal analysis is an effective and convenient method for the preliminary diagnosis of heart disease. In this study, a method based on the scaled spectrogram and partial least squares regression (PLSR) is proposed for the classification of PCG signals. The proposed method comprises four stages: heart cycle estimation, spectrogram scaling, dimension reduction and classification. At the heart cycle estimation stage, the short time average magnitude difference of the Shannon energy envelope is applied. Then the spectrogram of each obtained heart cycle is calculated for feature extraction. However, the spectrograms of different PCG signals usually differ in size, which prevents direct comparison. To overcome this difficulty, bilinear interpolation is applied to each spectrogram to obtain a scaled spectrogram of fixed size. Nevertheless, the scaled spectrogram contains a large quantity of redundant and irrelevant information. To extract the most relevant features, we adopt PLSR to reduce the dimension of the scaled spectrograms. Since PLSR uses the category information during the dimension reduction process, the extracted features are more discriminative. The classification results are then obtained via a support vector machine (SVM). The proposed method is evaluated on two public datasets offered by the PASCAL Classifying Heart Sounds Challenge, and the results are compared with those of the best methods in the challenge, demonstrating the effectiveness of our method.

Introduction

Many pathological conditions of the cardiovascular system are reflected in heart sound signals, which makes it possible to diagnose heart disease by analyzing these signals. Heart sound auscultation is the analysis of heart sound signals using a stethoscope. Because it is easy to perform, auscultation is widely used in the clinical diagnosis of heart disease [1], [2]. However, the accuracy of auscultation depends on the skill and subjective experience of the physician [3]. Therefore, an objective analysis of heart sound signals is necessary. PCG signal analysis is another way of analyzing heart sound signals, based on phonocardiograms. In the literature, physiological and pathological information has been extracted from the PCG signal using signal processing and artificial intelligence techniques [3], [4]. With the PCG, the objective analysis of heart sound signals using computer technology is becoming popular. Moreover, telemedicine is becoming available with the development of electronic stethoscopes and smart phones [5]. Overall, the analysis of PCG signals is important for the diagnosis of heart disease. Heart sound classification aims at the automatic classification of PCG signals and is very important for preliminary diagnosis.

Heart sound classification usually involves two steps. The first step is heart sound segmentation, which attempts to detect the location of the fundamental heart sounds (FHs). The FHs include the first (S1) and second (S2) heart sounds, which are the important physical characteristics of heart sounds. The accurate localization of the FHs reveals the systolic and diastolic regions of the heart sounds. In addition, the heart cycles are identified by the FHs. Thus, the characteristics of different pathological situations within one heart cycle are used to classify different heart sound categories. Many methods, such as the envelope-based method [6], the method using dynamic clustering [7] and the logistic regression-HSMM-based method [8], have been developed for this task. However, heart sound segmentation remains a challenging task, and it is difficult to segment the FHs accurately in a noisy environment.

The second step of heart sound classification is to extract the features in one heart cycle and use the features for classification. Many features have been proposed in the literature. The three main types are time [5], frequency [9] and time-frequency complexity-based features [10], [11]. Although the time-frequency-based features are more computationally complex than features based on only time or frequency, they provide more comprehensive information about the PCG signal. Thus, time-frequency-based features usually outperform other features. The commonly used time-frequency feature extraction methods for PCG signals are wavelets [10], S-transform [12] and short time Fourier transform (STFT) [13]. The magnitude of the STFT yields the spectrogram. This spectrogram is used in this paper since it is easy to implement and convenient to scale.
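As an illustration of the spectrogram described above, the following Python sketch computes the magnitude of the STFT with NumPy. The Hann window, the window length, the hop size, the sampling rate and the test tone are assumptions chosen for the example and are not taken from the paper.

```python
import numpy as np

def spectrogram(x, fs, win_len=256, hop=128):
    """Magnitude of the STFT: rows are frequency bins, columns are time frames."""
    window = np.hanning(win_len)
    n_frames = 1 + (len(x) - win_len) // hop
    # Slice the signal into overlapping windowed frames, one frame per column.
    frames = np.stack(
        [x[i * hop : i * hop + win_len] * window for i in range(n_frames)],
        axis=1,
    )
    spec = np.abs(np.fft.rfft(frames, axis=0))
    freqs = np.fft.rfftfreq(win_len, d=1.0 / fs)
    times = (np.arange(n_frames) * hop + win_len / 2) / fs
    return freqs, times, spec

# Hypothetical test signal: a 125 Hz tone sampled at 2 kHz for one second.
fs = 2000
t = np.arange(fs) / fs
x = np.sin(2 * np.pi * 125 * t)
freqs, times, spec = spectrogram(x, fs)
peak_hz = freqs[np.argmax(spec[:, 0])]  # dominant frequency of the first frame
```

The number of columns in `spec` grows with the length of the input, which is why heart cycles of different durations yield spectrograms of different sizes.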

The primary goal of heart sound classification is to identify different heart sound categories. Explicit segmentation is not necessary for this in some situations, especially when the heart cycles are known. Therefore, methods for estimating the heart cycle duration and aligning the cycles based on the envelope are proposed to obtain the heart cycles instead of locating both S1 and S2, which simplifies the calculation. Although correct segmentation information can improve the classification performance, obtaining it requires considerable computation. More importantly, the segmentation results are incorrect in many cases, which greatly affects the accuracy of the classification.
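A minimal sketch of envelope-based heart cycle estimation follows, assuming the short time average magnitude difference of the Shannon energy envelope mentioned in the abstract. The smoothing length, the search range of 0.4 to 1.5 s and the synthetic test signal are illustrative assumptions, not the authors' settings, and the alignment to S1 is not shown.

```python
import numpy as np

def shannon_envelope(x, fs, smooth_s=0.02):
    """Smoothed Shannon energy envelope of a signal (smooth_s is an assumed value)."""
    x = x / (np.max(np.abs(x)) + 1e-12)        # normalise to [-1, 1]
    x2 = x ** 2
    energy = -x2 * np.log(x2 + 1e-12)          # Shannon energy per sample
    n = max(1, int(smooth_s * fs))
    return np.convolve(energy, np.ones(n) / n, mode="same")

def estimate_cycle_duration(env, fs, min_s=0.4, max_s=1.5):
    """Return the lag (in seconds) that minimises the average magnitude difference."""
    lags = np.arange(int(min_s * fs), int(max_s * fs) + 1)
    amdf = np.array([np.mean(np.abs(env[:-k] - env[k:])) for k in lags])
    return lags[np.argmin(amdf)] / fs

# Synthetic stand-in for a PCG record: S1-like and S2-like bursts every 0.8 s.
fs = 1000
t = np.arange(5 * fs) / fs
x = np.zeros_like(t)
for beat in np.arange(0.1, 5.0, 0.8):
    for offset, amp in ((0.0, 1.0), (0.3, 0.6)):
        burst = np.exp(-((t - beat - offset) / 0.02) ** 2)
        x += amp * burst * np.sin(2 * np.pi * 50 * t)

env = shannon_envelope(x, fs)
cycle_s = estimate_cycle_duration(env, fs)
print(f"estimated heart cycle duration: {cycle_s:.3f} s")
```

Restricting the lag search to a physiologically plausible range keeps the estimate away from the S1 to S2 interval and from multiples of the true cycle length.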

The spectrogram is extracted for each heart cycle after the heart cycles are estimated. However, the sizes of the spectrograms differ since the heart rates of different PCG signals are usually not the same. This prohibits a direct comparison between the spectrograms of different PCG signals. A bilinear interpolation [14] method is used to scale the spectrograms to a fixed size, thus enabling direct comparison. Nevertheless, the scaled spectrogram contains a large quantity of redundant and irrelevant information. In order to extract the most relevant information, a dimension reduction process is applied to the scaled spectrogram. In addition, the heart sound category provides valuable information for distinguishing between different categories, and it helps to improve the classification performance. As a result, the extracted features will be more discriminative if the category information is fully utilized during the dimension reduction process. PLSR [15] maximizes the correlation between the PCG signals and their corresponding category information during the dimension reduction process, so the category information is utilized. In addition, PLSR can robustly handle more descriptor variables than the number of samples. These are the advantages of PLSR over other dimensionality reduction methods, such as principal component analysis (PCA) [16] and linear discriminant analysis (LDA) [17]. Therefore, the discriminative features of the scaled spectrogram are extracted using PLSR in this paper. Finally, the classification is performed using the SVM classifier [11].
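The scaling step can be sketched as a plain NumPy bilinear interpolation that maps a spectrogram of any size onto a fixed grid. The function name, the corner-aligned sampling grid and the 32 x 32 target size are assumptions made for illustration; the paper's actual target size is not stated in this excerpt.

```python
import numpy as np

def bilinear_resize(img, out_rows, out_cols):
    """Resize a 2-D array with bilinear interpolation (corner-aligned grid)."""
    in_rows, in_cols = img.shape
    # Fractional source coordinates for every output row and column.
    r = np.linspace(0, in_rows - 1, out_rows)
    c = np.linspace(0, in_cols - 1, out_cols)
    r0 = np.floor(r).astype(int)
    c0 = np.floor(c).astype(int)
    r1 = np.minimum(r0 + 1, in_rows - 1)
    c1 = np.minimum(c0 + 1, in_cols - 1)
    wr = (r - r0)[:, None]                     # vertical interpolation weights
    wc = (c - c0)[None, :]                     # horizontal interpolation weights
    top = img[r0][:, c0] * (1 - wc) + img[r0][:, c1] * wc
    bottom = img[r1][:, c0] * (1 - wc) + img[r1][:, c1] * wc
    return top * (1 - wr) + bottom * wr

# Two hypothetical spectrograms from heart cycles of different durations.
rng = np.random.default_rng(0)
spec_a = rng.random((129, 40))
spec_b = rng.random((129, 55))
scaled_a = bilinear_resize(spec_a, 32, 32)
scaled_b = bilinear_resize(spec_b, 32, 32)

# Sanity check on a linear ramp, which bilinear interpolation reproduces exactly.
ramp = np.arange(12.0).reshape(3, 4)
ramp_scaled = bilinear_resize(ramp, 5, 7)
```

After scaling, `scaled_a` and `scaled_b` have the same shape and can be flattened into feature vectors of equal length for the dimension reduction stage.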

The main framework of this paper is shown in Fig. 1 and consists of four steps: estimation of heart cycle duration and alignment, spectrogram scaling of each heart cycle, PLSR and classification. PLSR consists of two parts, i.e., dimension reduction and regression. The contributions of this paper are threefold. First, the heart cycles are estimated and aligned instead of locating both S1 and S2, which simplifies the calculation process. Second, the spectrograms of heart cycles of different lengths can be compared directly using the bilinear scaling process, which to our knowledge has not been applied in heart sound research. Third, the category information is utilized during the dimension reduction process. In this way, the extracted features are best correlated with their categories, which makes the features more discriminative.

Section snippets

Data collection

The datasets used in this paper, Dataset-A and Dataset-B [18], are taken from the PASCAL Classifying Heart Sounds Challenge. Dataset-A was collected by volunteers using iStethoscope, an iPhone application that enables an iPhone to use its microphone as a digital stethoscope [19]. Dataset-A includes 176 records with a 44,100 Hz sampling frequency, grouped into four categories: Normal, Murmur, Extra Heart Sound and Artifact. A normal heart sound has a

Results

The experimental results on the two datasets are compared with the three best methods in the challenge competition: J48 [20], MLP [20] and CS UCL [25]. All the methods are evaluated on the same datasets with the same evaluation criteria.

In the J48 and MLP methods, only temporal features are used. The difference is that J48 uses a decision tree as the classifier, whereas MLP uses a multilayer perceptron for classification [20]. In the CS UCL method, the wavelet decomposition and spectrogram

Discussion

The objective of this paper is to classify different heart sound signals automatically. Thus, the classification results provide a preliminary diagnosis, which helps to determine whether further diagnosis is necessary. The main categories of heart sounds in this paper are normal, murmur and some problematic heart beats. Their physiological and pathological information is contained in the heart cycles. It is reasonable to use the heart cycle information for classification instead of locating

Conclusion

This paper proposed a novel method for classification based on scaled spectrograms and PLSR. This method can efficiently detect whether a PCG signal is problematic. Thus, it provides valuable information for deciding whether further treatment is necessary. Instead of characterizing the feature of a heart cycle obtained via explicit segmentation, the feature is extracted based on the estimated heart cycle to simplify the computation process. The spectrogram extracted from the heart cycle is

Acknowledgements

This research is partly supported by the National Natural Science Foundation of China under grant Nos. 91120303 and 61471145.

References (26)

I.R. Hanna et al., A history of cardiac auscultation and some of its contributors, Am. J. Cardiol. (2002)
Z. Jiang et al., A cardiac sound characteristic waveform method for in-home heart disorder monitoring with electric stethoscope, Expert Syst. Appl. (2006)
S.-W. Deng et al., Towards heart sound classification without segmentation via autocorrelation feature and diffusion maps, Future Gener. Comput. Syst. (2016)
H. Tang et al., Segmentation of heart sounds based on dynamic clustering, Biomed. Signal Process. Control (2012)
F. Safara et al., Multi-level basis selection of wavelet packet decomposition tree for heart sound classification, Comput. Biol. Med. (2013)
S. Ari et al., Detection of cardiac abnormality from PCG signal using LMS based least square SVM classifier, Expert Syst. Appl. (2010)
I. Maglogiannis et al., Support vectors machine-based identification of heart valve diseases using heart sounds, Comput. Methods Program Biomed. (2009)
A. Moukadem et al., A robust heart sounds segmentation module based on s-transform, Biomed. Signal Process. Control (2013)
Y. Soeta et al., Detection of features of prosthetic cardiac valve sound by spectrogram analysis, Appl. Acoust. (2015)
P. Geladi et al., Partial least-squares regression: a tutorial, Anal. Chim. Acta (1986)
K. Senior, Smart phones: new clinical tools in oncology?, Lancet Oncol. (2011)
R.M. Rangayyan et al., Phonocardiogram signal analysis: a review, Crit. Rev. Biomed. Eng. (1986)
J. Herzig et al., Monitoring cardiac stress using features extracted from s1 heart sounds, IEEE Trans. Biomed. Eng. (2015)

