Towards heart sound classification without segmentation via autocorrelation feature and diffusion maps

https://doi.org/10.1016/j.future.2016.01.010

Highlights

  • A novel framework for heart sound classification without segmentation.

  • Extracting the autocorrelation features of the normalized average Shannon energy envelopes at different wavelet sub-bands.

  • Fusing the autocorrelation features into a unified feature using diffusion maps and classifying it with an SVM classifier.

  • Evaluating the proposed method on two public datasets published in the PASCAL Classifying Heart Sounds Challenge.

Abstract

Heart sound classification, used for automatic heart sound auscultation and cardiac monitoring, plays an important role in primary health centers and home care. However, one of the most difficult problems in heart sound classification is heart sound segmentation, especially when classifying the wide range of real-world heart sounds accompanied by murmurs and other artifact noise. In this study, we present a novel framework for heart sound classification without segmentation, based on autocorrelation features and diffusion maps, which can provide a primary diagnosis in primary health centers and home care. In the proposed framework, the autocorrelation features are first extracted from the sub-band envelopes calculated from the sub-band coefficients of the heart sound signal obtained with the discrete wavelet transform (DWT). Then, the autocorrelation features are fused into a unified feature representation with diffusion maps. Finally, the unified feature is input into a Support Vector Machine (SVM) classifier to perform the heart sound classification. The proposed framework is evaluated on two public datasets published in the PASCAL Classifying Heart Sounds Challenge. The experimental results show that the proposed method outperforms the baselines.

Introduction

Heart sound auscultation has been a critical part of the clinical examination since the invention of the stethoscope by Laennec in 1816 [1]. Traditional heart auscultation, however, depends heavily on the ear sensitivity and the subjective experience (auscultation skill) of the physician [2]. Nowadays, heart sound classification, used for automatic heart sound auscultation [3] and cardiac monitoring [4], has become a promising research field building on the methods and techniques of modern signal processing and artificial intelligence [5]. With the development and popularization of the electronic stethoscope and the smartphone (e.g., the iPhone), heart sound classification plays an important role in primary health centers and home care.

The procedure of heart sound classification usually consists of three steps: heart sound segmentation, feature extraction, and classification. Heart sound segmentation aims at dividing the heart sound signal into a series of cardiac cycles. From each cardiac cycle, a feature is extracted that captures information about the mechanical activity of the heart within one cardiac period. The extracted feature is input into a classifier, such as Artificial Neural Networks (ANN), Support Vector Machines (SVM), or Hidden Markov Models (HMMs), to identify abnormal heart sounds, which usually relate to some heart condition. In some methods [4], [6], [7], heart sound segmentation was performed with the electrocardiogram (ECG), recorded in parallel, as a reference. By segmenting the heart sound signal into cardiac cycles according to the ECG, Ahlstrom [6] focused on murmur classification (distinguishing pathological murmurs from physiological murmurs) based on a recurrence quantification analysis (RQA) feature and an ANN classifier. Jabbari [7] also relied on the ECG to segment and classify heart sounds, using a feature extracted by matching pursuit and a three-layer feed-forward multilayer perceptron (MLP) network. However, these methods with ECG-based segmentation require the heart sound and the ECG signal to be recorded simultaneously and processed synchronously, which is very inconvenient, especially in the case of infants or newborn children [8].

Recently, more classification methods that do not use the ECG signal have been proposed. Among them, segmentation with envelope analysis is the most popular and is widely used for feature extraction and heart sound classification. Envelope-based segmentation is performed in three steps: (1) extracting the envelope of the heart sound signal; (2) detecting the peaks of the fundamental heart sounds (FHS), S1 and S2; and (3) identifying the cardiac cycles with peak conditioning. The envelope extraction algorithms used in envelope-based classification methods include the normalized average Shannon energy [9], the Hilbert transform [10], homomorphic filtering [11], cardiac sound characteristic waveform extraction [2], the Hilbert–Huang transform [12], and the short-time modified Hilbert transform [13]. Moreover, to improve the robustness of FHS peak detection, the heart sound signal is usually represented in a transform domain using signal analysis approaches such as the short-time Fourier transform, the discrete wavelet transform [14], the tunable-Q wavelet transform [15], optimum multi-scale wavelet packet decomposition (OMS-WPD) [16], and the S-transform [17]. However, due to the unreliability of peak conditioning for detecting and identifying the FHS peaks, envelope-based methods suffer from two main drawbacks [18]. First, true FHS peaks may be missed and extra false peaks detected because of interference from murmurs or background noise. Second, the common assumption used in peak conditioning, that the systolic period is shorter than the diastolic period, does not always hold, especially for infants, newborn children, or some cardiac patients.
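To make the envelope-extraction step concrete, the following is a minimal sketch of the normalized average Shannon energy envelope mentioned above. The frame length and hop (in samples) are illustrative choices, not the paper's settings:

```python
import numpy as np

def shannon_energy_envelope(x, frame_len=40, hop=20):
    """Normalized average Shannon energy envelope of a heart sound signal.

    frame_len/hop are in samples; at a 2 kHz sampling rate these are
    roughly the ~20 ms frames with 50% overlap common in the literature
    (assumed values, not taken from the paper).
    """
    x = x / np.max(np.abs(x))                 # amplitude normalization
    eps = np.finfo(float).eps                 # avoid log(0)
    frames = [x[i:i + frame_len]
              for i in range(0, len(x) - frame_len + 1, hop)]
    # Average Shannon energy per frame: -(1/N) * sum(x^2 * log(x^2))
    E = np.array([-np.mean(f**2 * np.log(f**2 + eps)) for f in frames])
    # Normalize to zero mean and unit variance.
    return (E - E.mean()) / E.std()
```

The Shannon energy emphasizes medium-intensity components, which makes the S1/S2 lobes stand out against both low-level noise and short high-amplitude spikes.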
Besides these envelope-based methods, approaches based on statistical models have also been used for heart sound segmentation in a supervised or unsupervised way, such as HMMs [19], duration-dependent HMMs [20], ensemble empirical mode decomposition [21], K-means [22], and dynamic clustering [23]. The nature of the model-based methods is to characterize or summarize the properties of the FHS with their models, based on discriminative information about the FHS such as the distribution of time–frequency energy, the period (of systole, diastole, or the cardiac cycle), and the temporal correlation (Markov property). Unfortunately, the properties of the FHS vary greatly from infants to the elderly and from healthy people to cardiac patients. It is difficult for model-based methods to capture all FHS in a unified model, especially when the recordings are accompanied by artifact sounds in the real world.

In fact, the primary task of heart sound classification can be performed without heart sound segmentation, as done in  [18]. The goal of the primary task is only to detect the presence of a disorder in the heart sound rather than to further identify it, which is sufficient to provide a primary diagnosis in primary health centers and home care. The results of the primary diagnosis can, of course, also be used later to implement automatic diagnosis. Based on the cardiac period estimated from the heart sound, Yuenyong  [18] extracted an equal number of cardiac cycles and classified them into two categories (normal and abnormal) with a neural network classifier. However, accurate estimation of the cardiac period is itself difficult, and their method does not consider the classification of heart sounds mixed with artifact sounds.

In this study, a novel framework is proposed for the primary task of heart sound classification based on diffusion maps [24], [25] and an SVM classifier. The overall framework is shown in Fig. 1. First, the pre-processed heart sound signal is decomposed into approximation and detail coefficients using the discrete wavelet transform (DWT). Second, from these coefficients, the normalized average Shannon energy envelopes and their autocorrelation functions are calculated. Third, the sub-band autocorrelation functions associated with the approximation and detail DWT coefficients are fused by diffusion maps to obtain a unified feature representation of the heart sound signal. Finally, the feature is input into the SVM to classify the heart sound signal. In addition, experiments are performed on the public datasets published in the PASCAL Classifying Heart Sounds Challenge [26], and the proposed method is compared with the three best algorithms presented in the Challenge [26]: UCI, J48, and MLP.
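The autocorrelation step above captures the periodicity of the cardiac cycle without locating individual S1/S2 peaks. A minimal sketch of a normalized sub-band autocorrelation feature (the lag range is an assumed parameter, not the paper's setting):

```python
import numpy as np

def autocorrelation_feature(env, max_lag=None):
    """Normalized autocorrelation of a sub-band envelope.

    Returns r[0..max_lag] with r[0] = 1; the periodicity of the cardiac
    cycle shows up as peaks at lags equal to multiples of the cycle
    length, with no need to detect S1/S2 positions.
    """
    env = env - env.mean()                       # remove DC offset
    n = len(env)
    if max_lag is None:
        max_lag = n - 1
    full = np.correlate(env, env, mode="full")   # lags -(n-1)..(n-1)
    r = full[n - 1:n + max_lag]                  # keep non-negative lags
    return r / r[0]                              # normalize by zero-lag energy
```

Because the autocorrelation is invariant to time shifts of the envelope, the feature is the same regardless of where in the cardiac cycle the recording starts.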

The main contributions of this paper are twofold. (i) In contrast to existing approaches, the proposed framework is the first to perform heart sound classification without using any location information in the heart sound signal, such as segmentation. Although segmentation is not required in  [18], that method relies on accurate estimation of the cardiac period, which provides a reference for selecting intervals from the heart sound signal; estimation of the cardiac period is unnecessary in our method. (ii) We propose a novel approach to fusing the autocorrelation features of different frequency bands based on diffusion maps. Our experiments show that the proposed framework based on this feature fusion is robust for classifying heart sounds with artifact sounds, strong murmurs, and noise.
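For readers unfamiliar with the fusion step, the following is a generic sketch of a diffusion-map embedding, the dimensionality-reduction technique used for fusing the sub-band features. The kernel width, embedding dimension, and diffusion time are illustrative choices, not the paper's settings:

```python
import numpy as np

def diffusion_map(X, epsilon=1.0, n_components=2, t=1):
    """Basic diffusion-map embedding of the rows of X.

    Builds a Gaussian affinity matrix, row-normalizes it into a Markov
    transition matrix, and embeds each point with the leading non-trivial
    eigenvectors scaled by their eigenvalues.
    """
    # Gaussian affinity on pairwise squared Euclidean distances.
    d2 = np.sum((X[:, None, :] - X[None, :, :]) ** 2, axis=-1)
    K = np.exp(-d2 / epsilon)
    # Row-normalize to obtain the Markov transition matrix P.
    P = K / K.sum(axis=1, keepdims=True)
    # P is similar to a symmetric matrix, so its eigenvalues are real.
    vals, vecs = np.linalg.eig(P)
    order = np.argsort(-vals.real)
    vals, vecs = vals.real[order], vecs.real[:, order]
    # Diffusion coordinates lambda_k^t * psi_k, skipping the trivial
    # constant eigenvector at lambda_0 = 1.
    return vecs[:, 1:n_components + 1] * vals[1:n_components + 1] ** t
```

Points that are connected by many short diffusion paths end up close in the embedding, which is why the map provides a compact, noise-robust unified representation of the sub-band autocorrelation features.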

Section snippets

Pre-processing and envelope extraction

The heart sound signal x(i) is first decimated to a 2 kHz sampling frequency and then filtered with a sixth-order zero-phase Butterworth band-pass filter (25–900 Hz) to eliminate out-of-band noise. Next, the resulting signal x̂(i) is normalized as x̄(i) = x̂(i)/max(|x̂(i)|).
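This pre-processing chain can be sketched with SciPy as follows (a minimal illustration; it assumes the original sampling rate is an integer multiple of 2 kHz):

```python
import numpy as np
from scipy.signal import butter, decimate, sosfiltfilt

def preprocess(x, fs):
    """Decimate to 2 kHz, band-pass 25-900 Hz (zero-phase), normalize."""
    # Decimate to a 2 kHz sampling frequency (integer ratio assumed).
    factor = int(fs // 2000)
    if factor > 1:
        x = decimate(x, factor)
    fs = 2000
    # Sixth-order Butterworth band-pass; sosfiltfilt gives zero phase.
    sos = butter(6, [25, 900], btype="bandpass", fs=fs, output="sos")
    x = sosfiltfilt(sos, x)
    # Normalize to unit maximum absolute amplitude.
    return x / np.max(np.abs(x))
```

Zero-phase (forward-backward) filtering matters here: a causal filter would shift the S1/S2 lobes in time and distort the envelope shape.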

The decimated and normalized heart sound signal is decomposed into four levels using the order-six Daubechies (db6) wavelet, chosen for its morphological similarity to heart sound components  [27]. The approximation

Datasets

The proposed framework is applied to two public heart sound datasets published in the PASCAL Classifying Heart Sounds Challenge  [26]. The first, named Dataset-A, was collected from volunteer iPhone users and recorded with iStethoscope (an iPhone application) in real-world situations. No information is available on the auscultated subjects, such as gender, age, or condition  [28]. Dataset-A contains 176 records in WAV format with a 44 100 Hz

Conclusion

This study proposed a novel framework for classifying heart sounds without segmenting them into cardiac cycles. The proposed framework is effective in providing a primary diagnosis for automatic heart sound auscultation in the real world, before further identification of specific murmurs. Instead of characterizing features of cardiac cycles obtained by segmentation, the sub-band autocorrelation features can capture the whole information of the heart sound signal based on

Acknowledgments

This work was supported in part by the Major Research plan of the National Natural Science Foundation of China (No. 91120303), National Natural Science Foundation of China (No. 91220301), Natural Science Foundation of Heilongjiang Province of China (No. F2015012), Academic Core Funding of Young Projects of Harbin Normal University of China (No. KGB201225), and Open Fund by Smart Education and Information Engineering (Harbin Normal University) (No. EIE2013-01).


References (28)

  • A. Moukadem et al.

    A robust heart sounds segmentation module based on S-transform

    Biomed. Signal Process. Control

    (2013)
  • C.N. Gupta et al.

    Neural network classification of homomorphic segmented heart sounds

    Appl. Soft Comput.

    (2007)
  • H. Tang et al.

    Segmentation of heart sounds based on dynamic clustering

    Biomed. Signal Process. Control

    (2012)
  • A. Hamdy, H. Hefny, M.A. Salama, A.E. Hassanien, T.-H. Kim, The importance of handling multivariate attributes in the...

    Shi-Wen Deng received the B.E. degree from the Institute of Technology, Jia Mu Si University, JiaMuSi, China, in 1997, the M.E. from The School of Computer Science, Harbin Normal University, Harbin, China, in 2005, and the Ph.D. degree from The School of Computer Science, Harbin Institute of Technology in 2012. Currently, he is with the School of Mathematical Sciences, Harbin Normal University, Harbin, China. His research interests are in the area of speech and audio signal processing, including content-based audio analysis, noise suppression, speech/audio classification/detection.

    Ji-Qing Han received the B.S. and M.S. degrees in electrical engineering and the Ph.D. degree in computer science from the Harbin Institute of Technology, Harbin, China, in 1987, 1990, and 1998, respectively. Currently, he is the associate dean of the School of Computer Science and Technology, Harbin Institute of Technology. He is a member of IEEE and a member of the editorial boards of the Journal of Chinese Information Processing and the Journal of Data Acquisition & Processing. Prof. Han is undertaking several projects from the National Natural Science Foundation, the 863 Hi-tech Program, and the National Basic Research Program. He has won three Second Prize and two Third Prize ministry/province Science and Technology awards. He has published more than 100 papers and 2 books. His research fields include speech signal processing and audio information processing.
