A Voting Approach for Heart Sounds Classification Using Discrete Wavelet Transform and CNN Architecture

Das, Sunanda; Ahsan, Sk. Md. Masudul; Rahman, Mahfujur; Karim, Md. Shahriar

doi:10.1007/s42979-023-02580-9

A Voting Approach for Heart Sounds Classification Using Discrete Wavelet Transform and CNN Architecture

Original Research
Published: 05 February 2024

Volume 5, article number 251, (2024)
Cite this article

SN Computer Science Aims and scope Submit manuscript

Sunanda Das ORCID: orcid.org/0000-0002-7164-6859¹,
Sk. Md. Masudul Ahsan¹,
Mahfujur Rahman¹ &
…
Md. Shahriar Karim¹

101 Accesses
Explore all metrics

Abstract

Cardiovascular diseases (CVDs) are a group of diseases that affect the heart or blood vessels and are the leading cause of mortality around the world. The main focus of this work is to classify heart sounds accurately before the condition of the heart worsens. Over the past few decades, audio-based classifications of medical data have received high consideration among various researchers. However, there are mainly two drawbacks while working with audio data. (1) variable length of the audio data and, (2) an inadequate number of class quintessential audio samples. In this work, reflection operation and the sliding window approach are employed to generate fixed-sized audio data. The proposed method applied Discrete Wavelet Transform (DWT) and Mel-Spectrogram for feature extraction. Furthermore, different augmentation techniques such as time-stretching and pitch-shifting are utilized in the method so that the proposed deep learning-based CNN architecture can be trained on a large amount of data. The proposed method is verified using two datasets provided by the ‘PASCAL Heart Sounds Challenge’, each of which contains a small number of heart sound samples of various lengths. In comparison, the experimental outcomes exhibit that the proposed approach outperforms many state-of-the-art methods with respect to sensitivity, specificity, precision, Youden index, etc.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 7

Fig. 10

Classification of heart sounds based on the combination of the modified frequency wavelet transform and convolutional neural network

Article 07 July 2020

Learning Optimal Time-Frequency Representations for Heart Sound: A Comparative Study

Automatic Heart Sound Classification Using One Dimension Deep Neural Network

Data availability

This study exclusively utilized datasets that are publicly available. Full references to these datasets are provided within the article.

References

Cardiovascular diseases (CVDs). https://www.who.int/news-room/fact-sheets/detail/cardiovascular-diseases-(cvds) Accessed 7 Apr 2020.
Greenfield DM, Snowden JA (2019) Cardiovascular diseases and metabolic syndrome. In: The EBMT handbook, pp. 415–420.
Rangayyan RM, Lehner RJ. Phonocardiogram signal analysis: a review. Crit Rev Biomed Eng. 1988;15:211–36.
Google Scholar
Daubechies I. The wavelet transform, time-frequency localization and signal analysis. Princeton: Princeton University Press; 2009.
Book Google Scholar
Babaei S, Geranmayeh A. Heart sound reproduction based on neural network classification of cardiac valve disorders using wavelet transforms of PCG signals. Comput Biol Med. 2009;39(1):8–15.
Article PubMed Google Scholar
Djebbari A, Reguig FB. Short-time Fourier transform analysis of the phonocardiogram signal. In: ICECS 2000. 7th IEEE international conference on electronics, circuits and systems (Cat. No.00EX445), vol. 2. IEEE; 2000. pp. 844–847.
Bajric R, Zuber N, Skrimpas GA, Mijatovic N. Feature extraction using discrete wavelet transform for gear fault diagnosis of wind turbine gearbox. Shock Vib. 2016. https://doi.org/10.1155/2016/6748469.
Article Google Scholar
Haghighi-Mood A, Torry J. A sub-band energy tracking algorithm for heart sound segmentation. Comput Cardiol. 1995. https://doi.org/10.1109/CIC.1995.482711.
Article Google Scholar
Liang H, Lukkarinen S, Hartimo I. Heart sound segmentation algorithm based on heart sound envelogram. Comput Cardiol. 1997. https://doi.org/10.1109/CIC.1997.647841.
Article Google Scholar
Gupta CN, Palaniappan R, Swaminathan S, Krishnan SM. Neural network classification of homomorphic segmented heart sounds. Appl Soft Comput. 2007;7(1):286–97.
Article Google Scholar
Huiying L, Sakari L, Iiro H. A heart sound segmentation algorithm using wavelet decomposition and reconstruction. In: Proceedings of the 19th annual international conference of the IEEE engineering in medicine and biology society.’ Magnificent milestones and emerging opportunities in medical engineering’ (Cat. No. 97CH36136), vol. 4. IEEE; 1997. pp. 1630–1633.
Wang P, Kim Y, Ling L, Soh C. First heart sound detection for phonocardiogram segmentation. In: 2005 IEEE engineering in medicine and biology 27th annual conference. IEEE; 2006. pp. 5519–5522.
Hebden JE, Torry J. Neural network and conventional classifiers to distinguish between first and second heart sounds. In: IET Conference Proceedings; 1996.
Gomes EF, Bentley PJ, Pereira E, Coimbra MT, Deng Y (2013) Classifying heart sounds-approaches to the pascal challenge. In: HEALTHINF, pp. 337–340.
Deng SW, Han JQ. Towards heart sound classification without segmentation via autocorrelation feature and diffusion maps. Future Gener Comput Syst. 2016;60:13–21.
Article Google Scholar
Safara F, Doraisamy S, Azman A, Jantan A, Ramaiah ARA. Multi-level basis selection of wavelet packet decomposition tree for heart sound classification. Comput Biol Med. 2013;43(10):1407–14.
Article PubMed Google Scholar
Ari S, Hembram K, Saha G. Detection of cardiac abnormality from PCG signal using LMS based least square SVM classifier. Expert Syst Appl. 2010;37(12):8019–26.
Article Google Scholar
Zhang W, Han J, Deng S. Heart sound classification based on scaled spectrogram and partial least squares regression. Biomed Signal Process Control. 2017;32:20–8.
Article Google Scholar
Zhang W, Han J, Deng S. Heart sound classification based on scaled spectrogram and tensor decomposition. Expert Syst Appl. 2017;84:220–31.
Article Google Scholar
Leung T, White P, Collis W, Brown E, Salmon A. Classification of heart sounds using time-frequency method and artificial neural networks. In: Proceedings of the 22nd annual international conference of the IEEE engineering in medicine and biology society (Cat. No.00CH37143), vol. 2. IEEE; 2000. pp. 988–991.
Bhatikar SR, DeGroff C, Mahajan RL. A classifier based on the artificial neural network approach for cardiologic auscultation in pediatrics. Artif Intell Med. 2005;33(3):251–60.
Article PubMed Google Scholar
Zhang W, Han J. Towards heart sound classification without segmentation using convolutional neural network. In: 2017 computing in cardiology (CinC). IEEE; 2017. pp. 1–4.
Kleć M. Early detection of heart symptoms with convolutional neural network and scattering wavelet transformation. In: International symposium on methodologies for intelligent systems. Springer; 2018. pp. 24–31.
Demir F, Şengür A, Bajaj V, Polat K. Towards the classification of heart sounds based on convolutional deep neural network. Health Inf Sci Syst. 2019;7(1):1–9.
Article Google Scholar
Gaouda A, Salama M. DSP wavelet-based tool for monitoring transformer inrush currents and internal faults. IEEE Trans Power Deliv. 2010;25(3):1258–67.
Article Google Scholar
Gaouda A, El-Saadany E, Salama M, Sood V, Chikhani A. Monitoring HVDC systems using wavelet multi-resolution analysis. IEEE Trans Power Syst. 2001;16(4):662–70.
Article ADS Google Scholar
Garg G, Singh V, Gupta J, Mittal A. Optimal algorithm for ECG denoising using discrete wavelet transforms. In: 2010 IEEE international conference on computational intelligence and computing research. IEEE; 2010. pp. 1–4.
Mahmoodabadi S, Ahmadian A, Abolhasani M. ECG feature extraction using Daubechies wavelets. In: Proceedings of the fifth IASTED international conference on visualization, imaging and image processing. 2005. pp. 343–348.
Murthy HN, Meenakshi M. ECG signal denoising and ischemic event feature extraction using Daubechies wavelets. Int J Comput Appl. 2013;67(2):29–33.
Google Scholar
Saravanan N, Ramachandran K. Fault diagnosis of spur bevel gear box using discrete wavelet features and decision tree classification. Expert Syst Appl. 2009;36(5):9564–73.
Article Google Scholar
Portnoff M. Time-frequency representation of digital signals and systems based on short-time Fourier analysis. IEEE Trans Acoust Speech Signal Process. 1980;28(1):55–69.
Article Google Scholar
Hossan MA, Memon S, Gregory MA (2010) A novel approach for MFCC feature extraction. In: 2010 4th international conference on signal processing and communication systems. IEEE; pp. 1–5.
LeCun Y, Bengio Y, et al. Convolutional networks for images, speech, and time series. In: The handbook of brain theory and neural networks, vol. 3361, no. 10. 1995.
Zhang J, Chen Z. A pitch shifting reverse echo audio effect. In: EE 264 - digital signal processing. Center for Computer Research in Music and Acoustics (CCRMA), Stanford University; 2018.
Huber S. Harmonic audio object processing in frequency domain. In: Master Thesis UPF. Universitat Pompeu Fabra, Barcelona; 2009.
Laroche J, Dolson M. Improved phase vocoder time-scale modification of audio. IEEE Trans Speech Audio Process. 1999;7(3):323–32.
Article Google Scholar
Barry D, Dorran D, Coyle E (2008) Time and pitch scale modification: a real-time framework and tutorial. In: Proceedings of the 11th international conference on digital audio effects (DAFx-08), vol. 9.
Abdoli S, Cardinal P, Koerich AL. End-to-end environmental sound classification using a 1d convolutional neural network. Expert Syst Appl. 2019;136:252–63.
Article Google Scholar
Laguna JO, Olaya AG, Borrajo D (2011) A dynamic sliding window approach for activity recognition. In: International conference on user modeling, adaptation, and personalization. Springer, pp. 219–230.
Srivastava N, Hinton G, Krizhevsky A, Sutskever I, Salakhutdinov R. Dropout: a simple way to prevent neural networks from overfitting. J Mach Learn Res. 2014;15(1):1929–58.
MathSciNet Google Scholar
Agarap, AF. Deep learning using rectified linear units (RELU). 2018. arXiv preprint arXiv: 180308375
Bentley P, Nordehn G, Coimbra M, Mannor S. The PASCAL classifying heart sounds challenge 2011 (CHSC2011) results. 2011. http://www.peterjbentley.com/heartchallenge/index.html.
Pereira D, Hedayioglu F, Correia R, Silva T, Dutra I, Almeida F, Mattos S, Coimbra M. Digiscope—unobtrusive collection and annotating of auscultations in real hospital environments. In: 2011 annual international conference of the IEEE engineering in medicine and biology society, IEEE; 2011. pp. 1193–1196.
Audacity^®|Free, open source, cross-platform audio software for multi-track recording and editing. 2020. https://www.audacityteam.org/. Accessed 28 Mar 2020.
Keras: the python deep learning library. 2020. https://keras.io/. Accessed 28 Mar 2020.
Reddi SJ, Kale S, Kumar S. On the convergence of Adam and beyond. 2019. arXiv preprint arXiv: 190409237.
Deng Y, Bentley PJ. A robust heart sound segmentation and classification algorithm using wavelet decomposition and spectrogram. In: Workshop classifying heart sounds, La Palma, Canary Islands. 2012. pp. 1–6.

Download references

Acknowledgements

The authors wish to acknowledge the Department of Computer Science and Engineering (CSE), Khulna University of Engineering & Technology (KUET), Bangladesh for facilitating the work.

Funding

No funding was received for conducting this study.

Author information

Authors and Affiliations

Department of Computer Science and Engineering, Khulna University of Engineering & Technology, Khulna, 9203, Bangladesh
Sunanda Das, Sk. Md. Masudul Ahsan, Mahfujur Rahman & Md. Shahriar Karim

Authors

Sunanda Das
View author publications
You can also search for this author in PubMed Google Scholar
Sk. Md. Masudul Ahsan
View author publications
You can also search for this author in PubMed Google Scholar
Mahfujur Rahman
View author publications
You can also search for this author in PubMed Google Scholar
Md. Shahriar Karim
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Sunanda Das.

Ethics declarations

Conflict of interest

On behalf of all authors, the corresponding author states that there is no conflict of interest.

Informed consent

This study uses a publicly available ‘Classifying Heart Sounds Challenge’ dataset Sponsored by PASCAL. The authors of this paper have cited the article corresponding to the dataset as per the recommendations of its developers.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Das, S., Ahsan, S.M.M., Rahman, M. et al. A Voting Approach for Heart Sounds Classification Using Discrete Wavelet Transform and CNN Architecture. SN COMPUT. SCI. 5, 251 (2024). https://doi.org/10.1007/s42979-023-02580-9

Download citation

Received: 27 March 2022
Accepted: 26 December 2023
Published: 05 February 2024
DOI: https://doi.org/10.1007/s42979-023-02580-9

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A Voting Approach for Heart Sounds Classification Using Discrete Wavelet Transform and CNN Architecture

Abstract

Access this article

Similar content being viewed by others

Classification of heart sounds based on the combination of the modified frequency wavelet transform and convolutional neural network

Learning Optimal Time-Frequency Representations for Heart Sound: A Comparative Study

Automatic Heart Sound Classification Using One Dimension Deep Neural Network

Data availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Informed consent

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

A Voting Approach for Heart Sounds Classification Using Discrete Wavelet Transform and CNN Architecture

Abstract

Access this article

Similar content being viewed by others

Classification of heart sounds based on the combination of the modified frequency wavelet transform and convolutional neural network

Learning Optimal Time-Frequency Representations for Heart Sound: A Comparative Study

Automatic Heart Sound Classification Using One Dimension Deep Neural Network

Data availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Informed consent

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation