Skip to main content
Log in

A Voting Approach for Heart Sounds Classification Using Discrete Wavelet Transform and CNN Architecture

  • Original Research
  • Published:
SN Computer Science Aims and scope Submit manuscript

Abstract

Cardiovascular diseases (CVDs) are a group of diseases that affect the heart or blood vessels and are the leading cause of mortality around the world. The main focus of this work is to classify heart sounds accurately before the condition of the heart worsens. Over the past few decades, audio-based classifications of medical data have received high consideration among various researchers. However, there are mainly two drawbacks while working with audio data. (1) variable length of the audio data and, (2) an inadequate number of class quintessential audio samples. In this work, reflection operation and the sliding window approach are employed to generate fixed-sized audio data. The proposed method applied Discrete Wavelet Transform (DWT) and Mel-Spectrogram for feature extraction. Furthermore, different augmentation techniques such as time-stretching and pitch-shifting are utilized in the method so that the proposed deep learning-based CNN architecture can be trained on a large amount of data. The proposed method is verified using two datasets provided by the ‘PASCAL Heart Sounds Challenge’, each of which contains a small number of heart sound samples of various lengths. In comparison, the experimental outcomes exhibit that the proposed approach outperforms many state-of-the-art methods with respect to sensitivity, specificity, precision, Youden index, etc.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11

Similar content being viewed by others

Data availability

This study exclusively utilized datasets that are publicly available. Full references to these datasets are provided within the article.

References

  1. Cardiovascular diseases (CVDs). https://www.who.int/news-room/fact-sheets/detail/cardiovascular-diseases-(cvds) Accessed 7 Apr 2020.

  2. Greenfield DM, Snowden JA (2019) Cardiovascular diseases and metabolic syndrome. In: The EBMT handbook, pp. 415–420.

  3. Rangayyan RM, Lehner RJ. Phonocardiogram signal analysis: a review. Crit Rev Biomed Eng. 1988;15:211–36.

    Google Scholar 

  4. Daubechies I. The wavelet transform, time-frequency localization and signal analysis. Princeton: Princeton University Press; 2009.

    Book  Google Scholar 

  5. Babaei S, Geranmayeh A. Heart sound reproduction based on neural network classification of cardiac valve disorders using wavelet transforms of PCG signals. Comput Biol Med. 2009;39(1):8–15.

    Article  PubMed  Google Scholar 

  6. Djebbari A, Reguig FB. Short-time Fourier transform analysis of the phonocardiogram signal. In: ICECS 2000. 7th IEEE international conference on electronics, circuits and systems (Cat. No.00EX445), vol. 2. IEEE; 2000. pp. 844–847.

  7. Bajric R, Zuber N, Skrimpas GA, Mijatovic N. Feature extraction using discrete wavelet transform for gear fault diagnosis of wind turbine gearbox. Shock Vib. 2016. https://doi.org/10.1155/2016/6748469.

    Article  Google Scholar 

  8. Haghighi-Mood A, Torry J. A sub-band energy tracking algorithm for heart sound segmentation. Comput Cardiol. 1995. https://doi.org/10.1109/CIC.1995.482711.

    Article  Google Scholar 

  9. Liang H, Lukkarinen S, Hartimo I. Heart sound segmentation algorithm based on heart sound envelogram. Comput Cardiol. 1997. https://doi.org/10.1109/CIC.1997.647841.

    Article  Google Scholar 

  10. Gupta CN, Palaniappan R, Swaminathan S, Krishnan SM. Neural network classification of homomorphic segmented heart sounds. Appl Soft Comput. 2007;7(1):286–97.

    Article  Google Scholar 

  11. Huiying L, Sakari L, Iiro H. A heart sound segmentation algorithm using wavelet decomposition and reconstruction. In: Proceedings of the 19th annual international conference of the IEEE engineering in medicine and biology society.’ Magnificent milestones and emerging opportunities in medical engineering’ (Cat. No. 97CH36136), vol. 4. IEEE; 1997. pp. 1630–1633.

  12. Wang P, Kim Y, Ling L, Soh C. First heart sound detection for phonocardiogram segmentation. In: 2005 IEEE engineering in medicine and biology 27th annual conference. IEEE; 2006. pp. 5519–5522.

  13. Hebden JE, Torry J. Neural network and conventional classifiers to distinguish between first and second heart sounds. In: IET Conference Proceedings; 1996.

  14. Gomes EF, Bentley PJ, Pereira E, Coimbra MT, Deng Y (2013) Classifying heart sounds-approaches to the pascal challenge. In: HEALTHINF, pp. 337–340.

  15. Deng SW, Han JQ. Towards heart sound classification without segmentation via autocorrelation feature and diffusion maps. Future Gener Comput Syst. 2016;60:13–21.

    Article  Google Scholar 

  16. Safara F, Doraisamy S, Azman A, Jantan A, Ramaiah ARA. Multi-level basis selection of wavelet packet decomposition tree for heart sound classification. Comput Biol Med. 2013;43(10):1407–14.

    Article  PubMed  Google Scholar 

  17. Ari S, Hembram K, Saha G. Detection of cardiac abnormality from PCG signal using LMS based least square SVM classifier. Expert Syst Appl. 2010;37(12):8019–26.

    Article  Google Scholar 

  18. Zhang W, Han J, Deng S. Heart sound classification based on scaled spectrogram and partial least squares regression. Biomed Signal Process Control. 2017;32:20–8.

    Article  Google Scholar 

  19. Zhang W, Han J, Deng S. Heart sound classification based on scaled spectrogram and tensor decomposition. Expert Syst Appl. 2017;84:220–31.

    Article  Google Scholar 

  20. Leung T, White P, Collis W, Brown E, Salmon A. Classification of heart sounds using time-frequency method and artificial neural networks. In: Proceedings of the 22nd annual international conference of the IEEE engineering in medicine and biology society (Cat. No.00CH37143), vol. 2. IEEE; 2000. pp. 988–991.

  21. Bhatikar SR, DeGroff C, Mahajan RL. A classifier based on the artificial neural network approach for cardiologic auscultation in pediatrics. Artif Intell Med. 2005;33(3):251–60.

    Article  PubMed  Google Scholar 

  22. Zhang W, Han J. Towards heart sound classification without segmentation using convolutional neural network. In: 2017 computing in cardiology (CinC). IEEE; 2017. pp. 1–4.

  23. Kleć M. Early detection of heart symptoms with convolutional neural network and scattering wavelet transformation. In: International symposium on methodologies for intelligent systems. Springer; 2018. pp. 24–31.

  24. Demir F, Şengür A, Bajaj V, Polat K. Towards the classification of heart sounds based on convolutional deep neural network. Health Inf Sci Syst. 2019;7(1):1–9.

    Article  Google Scholar 

  25. Gaouda A, Salama M. DSP wavelet-based tool for monitoring transformer inrush currents and internal faults. IEEE Trans Power Deliv. 2010;25(3):1258–67.

    Article  Google Scholar 

  26. Gaouda A, El-Saadany E, Salama M, Sood V, Chikhani A. Monitoring HVDC systems using wavelet multi-resolution analysis. IEEE Trans Power Syst. 2001;16(4):662–70.

    Article  ADS  Google Scholar 

  27. Garg G, Singh V, Gupta J, Mittal A. Optimal algorithm for ECG denoising using discrete wavelet transforms. In: 2010 IEEE international conference on computational intelligence and computing research. IEEE; 2010. pp. 1–4.

  28. Mahmoodabadi S, Ahmadian A, Abolhasani M. ECG feature extraction using Daubechies wavelets. In: Proceedings of the fifth IASTED international conference on visualization, imaging and image processing. 2005. pp. 343–348.

  29. Murthy HN, Meenakshi M. ECG signal denoising and ischemic event feature extraction using Daubechies wavelets. Int J Comput Appl. 2013;67(2):29–33.

    Google Scholar 

  30. Saravanan N, Ramachandran K. Fault diagnosis of spur bevel gear box using discrete wavelet features and decision tree classification. Expert Syst Appl. 2009;36(5):9564–73.

    Article  Google Scholar 

  31. Portnoff M. Time-frequency representation of digital signals and systems based on short-time Fourier analysis. IEEE Trans Acoust Speech Signal Process. 1980;28(1):55–69.

    Article  Google Scholar 

  32. Hossan MA, Memon S, Gregory MA (2010) A novel approach for MFCC feature extraction. In: 2010 4th international conference on signal processing and communication systems. IEEE; pp. 1–5.

  33. LeCun Y, Bengio Y, et al. Convolutional networks for images, speech, and time series. In: The handbook of brain theory and neural networks, vol. 3361, no. 10. 1995.

  34. Zhang J, Chen Z. A pitch shifting reverse echo audio effect. In: EE 264 - digital signal processing. Center for Computer Research in Music and Acoustics (CCRMA), Stanford University; 2018.

  35. Huber S. Harmonic audio object processing in frequency domain. In: Master Thesis UPF. Universitat Pompeu Fabra, Barcelona; 2009.

  36. Laroche J, Dolson M. Improved phase vocoder time-scale modification of audio. IEEE Trans Speech Audio Process. 1999;7(3):323–32.

    Article  Google Scholar 

  37. Barry D, Dorran D, Coyle E (2008) Time and pitch scale modification: a real-time framework and tutorial. In: Proceedings of the 11th international conference on digital audio effects (DAFx-08), vol. 9.

  38. Abdoli S, Cardinal P, Koerich AL. End-to-end environmental sound classification using a 1d convolutional neural network. Expert Syst Appl. 2019;136:252–63.

    Article  Google Scholar 

  39. Laguna JO, Olaya AG, Borrajo D (2011) A dynamic sliding window approach for activity recognition. In: International conference on user modeling, adaptation, and personalization. Springer, pp. 219–230.

  40. Srivastava N, Hinton G, Krizhevsky A, Sutskever I, Salakhutdinov R. Dropout: a simple way to prevent neural networks from overfitting. J Mach Learn Res. 2014;15(1):1929–58.

    MathSciNet  Google Scholar 

  41. Agarap, AF. Deep learning using rectified linear units (RELU). 2018. arXiv preprint arXiv: 180308375

  42. Bentley P, Nordehn G, Coimbra M, Mannor S. The PASCAL classifying heart sounds challenge 2011 (CHSC2011) results. 2011. http://www.peterjbentley.com/heartchallenge/index.html.

  43. Pereira D, Hedayioglu F, Correia R, Silva T, Dutra I, Almeida F, Mattos S, Coimbra M. Digiscope—unobtrusive collection and annotating of auscultations in real hospital environments. In: 2011 annual international conference of the IEEE engineering in medicine and biology society, IEEE; 2011. pp. 1193–1196.

  44. Audacity®|Free, open source, cross-platform audio software for multi-track recording and editing. 2020. https://www.audacityteam.org/. Accessed 28 Mar 2020.

  45. Keras: the python deep learning library. 2020. https://keras.io/. Accessed 28 Mar 2020.

  46. Reddi SJ, Kale S, Kumar S. On the convergence of Adam and beyond. 2019. arXiv preprint arXiv: 190409237.

  47. Deng Y, Bentley PJ. A robust heart sound segmentation and classification algorithm using wavelet decomposition and spectrogram. In: Workshop classifying heart sounds, La Palma, Canary Islands. 2012. pp. 1–6.

Download references

Acknowledgements

The authors wish to acknowledge the Department of Computer Science and Engineering (CSE), Khulna University of Engineering & Technology (KUET), Bangladesh for facilitating the work.

Funding

No funding was received for conducting this study.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sunanda Das.

Ethics declarations

Conflict of interest

On behalf of all authors, the corresponding author states that there is no conflict of interest.

Informed consent

This study uses a publicly available ‘Classifying Heart Sounds Challenge’ dataset Sponsored by PASCAL. The authors of this paper have cited the article corresponding to the dataset as per the recommendations of its developers.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Das, S., Ahsan, S.M.M., Rahman, M. et al. A Voting Approach for Heart Sounds Classification Using Discrete Wavelet Transform and CNN Architecture. SN COMPUT. SCI. 5, 251 (2024). https://doi.org/10.1007/s42979-023-02580-9

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s42979-023-02580-9

Keywords

Navigation