Skip to main content
Log in

ComSense-CNN: acoustic event classification via 1D convolutional neural network with compressed sensing

  • Original Paper
  • Published:
Signal, Image and Video Processing Aims and scope Submit manuscript

Abstract

Sound plays an important role in human daily life as humans use sound to communicate with each other and to understand the events occurring in the surroundings. This has prompted the researchers to further study on how to automatically identify the event that is happening by analyzing the acoustic signal. This paper presents a deep learning model enhanced by compressed sensing techniques for acoustic event classification. The compressed sensing first transforms the input acoustic signal into a reconstructed signal to reduce the noise in the input acoustic signal. The reconstructed signals are then fed into a 1-dimensional convolutional neural network (1D-CNN) to train a deep learning model for the acoustic event classification. In addition, the dropout regularization is leveraged in the 1D-CNN to mitigate the overfitting problems. The proposed compressed sensing with 1D-CNN was evaluated on three benchmark datasets, namely Soundscapes1, Soundscapes2, and UrbanSound8K, and achieved F1-scores of 80.5%, 81.1%, and 69.2%, respectively.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3

Similar content being viewed by others

References

  1. Phan, H., Hertel, L., Maass, M., Mertins, A.: Robust audio event recognition with 1-max pooling convolutional neural networks. arXiv preprint arXiv:1604.06338 (2016)

  2. Chou, S.-Y., Jang, J.-S.R., Yang, Y.-H.: Framecnn: a weakly-supervised learning framework for frame-wise acoustic event detection and classification. ReCALL 14, 55–64 (2017)

    Google Scholar 

  3. Salamon, J., Bello, J.P.: Deep convolutional neural networks and data augmentation for environmental sound classification. IEEE Signal Process. Lett. 24(3), 279–283 (2017)

    Article  Google Scholar 

  4. Zhang, T., Liang, J., Ding, B.: Acoustic scene classification using deep CNN with fine-resolution feature. Expert Syst. Appl. 143, 113067 (2020)

    Article  Google Scholar 

  5. Kim, D., Park, S., Han, D.K., Ko, H.: Multi-band CNN architecture using adaptive frequency filter for acoustic event classification. Appl. Acoust. 172, 107579 (2021)

    Article  Google Scholar 

  6. Feng, M., Kao, C.-C., Tang, Q., Sun, M., Rozgic, V., Matsoukas, S., Wang, C.: Federated self-supervised learning for acoustic event classification. arXiv preprint arXiv:2203.11997 (2022)

  7. Komatsu, T., Watanabe, S., Miyazaki, K., Hayashi, T.: Acoustic event detection with classifier chains. arXiv preprint arXiv:2202.08470 (2022)

  8. Tibshirani, R.: Regression shrinkage and selection via the lasso. J. R. Stat. Soc.: Ser. B (Methodol.) 58(1), 267–288 (1996)

    MathSciNet  MATH  Google Scholar 

  9. Salamon, J., Jacoby, C., Bello, J.P.: A dataset and taxonomy for urban sound research. In: Proceedings of the 22nd ACM International Conference on Multimedia, pp. 1041–1044 (2014)

  10. Serizel, R., Turpault, N., Eghbal-Zadeh, H., Shah, A.P.: Large-scale weakly labeled semi-supervised sound event detection in domestic environments. arXiv preprint arXiv:1807.10501 (2018)

Download references

Acknowledgements

Authors would like to thank Collaborative Resea-rch in Engineering, Science and Technology Center (CREST) for their continuous support in this research. This research is supported by Collaborative Research in Engineering Science and Technology (CREST) R &D Grant No. CREST/R &D/P07C2-17/005 and Multimedia University Internal Research Fund MMUI/220028.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Kian Ming Lim.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Tan, P.S., Lim, K.M., Tan, C.H. et al. ComSense-CNN: acoustic event classification via 1D convolutional neural network with compressed sensing. SIViP 17, 735–741 (2023). https://doi.org/10.1007/s11760-022-02281-5

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11760-022-02281-5

Keywords