Abstract
Sound plays an important role in human daily life as humans use sound to communicate with each other and to understand the events occurring in the surroundings. This has prompted the researchers to further study on how to automatically identify the event that is happening by analyzing the acoustic signal. This paper presents a deep learning model enhanced by compressed sensing techniques for acoustic event classification. The compressed sensing first transforms the input acoustic signal into a reconstructed signal to reduce the noise in the input acoustic signal. The reconstructed signals are then fed into a 1-dimensional convolutional neural network (1D-CNN) to train a deep learning model for the acoustic event classification. In addition, the dropout regularization is leveraged in the 1D-CNN to mitigate the overfitting problems. The proposed compressed sensing with 1D-CNN was evaluated on three benchmark datasets, namely Soundscapes1, Soundscapes2, and UrbanSound8K, and achieved F1-scores of 80.5%, 81.1%, and 69.2%, respectively.



Similar content being viewed by others
References
Phan, H., Hertel, L., Maass, M., Mertins, A.: Robust audio event recognition with 1-max pooling convolutional neural networks. arXiv preprint arXiv:1604.06338 (2016)
Chou, S.-Y., Jang, J.-S.R., Yang, Y.-H.: Framecnn: a weakly-supervised learning framework for frame-wise acoustic event detection and classification. ReCALL 14, 55–64 (2017)
Salamon, J., Bello, J.P.: Deep convolutional neural networks and data augmentation for environmental sound classification. IEEE Signal Process. Lett. 24(3), 279–283 (2017)
Zhang, T., Liang, J., Ding, B.: Acoustic scene classification using deep CNN with fine-resolution feature. Expert Syst. Appl. 143, 113067 (2020)
Kim, D., Park, S., Han, D.K., Ko, H.: Multi-band CNN architecture using adaptive frequency filter for acoustic event classification. Appl. Acoust. 172, 107579 (2021)
Feng, M., Kao, C.-C., Tang, Q., Sun, M., Rozgic, V., Matsoukas, S., Wang, C.: Federated self-supervised learning for acoustic event classification. arXiv preprint arXiv:2203.11997 (2022)
Komatsu, T., Watanabe, S., Miyazaki, K., Hayashi, T.: Acoustic event detection with classifier chains. arXiv preprint arXiv:2202.08470 (2022)
Tibshirani, R.: Regression shrinkage and selection via the lasso. J. R. Stat. Soc.: Ser. B (Methodol.) 58(1), 267–288 (1996)
Salamon, J., Jacoby, C., Bello, J.P.: A dataset and taxonomy for urban sound research. In: Proceedings of the 22nd ACM International Conference on Multimedia, pp. 1041–1044 (2014)
Serizel, R., Turpault, N., Eghbal-Zadeh, H., Shah, A.P.: Large-scale weakly labeled semi-supervised sound event detection in domestic environments. arXiv preprint arXiv:1807.10501 (2018)
Acknowledgements
Authors would like to thank Collaborative Resea-rch in Engineering, Science and Technology Center (CREST) for their continuous support in this research. This research is supported by Collaborative Research in Engineering Science and Technology (CREST) R &D Grant No. CREST/R &D/P07C2-17/005 and Multimedia University Internal Research Fund MMUI/220028.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Tan, P.S., Lim, K.M., Tan, C.H. et al. ComSense-CNN: acoustic event classification via 1D convolutional neural network with compressed sensing. SIViP 17, 735–741 (2023). https://doi.org/10.1007/s11760-022-02281-5
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11760-022-02281-5