ComSense-CNN: acoustic event classification via 1D convolutional neural network with compressed sensing

Tan, Pooi Shiang; Lim, Kian Ming; Tan, Cheah Heng; Lee, Chin Poo; Kwek, Lee Chung

doi:10.1007/s11760-022-02281-5

ComSense-CNN: acoustic event classification via 1D convolutional neural network with compressed sensing

Original Paper
Published: 22 June 2022

Volume 17, pages 735–741, (2023)
Cite this article

Signal, Image and Video Processing Aims and scope Submit manuscript

Pooi Shiang Tan¹^na1,
Kian Ming Lim ORCID: orcid.org/0000-0003-1929-7978¹^na1,
Cheah Heng Tan²^na1,
Chin Poo Lee¹^na1 &
…
Lee Chung Kwek³^na1

504 Accesses
Explore all metrics

Abstract

Sound plays an important role in human daily life as humans use sound to communicate with each other and to understand the events occurring in the surroundings. This has prompted the researchers to further study on how to automatically identify the event that is happening by analyzing the acoustic signal. This paper presents a deep learning model enhanced by compressed sensing techniques for acoustic event classification. The compressed sensing first transforms the input acoustic signal into a reconstructed signal to reduce the noise in the input acoustic signal. The reconstructed signals are then fed into a 1-dimensional convolutional neural network (1D-CNN) to train a deep learning model for the acoustic event classification. In addition, the dropout regularization is leveraged in the 1D-CNN to mitigate the overfitting problems. The proposed compressed sensing with 1D-CNN was evaluated on three benchmark datasets, namely Soundscapes1, Soundscapes2, and UrbanSound8K, and achieved F1-scores of 80.5%, 81.1%, and 69.2%, respectively.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Deep Convolutional Neural Network with Mixup for Environmental Sound Classification

Recognition of Urban Sound Events Using Deep Context-Aware Feature Extractors and Handcrafted Features

Acoustic scene classification based on three-dimensional multi-channel feature-correlated deep learning networks

Article Open access 12 August 2022

References

Phan, H., Hertel, L., Maass, M., Mertins, A.: Robust audio event recognition with 1-max pooling convolutional neural networks. arXiv preprint arXiv:1604.06338 (2016)
Chou, S.-Y., Jang, J.-S.R., Yang, Y.-H.: Framecnn: a weakly-supervised learning framework for frame-wise acoustic event detection and classification. ReCALL 14, 55–64 (2017)
Google Scholar
Salamon, J., Bello, J.P.: Deep convolutional neural networks and data augmentation for environmental sound classification. IEEE Signal Process. Lett. 24(3), 279–283 (2017)
Article Google Scholar
Zhang, T., Liang, J., Ding, B.: Acoustic scene classification using deep CNN with fine-resolution feature. Expert Syst. Appl. 143, 113067 (2020)
Article Google Scholar
Kim, D., Park, S., Han, D.K., Ko, H.: Multi-band CNN architecture using adaptive frequency filter for acoustic event classification. Appl. Acoust. 172, 107579 (2021)
Article Google Scholar
Feng, M., Kao, C.-C., Tang, Q., Sun, M., Rozgic, V., Matsoukas, S., Wang, C.: Federated self-supervised learning for acoustic event classification. arXiv preprint arXiv:2203.11997 (2022)
Komatsu, T., Watanabe, S., Miyazaki, K., Hayashi, T.: Acoustic event detection with classifier chains. arXiv preprint arXiv:2202.08470 (2022)
Tibshirani, R.: Regression shrinkage and selection via the lasso. J. R. Stat. Soc.: Ser. B (Methodol.) 58(1), 267–288 (1996)
MathSciNet MATH Google Scholar
Salamon, J., Jacoby, C., Bello, J.P.: A dataset and taxonomy for urban sound research. In: Proceedings of the 22nd ACM International Conference on Multimedia, pp. 1041–1044 (2014)
Serizel, R., Turpault, N., Eghbal-Zadeh, H., Shah, A.P.: Large-scale weakly labeled semi-supervised sound event detection in domestic environments. arXiv preprint arXiv:1807.10501 (2018)

Download references

Acknowledgements

Authors would like to thank Collaborative Resea-rch in Engineering, Science and Technology Center (CREST) for their continuous support in this research. This research is supported by Collaborative Research in Engineering Science and Technology (CREST) R &D Grant No. CREST/R &D/P07C2-17/005 and Multimedia University Internal Research Fund MMUI/220028.

Author information

Pooi Shiang Tan, Kian Ming Lim, Cheah Heng Tan, Chin Poo Lee and Lee Chung Kwek have contributed equally to this work.

Authors and Affiliations

Faculty of Information and Technology, Multimedia University, Jalan Ayer Keroh Lama, Bukit Beruang, 75450, Melaka, Malaysia
Pooi Shiang Tan, Kian Ming Lim & Chin Poo Lee
Motorola Solutions Malaysia Sdn. Bhd., 24, Medan Bayan Lepas, Bayan Lepas Technoplex, 11900, Bayan Lepas, Pulau Pinang, Malaysia
Cheah Heng Tan
Faculty of Engineering and Technology, Multimedia University, Jalan Ayer Keroh Lama, Bukit Beruang, 75450, Melaka, Malaysia
Lee Chung Kwek

Authors

Pooi Shiang Tan
View author publications
You can also search for this author inPubMed Google Scholar
Kian Ming Lim
View author publications
You can also search for this author inPubMed Google Scholar
Cheah Heng Tan
View author publications
You can also search for this author inPubMed Google Scholar
Chin Poo Lee
View author publications
You can also search for this author inPubMed Google Scholar
Lee Chung Kwek
View author publications
You can also search for this author inPubMed Google Scholar

Corresponding author

Correspondence to Kian Ming Lim.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Tan, P.S., Lim, K.M., Tan, C.H. et al. ComSense-CNN: acoustic event classification via 1D convolutional neural network with compressed sensing. SIViP 17, 735–741 (2023). https://doi.org/10.1007/s11760-022-02281-5

Download citation

Received: 24 December 2021
Revised: 29 April 2022
Accepted: 31 May 2022
Published: 22 June 2022
Issue Date: April 2023
DOI: https://doi.org/10.1007/s11760-022-02281-5

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

ComSense-CNN: acoustic event classification via 1D convolutional neural network with compressed sensing

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Deep Convolutional Neural Network with Mixup for Environmental Sound Classification

Recognition of Urban Sound Events Using Deep Context-Aware Feature Extractors and Handcrafted Features

Acoustic scene classification based on three-dimensional multi-channel feature-correlated deep learning networks

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now