Abnormal Acoustic Event Detection Based on Orthogonal Matching Pursuit in Security Surveillance System

Zhang, Pengnan; Wei, Juan; Liu, Zhe; Ning, Fangli

doi:10.1007/s11277-020-07405-z

Abnormal Acoustic Event Detection Based on Orthogonal Matching Pursuit in Security Surveillance System

Published: 24 April 2020

Volume 114, pages 1009–1024, (2020)
Cite this article

Wireless Personal Communications Aims and scope Submit manuscript

Pengnan Zhang¹,
Juan Wei¹,
Zhe Liu² &
…
Fangli Ning²

227 Accesses
3 Citations
Explore all metrics

Abstract

This paper presents a feature extraction approach for surveillance system aimed at achieving the automatic detection and recognition of public security events. The proposed approach first generates a Gabor dictionary based on the human auditory critical frequency bands, and then uses the orthogonal matching pursuit (OMP) algorithm to sparse abnormal audio signal. We select the optimal several important atoms from the Gabor dictionary and extract the scale, frequency, and translation parameters of the atoms to form the OMP feature. The performance of OMP feature is compared with traditional acoustic features and their joint features, using support vector machine (SVM) and random forest (RF) classifiers. Experiments have been performed to evaluate the effectiveness of the OMP feature for supplementing traditional acoustic features. The results show the superior performance classifier for abnormal acoustic event detection (AAED) is RF. Furthermore, the introduction of the combined features addresses the problems of low recognition accuracy and poor robustness for the surveillance system in practical applications.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Comparative analysis of audio classification with MFCC and STFT features using machine learning techniques

Article Open access 03 January 2024

Imagined speech classification exploiting EEG power spectrum features

Article 18 April 2024

Heart Sound Classification Using Deep Learning Techniques Based on Log-mel Spectrogram

Article 20 August 2022

References

Shi, B., Sun, M., Kao, C. C., et al. (2019). Semi-supervised acoustic event detection based on tri-training. In Electrical engineering and systems science audio an speech processing (eess.AS), 29 April.
Passler, S., & Fischer, W. J. (2014). Food intake monitoring: Automated chew event detection in chewing sounds. IEEE Journal of Biomedical and Health Informatics, 18(1), 278–289.
Article Google Scholar
Wang, J. C., Lin, C. H., Chen, B. W., et al. (2014). Gabor-based nonuniform scale-frequency map for environmental sound classification in home automation. IEEE Transactions on Automation Science & Engineering, 11(2), 607–613.
Article Google Scholar
Sehili, M. E. A., Lecouteux, B., Vacher, M., et al. (2012). Sound environment analysis in smart home. Lecture Notes in Computer Science, 7683(7683), 208–223.
Article Google Scholar
Lopatka, K., Kotus, J., & Czyzewski, A. (2016). Detection, classification and localization of acoustic events in the presence of background noise for acoustic surveillance of hazardous situations. Multimedia Tools and Applications, 75(17), 10407–10439.
Article Google Scholar
Astapov, S., Berdnikova, J., Ehala, J., et al. (2018). Gunshot acoustic event identification and shooter localization in a WSN of asynchronous multichannel acoustic ground sensors. Multidimensional Systems and Signal Processing, 29, 563–595.
Article MathSciNet Google Scholar
Dufaux, A., Besacier, L., Ansorge, M., et al. (2000) Automatic sound detection and recognition for noisy environment. In Proceedings of 2000 10th European signal processing conference (pp. 1–4), Tampere, September 2000.
Khunarsa, P., Lursinsap, C., & Raicharoen, T. (2010). Impulsive environment sound detection by neural classification of spectrogram and mel-frequency coefficient images. Advances in Neural Network Research and Applications, 67, 337–346.
Article Google Scholar
Okuyucu, Ç., Sert, M., & Yazici, A. (2013) Audio feature and classifier analysis for efficient recognition of environmental sounds. In Proceedings 2013 IEEE international symposium on multimedia, Anaheim, CA, December 2013
Xia, X., Togneri, R., Sohel, F., et al. (2018). Random forest classification based acoustic event detection utilizing contextual-information and bottleneck features. Pattern Recognition, 81, 1–13.
Article Google Scholar
Cohen R , Lavner Y . (2012). Infant cry analysis and detection. In Proceedings of 27th conference on electrical and electronics engineers (pp. 1–5), Eilat, Israel, November 2012.
Nguyen, Q., & Choi, J. S. (2017). Matching pursuit based robust acoustic event classification for surveillance systems. Computers and Electrical Engineering, 57, 43–54.
Article Google Scholar
Wolpert, D. (1996). The lack of a priori distinctions between learning algorithms. Neural Computation, 8(7), 1341–1390.
Article Google Scholar
Fernandez-Delgado, M., Cernadas, E., Barro, S., et al. (2014). Do we need hundreds of classifiers to solve real world classification problems. Journal of Machine Learning Research, 15, 3133–3181.
MathSciNet MATH Google Scholar
Hrabina, M., & Sigmund, M. (2017). Comparison of feature performance in gunshot detection depending on noise degradation. In Proceedings 27th international conference on RADIOELEKTRONIKA (pp. 1–4), Brno, Czech Republic, April 2017.
Chu, S., Narayanan, S., & Kuo, C. C. J. (2009). Environmental sound recognition with time-frequency audio features. IEEE Transactions on Audio, Speech and Language Processing, 17(6), 1142–1158.
Article Google Scholar
Mallat, S. G., & Zhang, Z. (1993). Matching pursuits with time-frequency dictionaries. IEEE Transactions on Signal Processing, 41(12), 3397–3415.
Article Google Scholar
Li, D., Tam, J., & Toub, D. (2013). Auditory scene classification using machine learning techniques. In Proceedings of the IEEE AASP challenge on detection and classification of acoustic scenes and events, New Paltz, NY, October 2013.
Petrovsky, A., Azarov, E., & Petrovsky, A. (2011). Hybrid signal decomposition based on instantaneous harmonic parameters and perceptually motivated wavelet packets for scalable audio coding. Signal Processing, 91(6), 1489–1504.
Article Google Scholar
Sugden, P., & Canagarajah, N. (2003). Underdetermined noisy blind separation using dual matching pursuits. Electronics Letters, 39(1), 158–160.
Article Google Scholar
Wei, P. I. (2010). Generalized demodulation method based on multi-scale chirplet and sparse signal decomposition and its application to gear fault diagnosis. Journal of Mechanical Engineering, 46(15), 63–68.
Google Scholar
Schroder, J., Goetze, S., & Anemuller, J. (2015). Spectro-temporal gabor filterbank features for acoustic event detection. IEEE/ACM Transactions on Audio Speech and Language Processing, 23(12), 2198–2208.
Article Google Scholar
Soussen, C., Rémi, G., Jérôme, I., et al. (2013). Joint K-step analysis of orthogonal matching pursuit and orthogonal least squares. IEEE Transactions on Information Theory, 59(5), 3158–3174.
Article MathSciNet Google Scholar
Pace, M. A., Nicolás, U., Masson, F., et al. (2017). Critical frequencies for the annoyance judgments of noises with tonal component, to develop an equal-annoyance contour. Acoustical Society of America Journal, 141(5), 3883.
Article Google Scholar
Zwicker, E. (1961). Subdivision of the audible frequency range into critical bands (Frequenzgruppen). The Journal of the Acoustical Society of America, 33(2), 248.
Article Google Scholar
Breiman, L. (2001). Random forests. Machine Learning, 45(1), 5–32.
Article Google Scholar
Phan, H., Marco, M., Mazur, R., et al. (2015). Random regression forests for acoustic event detection and classification. IEEE/ACM Transactions on Audio Speech and Language Processing, 23(1), 20–31.
Article Google Scholar

Download references

Acknowledgements

This work was partially supported by the NSFC Grants 51675425, Natural Science Basic Research Plan in Shaanxi Province of China Grant Nos. 2018SF-365 and 2020ZDLGY06-09, the Dongguan Social Science and Technology Development (key) Project 20185071021600.

Author information

Authors and Affiliations

School of Telecommunications Engineering, Xidian University, Xi’an, 710071, Shaanxi, People’s Republic of China
Pengnan Zhang & Juan Wei
School of Mechanical and Electrical Engineering, Northwestern Polytechnical University, Xi’an, 710072, Shaanxi, People’s Republic of China
Zhe Liu & Fangli Ning

Authors

Pengnan Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Juan Wei
View author publications
You can also search for this author in PubMed Google Scholar
Zhe Liu
View author publications
You can also search for this author in PubMed Google Scholar
Fangli Ning
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Juan Wei.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Zhang, P., Wei, J., Liu, Z. et al. Abnormal Acoustic Event Detection Based on Orthogonal Matching Pursuit in Security Surveillance System. Wireless Pers Commun 114, 1009–1024 (2020). https://doi.org/10.1007/s11277-020-07405-z

Download citation

Published: 24 April 2020
Issue Date: September 2020
DOI: https://doi.org/10.1007/s11277-020-07405-z

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Abnormal Acoustic Event Detection Based on Orthogonal Matching Pursuit in Security Surveillance System

Abstract

Access this article

Similar content being viewed by others

Comparative analysis of audio classification with MFCC and STFT features using machine learning techniques

Imagined speech classification exploiting EEG power spectrum features

Heart Sound Classification Using Deep Learning Techniques Based on Log-mel Spectrogram

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Abnormal Acoustic Event Detection Based on Orthogonal Matching Pursuit in Security Surveillance System

Abstract

Access this article

Similar content being viewed by others

Comparative analysis of audio classification with MFCC and STFT features using machine learning techniques

Imagined speech classification exploiting EEG power spectrum features

Heart Sound Classification Using Deep Learning Techniques Based on Log-mel Spectrogram

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation