Skip to main content
Log in

Probabilistic Detection Methods for Acoustic Surveillance Using Audio Histograms

  • Published:
Circuits, Systems, and Signal Processing Aims and scope Submit manuscript

Abstract

Acoustic surveillance is gaining importance given the pervasive nature of multimedia sensors being deployed in all environments. In this paper, novel probabilistic detection methods using audio histograms are proposed for acoustic event detection in a multimedia surveillance environment. The proposed detection methods use audio histograms to classify events in a well-defined acoustic space. The proposed methods belong to the category of novelty detection methods, since audio data corresponding to the event is not used in the training process. These methods hence alleviate the problem of collecting large amount of audio data for training statistical models. These methods are also computationally efficient since a conventional audio feature set like the Mel frequency cepstral coefficients in tandem with audio histograms are used to perform acoustic event detection. Experiments on acoustic event detection are conducted on the SUSAS database available from Linguistic data consortium. The performance is measured in terms of false detection rate and true detection rate. Receiver operating characteristics curves are obtained for the proposed probabilistic detection methods to evaluate their performance. The proposed probabilistic detection methods perform significantly better than the acoustic event detection methods available in literature. A cell phone-based alert system for an assisted living environment is also discussed as a future scope of the proposed method. The performance evaluation is presented as number of successful cell phone transactions. The results are motivating enough for the system to be used in practice.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

References

  1. P.K. Atrey, A. Cavallaro, M.S. Kankanhalli, Intelligent Multimedia Surveillance (Springer, Berlin, 2013)

  2. P.K. Atrey, M. Maddage, M.S. Kankanhalli, in Audio based event detection for multimedia surveillance, in Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, vol. 5, (IEEE, 2006), pp. V–V

  3. C.-C. Chang, C.-J. Lin, LIBSVM: a library for support vector machines. ACM Trans. Intell. Syst. Technol. 2, 27:1–27:27 (2011)

    Article  Google Scholar 

  4. C. Clavel, T. Ehrette, G. Richard, Events detection for an audio-based surveillance system. in Proceedings of the IEEE International Conference on Multimedia and Expo, (ICME 2005), (2005) pp. 1306–1309

  5. M. Davy, F. Desobry, A. Gretton, C. Doncarli, An online support vector machine for abnormal events detection. Signal Process. 86(8), 2009–2025 (2006). Special Section: Advances in Signal Processing-assisted Cross-layer Designs

    Article  MATH  Google Scholar 

  6. B. Fuglede, F. Topsoe, Jensen–Shannon divergence and Hilbert space embedding. in Proceedings of the International Symposium on Information Theory, (ISIT 2004), (2004) p. 31

  7. J.H.L. Hansen, S. Patil, Speech under stress: analysis, modelling and recognition. in ed. by C. Mller, Speaker Classification I Ser. Lecture Notes in Computer Science, vol. 4343, (Springer, Berlin, 2007) pp. 108–137

  8. J.H.L. Hansen, S.E. Bou-Ghazale, R. Sarikaya, B. Pellom, Getting started with SUSAS: a speech under simulated and actual stress database. Eurospeech 97(4), 1743–1746 (1997)

    Google Scholar 

  9. A. Harma, M.F. McKinney, J. Skowronek, Automatic surveillance of the acoustic activity in our living environment. in Proceedings of the IEEE International Conference on Multimedia and Expo, (ICME 2005) (2005) 4 pp.

  10. Z. Huang, Extensions to the k-means algorithm for clustering large data sets with categorical values. Data Mining Knowl. Disc. 2(3), 283–304 (1998)

    Article  Google Scholar 

  11. K. Hwang, S.-Y. Lee, Environmental audio scene and activity recognition through mobile-based crowdsourcing. IEEE Trans. Consum. Electron. 58(2), 700–705 (2012)

    Article  MathSciNet  Google Scholar 

  12. K. Kashino, T. Kurozumi, H. Murase, A quick search method for audio and video signals based on histogram pruning. IEEE Trans. Multimedia 5(3), 348–357 (2003)

    Article  Google Scholar 

  13. J. Kennedy, M.P. Quine, The total variation distance between the binomial and poisson distributions. Ann. Probab. 17(1), 396–400 (1989)

    Article  MATH  MathSciNet  Google Scholar 

  14. J. Lin, Divergence measures based on the Shannon entropy. IEEE Trans. Inform. Theory 37(1), 145–151 (1991)

    Article  MATH  MathSciNet  Google Scholar 

  15. L. Lu, F. Ge, Q. Zhao, Y. Yan, A SVM-based audio event detection system. in Proceedings of the International Conference on Electrical and Control Engineering (ICECE 2010) (2010) pp. 292–295

  16. G. McLachlan, T. Krishnan, The EM Algorithm and Extensions (Wiley, Hoboken, 2007)

    Google Scholar 

  17. S. Ntalampiras, I. Potamitis, N. Fakotakis, Probabilistic novelty detection for acoustic surveillance under real-world conditions. IEEE Trans. Multimedia 13(4), 713–719 (2011)

    Article  Google Scholar 

  18. D. Reynolds, R.C. Rose, Robust text-independent speaker identification using Gaussian mixture speaker models. IEEE Trans. Speech Audio Process. 3(1), 72–83 (1995)

    Article  Google Scholar 

  19. B. Schlkopf, R. Williamson, A. Smola, J. Shawe-Taylor, J. Platt, Support vector method for novelty detection. Adv. Neural Inform. Process. Syst. 12, 582–588 (1999)

    Google Scholar 

  20. G. Valenzise, L. Gerosa, M. Tagliasacchi, E. Antonacci, A. Sarti, Scream and gunshot detection and localization for audio-surveillance systems. in Proceedings of the IEEE Conference on Advanced Video and Signal Based Surveillance, (AVSS 2007) (2007) pp. 21–26

  21. S. Xiang, J.J. Huang, Based audio watermarking against time-scale modification and cropping attacks. IEEE Trans. Multimedia 9(7), 1357–1372 (2007)

    Article  Google Scholar 

  22. L. Xie, A. Natsev, X. He, J. Kender, M. Hill, J.R. Smith, Tracking large-scale video remix in real-world events. IEEE Trans. Multimedia 15(6), 1244–1254 (2013)

    Article  Google Scholar 

  23. X. Zhang, X. Yin, Z. Yu, Robust audio watermarking algorithm based on histogram specification. in Proceedings of the International Conference on Intelligent Information Hiding and Multimedia Signal Processing, IIHMSP ’08 (2008) pp. 163–166

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Karan Nathwani.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Reddy, M.S.S., Nathwani, K. & Hegde, R.M. Probabilistic Detection Methods for Acoustic Surveillance Using Audio Histograms. Circuits Syst Signal Process 34, 1977–1992 (2015). https://doi.org/10.1007/s00034-014-9942-y

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00034-014-9942-y

Keywords

Navigation