Abstract
Modern elevators are equipped with closed-circuit television (CCTV) cameras to record videos for post-incident investigation rather than providing proactive event monitoring. While there are some attempts at automated video surveillance, events such as urinating, vandalism, and crimes that involved vulnerable targets may not exhibit significant visual cues. On contrary, such events are more discerning from audio cues. In this work, we propose a hierarchical audio-visual surveillance framework for elevators. Audio analytic module acts as the front line detector to monitor for such events. This means audio cue is the main determining source to infer the event occurrence. The secondary inference process involves queries to visual analytic module to build-up the evidences leading to event detection. We validate the performance of our system at a residential trial site and the initial results are promising.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Song, T., Han, D., Ko, H.: Robust background subtraction using data fusion for real elevator scene. In: Proc. of IEEE International Conference on Advanced Video and Signal-Based Surveillance (AVSS), pp. 392–397 (2011)
Shao, H., Li, L., Xiao, P., Leung, M.K.H.: ELEVIEW: an active elevator video surveillance system. In: Proc. of Workshop on Human Motion (HUMO), pp. 67–72 (2000)
Kage, H., Seki, M., Sumi, K., Tanaka, K., Kyuma, K.: Pattern recognition for video surveillance and physical security. In: Proc. of International Conference on Instrumentation, Control, Information Technology and System Integration (SICE), pp. 1823–1828 (2007)
Lee, Y., Song, T., Kim, H., Han, D.K., Ko, H.: Hostile intent and behaviour detection in elevators. In: 4th International Conference on Imaging for Crime Detection and Prevention, ICDP (2011)
Schofield, A.J., Stonham, T.J., Mehta, P.A.: Automated people counting to aid lift control. Automation in Construction 6, 437–445 (1997)
Zuniga, M., Bremond, F., Thonnat, M.: Fast and reliable object classification in video based on a 3d generic model. In: Proc. of IET International Conference on Visual Information Engineering (VIE), pp. 433–440 (2006)
Radhakrishnan, R., Divakaran, A.: Systematic acquisition of audio classes for elevator surveillance. In: Proc. of SPIE, pp. 64–71 (2005)
Radhakrishnan, R., Divakaran, A., Smaragdis, P.: Audio analysis for surveillance applications. In: Proc. of IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (ASPAA), pp. 158–161 (2005)
Radhakrishnan, R., Divakaran, A.: Generative process tracking for audio analysis. In: Proc. of International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. v1–v4 (2006)
Kim, W.-Y., Park, S.-G., Lim, M.-C.: The intelligent video and audio recognition black-box system of the elevator for the disaster and crime prevention. In: Kim, T.-H., Adeli, H., Robles, R.J., Balitanas, M. (eds.) ACN 2011. CCIS, vol. 199, pp. 245–252. Springer, Heidelberg (2011)
Scharstein, D., Szeliski, R.: A taxonomy and evaluation of dense two-frame stereo correspondence algorithms. International Journal of Computer Vision 47, 7–42 (2002)
Heikkilä, M., Pietikäinen, M.: A texture-based method for modeling the background and detecting moving objects. IEEE Trans. Pattern Anal. Machine Intell. 28, 657–662 (2006)
Nie, Y., Ma, K.K.: Adaptive rood pattern search for fast block-matching motion estimation. IEEE Trans. Image Processing 11(12), 1442–1448 (2002)
Laptev, I., Lindeberg, T.: Space-time interest points. In: Proc. of International Conference on Computer Vision (ICCV), pp. 432–439 (2003)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Chua, T.W., Leman, K., Gao, F. (2014). Hierarchical Audio-Visual Surveillance for Passenger Elevators. In: Gurrin, C., Hopfgartner, F., Hurst, W., Johansen, H., Lee, H., O’Connor, N. (eds) MultiMedia Modeling. MMM 2014. Lecture Notes in Computer Science, vol 8326. Springer, Cham. https://doi.org/10.1007/978-3-319-04117-9_5
Download citation
DOI: https://doi.org/10.1007/978-3-319-04117-9_5
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-04116-2
Online ISBN: 978-3-319-04117-9
eBook Packages: Computer ScienceComputer Science (R0)