skip to main content
10.1145/1178782.1178809acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
Article

A reinforcement learning approach to active camera foveation

Published: 27 October 2006 Publication History

Abstract

In this paper we report on techniques for automatically learning foveal sensing strategies for an active pan-tilt-zoom camera. The approach uses reinforcement learning to discover foveal actions maximizing the performance of visual detectors, that are in turn assumed to be highly correlated with the task at hand. In our case,the main goal is to recognize people, hence a frontal face detection module is employed. The system uses reinforcement learning to learn if when and how to foveate on a subject, basedonits previous experience in terms or successful actions in similar situations. An action is successful if it leads to a correct face detection in the high resolution images obtained when the subject is zoomed in. In contrast with existing methods,the proposed approach obviates the need for camera calibration and camera performance modeling. Also, the method does not rely on active tracking of targets. Experimental results show how the system can be deployed in unconstrained surveillance environments, and is capable of learning foveation strategies without requiring extensive a priori information or environmental models. Results also illustrate how the system effectively learns a strategy that allows the camera to foveate only in situations where successful detection is highly likely.

References

[1]
A. D. Bagdanov, A. del Bimbo, and W. Nunziati. Improving evidential quality of surveillance imagery through active face tracking. In Proceedings of ICPR 2006 2006.(to appear).
[2]
A. D. Bagdanov, A. del Bimbo, and F. Pernici. Acquisition of high-resolution images through on-line saccade sequence planning. In Proceedings of VSSN2005 pages 121--130, 2005.
[3]
C. J. Costello, C. P. Diehl, A. Banerjee,and H. Fisher. Scheduling an active camera to observe people. In Proceedings of VSSN2004 October 2004.
[4]
M. Erdem and S. Sclaroff. Look there! predicting where to look for motion in an active camera network. In Proceedings of IEEE Conference on Advanced Video and Signal Based Surveillance (AVSS)2005.
[5]
L. P. Kaelbling, M. L. Littman, and A. P. Moore. Reinforcement learning: A survey. Journal of Artificial Intelligence Research 4: 237--285, 1996.
[6]
S.-N. Lim, L. S. Davis, and A. Elgammal. Scalable image-based multi-camera visual surveillance system. In Proceedings IEEE Conference on Advanced Video and Signal Based Surveillance pages 205--212, July 2003.
[7]
D. W. Murray, K. J. Bradshaw, P. F. McLauchlan, I. D. Reid, and P.Sharkey. Driving saccade to pursuit using image motion. IJCV 16(3): 205--228, 1995.
[8]
J. Peng and R. J. Williams. Incremental multi-step q-learning. Machine Learning 22(1-3):283--290, March 1996.
[9]
A. Senior, A. Hampapur, and M. Lu. Acquiring multi-scale images by pan-tilt-zoom control and automatic multi-camera calibration.In Proceedings of the IEEE Workshop on Applications on Computer Vision Breckenridge, Colorado, January 2005.
[10]
J. N. Tsitsiklis. Asynchronous stochastic approximation and q-learning. Machine Learning 22(1):185--202, May 1994.
[11]
C. J. C.H. Watkins and P. Dayan. Q-learning. Machine Learning 8(3-4): 279--292, May 1992.
[12]
X. Zhou, R. Collins, T. Kanade, and P. Metes. A master-slave system to acquire biometric imagery of humans at a distance. In ACM SIGMM 2003 Workshop on Video Surveillance Berkeley, CA, 2003.

Cited By

View all
  • (2021)On the Use of Deep Reinforcement Learning for Visual Tracking: A SurveyIEEE Access10.1109/ACCESS.2021.31086239(120880-120900)Online publication date: 2021
  • (2019)Deep Motion Model for Pedestrian Tracking in 360 Degrees VideosImage Analysis and Processing – ICIAP 201910.1007/978-3-030-30642-7_4(36-47)Online publication date: 2-Sep-2019
  • (2013)An advanced active vision system imitating human eye movements2013 16th International Conference on Advanced Robotics (ICAR)10.1109/ICAR.2013.6766517(1-6)Online publication date: Nov-2013
  • Show More Cited By

Index Terms

  1. A reinforcement learning approach to active camera foveation

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    VSSN '06: Proceedings of the 4th ACM international workshop on Video surveillance and sensor networks
    October 2006
    230 pages
    ISBN:1595934960
    DOI:10.1145/1178782
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 27 October 2006

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. reinforcement learning
    2. video surveillance systems

    Qualifiers

    • Article

    Conference

    MM06
    MM06: The 14th ACM International Conference on Multimedia 2006
    October 27, 2006
    California, Santa Barbara, USA

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)13
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 02 Mar 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2021)On the Use of Deep Reinforcement Learning for Visual Tracking: A SurveyIEEE Access10.1109/ACCESS.2021.31086239(120880-120900)Online publication date: 2021
    • (2019)Deep Motion Model for Pedestrian Tracking in 360 Degrees VideosImage Analysis and Processing – ICIAP 201910.1007/978-3-030-30642-7_4(36-47)Online publication date: 2-Sep-2019
    • (2013)An advanced active vision system imitating human eye movements2013 16th International Conference on Advanced Robotics (ICAR)10.1109/ICAR.2013.6766517(1-6)Online publication date: Nov-2013
    • (2011)Swarm cognition on off-road autonomous robotsSwarm Intelligence10.1007/s11721-010-0051-75:1(45-72)Online publication date: 4-Jan-2011

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media