Article

A reinforcement learning approach to active camera foveation

Authors:

Andrew D. Bagdanov,

Alberto del Bimbo,

Walter Nunziati,

Federico PerniciAuthors Info & Claims

VSSN '06: Proceedings of the 4th ACM international workshop on Video surveillance and sensor networks

Pages 179 - 186

https://doi.org/10.1145/1178782.1178809

Published: 27 October 2006 Publication History

Get Access

Abstract

In this paper we report on techniques for automatically learning foveal sensing strategies for an active pan-tilt-zoom camera. The approach uses reinforcement learning to discover foveal actions maximizing the performance of visual detectors, that are in turn assumed to be highly correlated with the task at hand. In our case,the main goal is to recognize people, hence a frontal face detection module is employed. The system uses reinforcement learning to learn if when and how to foveate on a subject, basedonits previous experience in terms or successful actions in similar situations. An action is successful if it leads to a correct face detection in the high resolution images obtained when the subject is zoomed in. In contrast with existing methods,the proposed approach obviates the need for camera calibration and camera performance modeling. Also, the method does not rely on active tracking of targets. Experimental results show how the system can be deployed in unconstrained surveillance environments, and is capable of learning foveation strategies without requiring extensive a priori information or environmental models. Results also illustrate how the system effectively learns a strategy that allows the camera to foveate only in situations where successful detection is highly likely.

References

[1]

A. D. Bagdanov, A. del Bimbo, and W. Nunziati. Improving evidential quality of surveillance imagery through active face tracking. In Proceedings of ICPR 2006 2006.(to appear).

Digital Library

Google Scholar

[2]

A. D. Bagdanov, A. del Bimbo, and F. Pernici. Acquisition of high-resolution images through on-line saccade sequence planning. In Proceedings of VSSN2005 pages 121--130, 2005.

Digital Library

Google Scholar

[3]

C. J. Costello, C. P. Diehl, A. Banerjee,and H. Fisher. Scheduling an active camera to observe people. In Proceedings of VSSN2004 October 2004.

Digital Library

Google Scholar

[4]

M. Erdem and S. Sclaroff. Look there! predicting where to look for motion in an active camera network. In Proceedings of IEEE Conference on Advanced Video and Signal Based Surveillance (AVSS)2005.

Google Scholar

[5]

L. P. Kaelbling, M. L. Littman, and A. P. Moore. Reinforcement learning: A survey. Journal of Artificial Intelligence Research 4: 237--285, 1996.

Digital Library

Google Scholar

[6]

S.-N. Lim, L. S. Davis, and A. Elgammal. Scalable image-based multi-camera visual surveillance system. In Proceedings IEEE Conference on Advanced Video and Signal Based Surveillance pages 205--212, July 2003.

Digital Library

Google Scholar

[7]

D. W. Murray, K. J. Bradshaw, P. F. McLauchlan, I. D. Reid, and P.Sharkey. Driving saccade to pursuit using image motion. IJCV 16(3): 205--228, 1995.

Digital Library

Google Scholar

[8]

J. Peng and R. J. Williams. Incremental multi-step q-learning. Machine Learning 22(1-3):283--290, March 1996.

Digital Library

Google Scholar

[9]

A. Senior, A. Hampapur, and M. Lu. Acquiring multi-scale images by pan-tilt-zoom control and automatic multi-camera calibration.In Proceedings of the IEEE Workshop on Applications on Computer Vision Breckenridge, Colorado, January 2005.

Digital Library

Google Scholar

[10]

J. N. Tsitsiklis. Asynchronous stochastic approximation and q-learning. Machine Learning 22(1):185--202, May 1994.

Digital Library

Google Scholar

[11]

C. J. C.H. Watkins and P. Dayan. Q-learning. Machine Learning 8(3-4): 279--292, May 1992.

Digital Library

Google Scholar

[12]

X. Zhou, R. Collins, T. Kanade, and P. Metes. A master-slave system to acquire biometric imagery of humans at a distance. In ACM SIGMM 2003 Workshop on Video Surveillance Berkeley, CA, 2003.

Digital Library

Google Scholar

Cited By

View all

Cruciata GLo Presti LCascia M(2021)On the Use of Deep Reinforcement Learning for Visual Tracking: A SurveyIEEE Access10.1109/ACCESS.2021.31086239(120880-120900)Online publication date: 2021
https://doi.org/10.1109/ACCESS.2021.3108623
Lo Presti LLa Cascia M(2019)Deep Motion Model for Pedestrian Tracking in 360 Degrees VideosImage Analysis and Processing – ICIAP 201910.1007/978-3-030-30642-7_4(36-47)Online publication date: 2-Sep-2019
https://doi.org/10.1007/978-3-030-30642-7_4
Xin Wang van de Weem JJonker P(2013)An advanced active vision system imitating human eye movements2013 16th International Conference on Advanced Robotics (ICAR)10.1109/ICAR.2013.6766517(1-6)Online publication date: Nov-2013
https://doi.org/10.1109/ICAR.2013.6766517
Show More Cited By

Index Terms

A reinforcement learning approach to active camera foveation
1. Computing methodologies
  1. Machine learning

Recommendations

Reward Shaping in Episodic Reinforcement Learning
AAMAS '17: Proceedings of the 16th Conference on Autonomous Agents and MultiAgent Systems

Recent advancements in reinforcement learning confirm that reinforcement learning techniques can solve large scale problems leading to high quality autonomous decision making. It is a matter of time until we will see large scale applications of ...
Handling camera movement constraints in reinforcement learning based active object recognition
DAGM'06: Proceedings of the 28th conference on Pattern Recognition

In real world scenes, objects to be classified are usually not visible from every direction, since they are almost always positioned on some kind of opaque plane. When moving a camera selectively around those objects for classifying them in an active ...
Air-to-Ground Active Object Tracking via Reinforcement Learning
Artificial Neural Networks and Machine Learning – ICANN 2023
Abstract
Over the years, active object tracking has emerged as a prominent topic in object tracking. However, most of these methods are unsuitable for tracking ground objects in high-altitude environments. Therefore, the paper proposes an air-to-ground ...

Comments

Information & Contributors

Information

Published In

VSSN '06: Proceedings of the 4th ACM international workshop on Video surveillance and sensor networks

October 2006

230 pages

ISBN:1595934960

DOI:10.1145/1178782

General Chairs:
Jake K. Aggarwal
University of Texas
,
Rita Cucchiara
University of Modena and Reggio Emilia, Italy
,
Program Chair:
Andrea Prati
University of Modena and Reggio Emilia, Italy

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 27 October 2006

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Article

Conference

MM06

Sponsor:

MM06: The 14th ACM International Conference on Multimedia 2006

October 27, 2006

California, Santa Barbara, USA

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

4
Total Citations
View Citations
330
Total Downloads

Downloads (Last 12 months)13
Downloads (Last 6 weeks)1

Reflects downloads up to 02 Mar 2025

Other Metrics

View Author Metrics

Citations

Cited By

View all

Cruciata GLo Presti LCascia M(2021)On the Use of Deep Reinforcement Learning for Visual Tracking: A SurveyIEEE Access10.1109/ACCESS.2021.31086239(120880-120900)Online publication date: 2021
https://doi.org/10.1109/ACCESS.2021.3108623
Lo Presti LLa Cascia M(2019)Deep Motion Model for Pedestrian Tracking in 360 Degrees VideosImage Analysis and Processing – ICIAP 201910.1007/978-3-030-30642-7_4(36-47)Online publication date: 2-Sep-2019
https://doi.org/10.1007/978-3-030-30642-7_4
Xin Wang van de Weem JJonker P(2013)An advanced active vision system imitating human eye movements2013 16th International Conference on Advanced Robotics (ICAR)10.1109/ICAR.2013.6766517(1-6)Online publication date: Nov-2013
https://doi.org/10.1109/ICAR.2013.6766517
Santana PCorreia L(2011)Swarm cognition on off-road autonomous robotsSwarm Intelligence10.1007/s11721-010-0051-75:1(45-72)Online publication date: 4-Jan-2011
https://doi.org/10.1007/s11721-010-0051-7

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Abstract

References

Cited By

Index Terms

Recommendations

Reward Shaping in Episodic Reinforcement Learning

Handling camera movement constraints in reinforcement learning based active object recognition

Air-to-Ground Active Object Tracking via Reinforcement Learning

Comments

Information

Published In

Sponsors

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Contributors

Other Metrics

Bibliometrics

Article Metrics

Other Metrics

Citations

Cited By

Login options

Full Access

View options

PDF

eReader

Share

Share this Publication link

Share on social media

Affiliations