Skip to main content

On the Optimality of Spatial Attention for Object Detection

  • Conference paper
Attention in Cognitive Systems (WAPCV 2008)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 5395))

Included in the following conference series:

Abstract

Studies on visual attention traditionally focus on its physiological and psychophysical nature [16,18,19], or its algorithmic applications [1,9,21]. We here develop a simple, formal mathematical model of the advantage of spatial attention for object detection, in which spatial attention is defined as processing a subset of the visual input, and detection is an abstraction with certain failure characteristics. We demonstrate that it is suboptimal to process the entire visual input given prior information about target locations, which in practice is almost always available in a video setting due to tracking, motion, or saliency. This argues for an attentional strategy independent of computational savings: no matter how much computational power is available, it is in principle better to dedicate it preferentially to selected portions of the scene. This suggests, anecdotally, a form of environmental pressure for the evolution of foveated photoreceptor densities in the retina. It also offers a general justification for the use of spatial attention in machine vision.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Tsotsos, J.K., Culhane, S.M., Kei Wai, W.Y., Lai, Y., Davis, N.: Modeling visual attention via selective tuning. Artificial Intelligence (1995)

    Google Scholar 

  2. Amit, Y., Geman, D.: A Computational Model for Visual Selection. Neural Computation (1999)

    Google Scholar 

  3. Yu, A., Dayan, P.: Inference, Attention, and Decision in a Bayesian Neural Architecture. In: Proc. Neural Information Processing Systems (NIPS) (2004)

    Google Scholar 

  4. Bonaiuto, J., Itti, L.: Combining attention and recognition for rapid scene analysis. In: Proc. IEEE-CVPR Workshop on Attention and Performance in Computer Vision (WAPCV 2005) (2005)

    Google Scholar 

  5. Rutishauser, U., Walther, D., Koch, C., Perona, P.: Is attention useful for object recognition? In: Proc. International Conference on Computer Vision and Pattern Recognition (CVPR) (2004)

    Google Scholar 

  6. Miau, F., Papageorgiou, C.S., Itti, L.: Neuromorphic algorithms for computer vision and attention. In: Proceedings of Annual International Symposium on Optical Science and Technology (SPIE) (2001)

    Google Scholar 

  7. Moosmann, F., Larlus, D., Jurie, F.: Learning Saliency Maps for Object Categorization. In: ECCV International Workshop on The Representation and Use of Prior Knowledge in Vision (2006)

    Google Scholar 

  8. Koch, C., Ullman, S.: Shifts in selective visual attention: towards the underlying neural circuitry. Hum. Neurobiol. (1985)

    Google Scholar 

  9. Itti, L., Koch, C.: Computational modeling of visual attention. Nature Reviews Neuroscience (2001)

    Google Scholar 

  10. Ye, Y., Tsotos, J.K.: Where to Look Next in 3D Object Search. In: Proc. of Internat. Symp. on Comp. Vis. (1995)

    Google Scholar 

  11. http://cbcl.mit.edu/software-datasets/streetscenes/

  12. http://labelme.csail.mit.edu/

  13. Lazebnik, S., Schmid, C., Ponce, J.: Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories. In: Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2006)

    Google Scholar 

  14. Lowe, D.G.: Distinctive Image Features from Scale-Invariant Keypoints. International Journal of Computer Vision (2004)

    Google Scholar 

  15. Waydo, S., Kraskov, A., Quian Quiroga, R., Fried, I., Koch, C.: Sparse Representation in the Human Medial Temporal Lobe. Journal of Neuroscience (2006)

    Google Scholar 

  16. Treisman, A.: How the deployment of attention determines what we see. Visual Cognition (2006)

    Google Scholar 

  17. Viola, P., Jones, M.: Rapid object detection using a boosted cascade of simple features. In: Proc. Computer Vision and Pattern Recognition (CVPR)(2001)

    Google Scholar 

  18. Pashler, H.E.: The Psychology of Attention. MIT Press, Cambridge (1998)

    Google Scholar 

  19. Braun, J., Koch, C., Davis, J.L. (eds.): Visual Attention and Cortical Circuits. MIT Press, Cambridge (2001)

    Google Scholar 

  20. Walther, D., Koch, C.: Modeling attention to salient proto-objects. Neural Networks (2006)

    Google Scholar 

  21. Mitri, S., Frintrop, S., Pervolz, K., Surmann, H., Nuchter, A.: Robust Object Detection at Regions of Interest with an Application in Ball Recognition. In: Proc. of International Conference on Robotics and Automation (ICRA) (2005)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2009 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Harel, J., Koch, C. (2009). On the Optimality of Spatial Attention for Object Detection. In: Paletta, L., Tsotsos, J.K. (eds) Attention in Cognitive Systems. WAPCV 2008. Lecture Notes in Computer Science(), vol 5395. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-00582-4_1

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-00582-4_1

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-00581-7

  • Online ISBN: 978-3-642-00582-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics