Skip to main content

Object Detection

  • Reference work entry
  • First Online:
Computer Vision

Definition

Object detection involves detecting instances of objects from a particular class in an image.

Background

The goal of object detection is to detect all instances of objects from a known class, such as people, cars, or faces in an image. Typically, only a small number of instances of the object are present in the image, but there is a very large number of possible locations and scales at which they can occur and that need to somehow be explored.

Each detection is reported with some form of poseinformation. This could be as simple as the location of the object, a location and scale, or the extent of the object defined in terms of a bounding box. In other situations, the pose information is more detailed and contains the parameters of a linear or nonlinear transformation. For example, a face detector may compute the locations of the eyes, nose, and mouth, in addition to the bounding box of the face. An example of a bicycle detection that specifies the locations of certain...

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 649.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 899.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Amit Y (2002) 2d object detection and recognition: models, algorithms and networks. MIT Press, Cambridge

    Google Scholar 

  2. Felzenszwalb P, Huttenlocher D (2005) Pictorial structures for object recognition. Int J Comput Vis 61(1):55–79

    Article  Google Scholar 

  3. . Fergus R, Perona P, Zisserman A (2003) Object class recognition by unsupervised scale-invariant learning. In: IEEE conference on computer vision pattern recognition (CVPR 2003), Madison

    Google Scholar 

  4. Amit Y, Trouvé A (2007) POP: patchwork of parts models for object recognition. Int J Comput Vis 75(2):267–282

    Article  Google Scholar 

  5. . Jin Y, Geman S (2006) Context and hierarchy in a probabilistic image model. In: IEEE conference on computer vision pattern recognition (CVPR 2006), New York

    Google Scholar 

  6. Rowley HA, Baluja S, Kanade T (1998) Neural network-based face detection. IEEE Trans Pattern Anal Mach Intell 20(1):23–38

    Article  Google Scholar 

  7. Viola P, Jones MJ (2004) Robust real time face detection. Int J Comput Vis 57(2):137–154

    Article  Google Scholar 

  8. Fleuret F, Geman D (2001) Coarse-to-fine face detection. Int J Comput Vis 41(1–2):85–107

    Article  MATH  Google Scholar 

  9. . Dalal N, Triggs B (2005) Histograms of oriented gradients for human detection. In: IEEE conference on computer vision pattern recognition (CVPR 2005), San Diego

    Google Scholar 

  10. Felzenszwalb P, Girshick R, McAllester D, Ramanan D (2010) Object detection with discriminatively trained part based models. IEEE Trans Pattern Anal Mach Intell 32(9):1627–1645

    Article  Google Scholar 

  11. Riesenhuber M, Poggio T (2000) Models of object recognition. Nat Neurosci 3:1199–1204. Supplement

    Article  Google Scholar 

  12. Lampert C, Blaschko M, Hofmann T (2009) Efficient subwindow search: a branch and bound framework for object localization. IEEE Trans Pattern Anal Mach Intell 31(12):2129–2142

    Article  Google Scholar 

  13. Lowe D (2004) Distinctive image features from scale-invariant keypoints. Int J Comput Vis 60(2):91–110

    Article  Google Scholar 

  14. Chang LB, Jin Y, Zhang W, Borenstein E, Geman S (2011) Context computation, and optimal roc performance in hierarchical models. Int J Comput Vis 93:117–140

    Article  MathSciNet  MATH  Google Scholar 

  15. Ullman S (1996) High-level vision. MIT Press, Cambridge

    MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer Science+Business Media New York

About this entry

Cite this entry

Amit, Y., Felzenszwalb, P. (2014). Object Detection. In: Ikeuchi, K. (eds) Computer Vision. Springer, Boston, MA. https://doi.org/10.1007/978-0-387-31439-6_660

Download citation

Publish with us

Policies and ethics