Skip to main content

Model-Based Object Recognition

  • Reference work entry
  • First Online:
Computer Vision
  • 441 Accesses

Synonyms

Object models; Object parameterizations; Object representations; Visual patterns

Related Concepts

Human Pose Estimation; Object Class Recognition (Categorization); Object Detection

Definition

Model-based object recognition addresses the problem of recognizing objects from images by means of a suitable mathematical model that is used to describe the object.

Background

In model-based object recognition, an object model is typically defined so as to capture object’s geometrical and appearance properties at the appropriate level of specificity. For instance, an object model can be designed to recognize a generic “face” as opposed to “someone’s face” or vice versa. In the former case, which is often referred to as the object categorization problem, the main challenge is to design models that are capable of retaining key visual properties for representing an object category, such as a “face,” at the appropriate level of abstraction. Such models can be then used to recognize novel...

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 649.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 899.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Binford T (1971) Visual perception by computer. IEEE conference on systems and control

    Google Scholar 

  2. Marr D (1978) Representing visual information. Computer vision systems

    Google Scholar 

  3. Palmer S, Rosch E, Chase P (1981) Canonical perspective and the perception of objects. Atten Perform 9:135–151

    Google Scholar 

  4. Tarr M, Pinker S (1989) Mental rotation and orientation-dependence in shape recognition. Cogn Psychol 21: 233–282

    Article  Google Scholar 

  5. Poggio T, Edelman S (1990) A neural network that learns to recognize three-dimensional objects. Nature 343:263–266

    Article  Google Scholar 

  6. Ullman S, Basri R (1991) Recognition by linear combinations of models. TPAMI 13:992–1006

    Article  Google Scholar 

  7. Koenderink J, Doorn AV (1979) The internal representation of solid shape with respect to vision. Biol Cybern 32: 211–216

    Article  MATH  Google Scholar 

  8. Huttenlocher DP, Ullman S (1987) Object recognition using alignment. In: ICCV

    Google Scholar 

  9. Lowe D, Binford T (1985) The recovery of three-dimensional structure from image curves. TPAMI 7: 320–326

    Article  Google Scholar 

  10. Lowe DG (1999) Object recognition from local scale-invariant features. In: ICCV

    Book  Google Scholar 

  11. Rothganger F, Lazebnik S, Schmid C, Ponce J (2003) 3D object modeling and recognition using affine-invariant patches and multi-view spatial constraints. In: IEEE conference on computer vision pattern recognition (CVPR)

    Google Scholar 

  12. Brown M, Lowe DG (2005) Unsupervised 3D object recognition and reconstruction in unordered datasets. In: 3DIM

    Google Scholar 

  13. Ferrari V, Tuytelaars T, Gool L (2006) Simultaneous object recognition and segmentation from single or multiple model views. IJCV 67:159–188

    Article  Google Scholar 

  14. Mikolajczyk K, Schmid C (2002) An affine invariant interest point detector. In: European conference on computer vision (ECCV)

    Google Scholar 

  15. Dance C, Willamowski J, Fan L, Bray C, Csurka G (2004) Visual categorization with bags of keypoints. In: European conference on computer vision (ECCV) international workshop on statistical learning in computer vision, Prague

    Google Scholar 

  16. Grauman K, Darrell T (2005) The pyramid match kernel: Discriminative classification with sets of image features. In: ICCV

    Google Scholar 

  17. Fei-Fei L, Fergus R, Perona P (2004) Learning generative visual models from few training examples: an incremental bayesian approach tested on 101 object categories. In: IEEE conference on computer vision pattern recognition (CVPR)

    Google Scholar 

  18. Fergus R, Perona P, Zisserman A (2003) Object class recognition by unsupervised scale-invariant learning. In: IEEE conference on computer vision pattern recognition (CVPR)

    Google Scholar 

  19. Felzenszwalb PF, Huttenlocher DP (2005) Pictorial structures for object recognition. IJCV 61(1):55–79

    Article  Google Scholar 

  20. Leibe B, Leonardis A, Schiele B (2004) Combined object categorization and segmentation with an implicit shape model. In: European conference on computer vision (ECCV) workshop on statistical learning in computer vision

    Google Scholar 

  21. Lazebnik S, Schmid C, Ponce J (2006) Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories. In: IEEE conference on computer vision pattern recognition (CVPR)

    Google Scholar 

  22. Savarese S, Winn J, Criminisi A (2006) Discriminative object class models of appearance and shape by correlatons. In: IEEE conference on computer vision pattern recognition (CVPR)

    Google Scholar 

  23. Dempster A, Laird N, Rubin D (1977) Maximum likelihood from incomplete data via the em algorithm. J R Stat Soc 39:1–38

    MathSciNet  MATH  Google Scholar 

  24. Koller D, Friedman N (2009) Probabilistic graphical models: principles and techniques. MIT

    Google Scholar 

  25. Wainwright MJ, Jordan MI (2008) Graphical models, exponential families, and variational inference. Found Trends Mach Learn 1:1–305

    Article  MATH  Google Scholar 

  26. Lafferty J, McCallum A, Pereira F (2001) Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In: ICML

    Google Scholar 

  27. Ballard D (1981) Generalizing the hough transform to detect arbitrary shapes. Pattern Recognit 13:111–122

    Article  MATH  Google Scholar 

  28. Fischler MA, Bolles RC (1981) Random sample consensus: A paradigm for model fitting with applications to image analysis and automated cartography. Commun ACM 24:381–395

    Article  MathSciNet  Google Scholar 

  29. Schneiderman H, Kanade T (2000) A statistical approach to 3D object detection applied to faces and cars. In: IEEE conference on computer vision pattern recognition (CVPR)

    Google Scholar 

  30. Weber M, Einhaeuser W, Welling M, Perona P (2000) Viewpoint-invariant learning and detection of human heads. In: International conference on automatic face and gesture recognition

    Book  Google Scholar 

  31. Savarese S, Fei-Fei L (2007) 3D generic object categorization, localization and pose estimation. In: ICCV

    Book  Google Scholar 

  32. Su H, Sun M, Fei-Fei L, Savarese S (2009) Learning a dense multi-view representation for detection, viewpoint classification and synthesis of object categories. In: ICCV

    Google Scholar 

  33. Hoiem D, Savarese S (2011) Representations and techniques for 3D object recognition and scene interpretation. In: Synthesis lecture on artificial intelligence and machine learning. Morgan Claypool, San Rafael

    Google Scholar 

  34. Thomas A, Ferrari V, Leibe B, Tuytelaars T, Schiele B, Goo LV (2006) Towards multi-view object class detection. In: IEEE conference on computer vision pattern recognition (CVPR)

    Google Scholar 

  35. Bowyer K, Dyer CR (1990) Aspect graphs: An introduction and survey of recent results. Int J Imaging Syst Technol 2:315–328

    Article  Google Scholar 

  36. Sun M, Bradski G, Xu BX, Savarese S (2010) Depth-encoded hough voting for joint object detection and shape recovery. In: European conference on computer vision (ECCV)

    Google Scholar 

  37. Hoiem D, Rother C, Winn J (2007) 3D layoutcrf for multi-view object class recognition and segmentation. In: IEEE conference on computer vision pattern recognition (CVPR)

    Google Scholar 

  38. Liebelt J, Schmid C (2010) Multi-view object class detection with a 3D geometric model. In: IEEE conference on computer vision pattern recognition (CVPR)

    Google Scholar 

  39. Pepik B, Stark M, Gehler P, Schiele B (2012) Teaching 3D geometry to deformable part models. In: IEEE conference on computer vision pattern recognition (CVPR)

    Google Scholar 

  40. Arie-Nachimson M, Basri R (2009) Constructing implicit 3D shape models for pose estimation. In: ICCV

    Book  Google Scholar 

  41. Xiang Y, Savarese S (2012) Estimating the aspect layout of object categories. In: IEEE conference on computer vision pattern recognition (CVPR)

    Google Scholar 

  42. Deng J, Dong W, Socher R, Li LJ, Li K, Fei-Fei L (2009) Imagenet: a large-scale hierarchical image database. In: IEEE conference on computer vision pattern recognition (CVPR)

    Google Scholar 

  43. Yao B, Bradski G, Fei-Fei L (2012) A codebook-free and annotation-free approach for fine-grained image categorization. In: IEEE conference on computer vision pattern recognition (CVPR)

    Google Scholar 

  44. Duan K, Parikh D, Crandall D, Grauman K (2012) Discovering localized attributes for fine-grained recognition. In: IEEE conference on computer vision pattern recognition (CVPR)

    Google Scholar 

  45. Perona P (2010) Visions of a visipedia. Proc IEEE 98: 1526–1534

    Article  Google Scholar 

  46. Farhadi A, Endres I, Hoiem D, Forsyth D (2009) Describing objects by their attributes. In: IEEE conference on computer vision pattern recognition (CVPR), Miami

    Google Scholar 

  47. Ferrari V, Zisserman A (2007) Learning visual attributes. In: NIPS

    Google Scholar 

  48. Lampert CH, Nickisch H, Harmeling S (2009) Learning to detect unseen object classes by between-class attribute transfer. In: IEEE conference on computer vision pattern recognition (CVPR)

    Google Scholar 

  49. Lee H, Grosse R, Ranganath R, Ng AY (2009) Convolutional deep belief networks for scalable unsupervised learning of hierarchical representations. In: ICML

    Book  Google Scholar 

  50. Yann LeCun FJH, Bottou L (2004) Learning methods for generic object recognition with invariance to pose and lighting. In: IEEE conference on computer vision pattern recognition (CVPR)

    Google Scholar 

  51. Arbelaez P, Maire M, Fowlkes C, Malik J (2011) Contour detection and hierarchical image segmentation. IEEE Trans Pattern Anal Mach Intell 33(5):898–916

    Article  Google Scholar 

  52. Zhu L, Chen Y, Yuille A (2006) Unsupervised learning of a probabilistic grammar for object detection and parsing. In: NIPS

    Google Scholar 

  53. Todorovic S, Ahuja N (2008) Unsupervised category modeling, recognition, and segmentation in images. IEEE Trans Pattern Anal Mach Intell 30(12):2158–2174

    Article  Google Scholar 

  54. Felzenszwalb P, Girshick R, McAllester D, Ramanan D (2010) Object detection with discriminatively trained part-based models. TPAMI 32:1627–1645

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Min Sun .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer Science+Business Media New York

About this entry

Cite this entry

Sun, M., Savarese, S. (2014). Model-Based Object Recognition. In: Ikeuchi, K. (eds) Computer Vision. Springer, Boston, MA. https://doi.org/10.1007/978-0-387-31439-6_334

Download citation

Publish with us

Policies and ethics