Model-Based Object Recognition

Sun, Min; Savarese, Silvio

doi:10.1007/978-0-387-31439-6_334

Synonyms

Object models; Object parameterizations; Object representations; Visual patterns

Related Concepts

Human Pose Estimation; Object Class Recognition (Categorization); Object Detection

Definition

Model-based object recognition addresses the problem of recognizing objects in images by means of a suitable mathematical model that describes the object.

Background

In model-based object recognition, an object model is typically defined so as to capture the object's geometric and appearance properties at the appropriate level of specificity. For instance, an object model can be designed to recognize a generic "face" as opposed to "someone's face," or vice versa. The former case is often referred to as the object categorization problem. There, the main challenge is to design models that retain the key visual properties of an object category, such as a "face," at the appropriate level of abstraction. Such models can then be used to recognize novel...
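The distinction between a category-level model (a generic "face") and an instance-level model ("someone's face") can be illustrated with a minimal sketch. Everything here is an assumption made for illustration and is not taken from the entry: the ObjectModel class, the fit_category_model helper, the 2-D toy descriptors, and the choice of a prototype vector with a distance tolerance as the "mathematical model." Real systems use far richer geometric and appearance representations.

```python
import math
from dataclasses import dataclass
from typing import Sequence


@dataclass
class ObjectModel:
    """Hypothetical object model: a prototype descriptor plus a tolerance."""
    label: str
    prototype: Sequence[float]  # stand-in for geometric/appearance properties
    tolerance: float            # larger tolerance = more abstract model

    def matches(self, features: Sequence[float]) -> bool:
        # Accept an observation that lies within the tolerance of the prototype.
        return math.dist(self.prototype, features) <= self.tolerance


def fit_category_model(label, examples, slack=1.5):
    """Category-level model: mean of the examples, with a tolerance
    wide enough (times an assumed slack factor) to cover all of them."""
    dim = len(examples[0])
    mean = [sum(e[i] for e in examples) / len(examples) for i in range(dim)]
    radius = max(math.dist(mean, e) for e in examples)
    return ObjectModel(label, mean, slack * radius)


# Toy 2-D descriptors for three faces (made-up numbers).
alice = [1.0, 2.0]
bob = [3.0, 2.0]
carol = [2.0, 4.0]

# Instance-level model: one exemplar, tight tolerance.
instance_model = ObjectModel("Alice's face", alice, tolerance=0.25)
# Category-level model: abstracts over several individuals.
category_model = fit_category_model("face", [alice, bob, carol])

novel_face = [2.0, 2.5]          # a face not seen during model building
alice_again = [1.1, 2.05]        # a slightly different view of Alice

print(category_model.matches(novel_face))   # category model generalizes
print(instance_model.matches(novel_face))   # instance model does not
print(instance_model.matches(alice_again))  # but it still recognizes Alice
```

In this toy setting the level of specificity is controlled by a single tolerance value. That is a deliberate simplification: it only shows that the same observation can be accepted by a category-level model and rejected by an instance-level one.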

Author information

Authors and Affiliations

Department of Electrical and Computer Engineering, University of Michigan, Ann Arbor, MI, USA
Min Sun
Department of Electrical and Computer Engineering, University of Michigan, 1301 Beal Avenue, Room 4120, 48109-2122, Ann Arbor, MI, USA
Silvio Savarese

Corresponding author

Correspondence to Min Sun.

Editor information

Editors and Affiliations

Institute of Industrial Science, The University of Tokyo, Tokyo, Japan
Katsushi Ikeuchi

Cite this entry

Sun, M., Savarese, S. (2014). Model-Based Object Recognition. In: Ikeuchi, K. (eds) Computer Vision. Springer, Boston, MA. https://doi.org/10.1007/978-0-387-31439-6_334

DOI: https://doi.org/10.1007/978-0-387-31439-6_334
Published: 05 February 2016
Publisher Name: Springer, Boston, MA
Print ISBN: 978-0-387-30771-8
Online ISBN: 978-0-387-31439-6
eBook Packages: Computer Science; Reference Module Computer Science and Engineering
