Automatic extraction of deformable part models

Pentland, Alex P.

doi:10.1007/BF00127812

Automatic extraction of deformable part models

Published: March 1990

Volume 4, pages 107–126, (1990)
Cite this article

International Journal of Computer Vision Aims and scope Submit manuscript

Alex P. Pentland¹

446 Accesses
164 Citations
3 Altmetric
Explore all metrics

Abstract

I deseribe a system that first segments an image into parts, and then fits deformable models to range data associated with the image. Models are represented by using modal dynamics applied to volumetric primitives, which significantly improves the computational complexity of both model recovery and subsequent processing. The segmentation procedure uses two simple mechanisms: a filtering operation to produce a large set of potential object parts, followed by a quadratic integer optimization that searches among these part hypotheses to produce a maximum-likelihood estimate of the image's part structure. Once a segmentation has been produced, a volumetric description is obtained by a fitting procedure that minimizes squared error between the range measurements an the model's visible surface. For simple part shapes, it is possible to compute the deformable model's parameters using only the shape of its symmetry axes.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Artificial Intelligence

References

T.O. Binford, “Visual perception by computer,” Proc. IEEE Conf. Systems and Control, Miami, December, 1971.
G.J. Agin and T.O. Binford, “Computer descriptions of curved objects,” IEEE Tran. PAMI C 25 (4):439–449, 1976
Google Scholar
R. Nevatia and T.O. Binford, “Description and recognition of curved objects,” Artificial Intelligence 8 (1):77–98, 1977.
Google Scholar
D. Marr and K. Nishihara, “Representation and recognition of the spatial organization of three-dimensional shapes,” Proc. Roy. Soc.—London B, 200:269–294, 1978.
Google Scholar
D. Ballard, “Reference frames for active vision,” Proc. Ilth Intern. Joint Conf. Artificial Intelligence, pp. 1635–1641, Detroit, August 20–25, 1989.
I. Beiderman, “Human image understanding: Recent research and a theory,” Comp. Vision, Graphics and Image Process. 32:29–73, 1985.
Google Scholar
Jan J. Konderink and Andrea J.van Doorn, “The shape of smooth objects and the way contours end,” Perception 11:129–137, 1982.
Google Scholar
Jan J. Konderink and Andrea J.van Doorn, “The internal representation of solid shape with respect to vision,” Biological Cybernetics 32:211–216, 1979.
Google Scholar
D. Hoffman and W. Richards, “Parts of recognition.” In From Pixels to Predicates, A. Pentland (ed.), Ablex Publishing: Norwood, N.J., 1985.
Google Scholar
M. Leyton, “Perceptual organization as nested control,” Biological Cybernetics 51:141–153, 1984.
Google Scholar
D. Terzopolis, A. Witkin, and M. Kass, “Symmetry-seeking models for 3D object reconstruction,” Proc. Ist Intern. Conf. Computer Vision, pp. 269–276, London, 1987.
R. Bajcsy and F. Solina, “Three-dimensional object representation revisited,” Ist Intern. Conf. Computer Vision, London, June 8–11, 1987.
T. Bolt and A. Gross, “Recovery of superquadrics from depth information,” Proc. AAAI Workshop on Spatial Reasoning and Multi-Sensor Fusion, pp. 128–137, October 1987.
A. Pentland, “Perceptual organization and the representation of natural form,” Artificial Intelligence, 28(2):1–38, 1986.
Google Scholar
A. Pentland, “Recognition by parts,” Ist Intern. Conf. Computer Vision, London, June 8–11, 1987.
J. Rissanen, “Minimum-length description principle,” Encyclopedia of Statistical Sciences, vol. 5, Wiley: New York, pp. 523–527, 1983.
Google Scholar
A. Pentland, “Fractal-based description of natural scenes,” IEEE Trans. PAMI 6(6):661–674, 1984.
Google Scholar
M. Gardiner, “The superellipse: A curve that lies between the ellipse and the rectangle, Scientific American, September 1965.
J. Williams, “The theoretical basis of the discrete element method.” In Numerical Methods in Engineering, Theory, and Application, Balkema Publishers: rotterdam, 1987.
Google Scholar
A. Pentland and J. Williams, “The perception of non-rigid motion: Interference of material properties and force,” Proc. Intern. Joint Conf. Artificial Intelligence, Detroit, August 20–25, 1989.
A. Pentland and J. Williams, “Good vibrations: Modal dynamics for graphies and animation,” Computer Graphics 23(3):215–222, 1989.
Google Scholar
A. Barr, “Global and local deformations of solid primitives,” Computer Graphics 18(3):21–30, 1984.
Google Scholar
D'Arcy Thompson, On Growth and Form, 2d ed. University Press: Cambridge, England, 1942.
Google Scholar
A. Pentland, “A new sense for depth of field,” IEEE Trans. PAMI 9(4):523–531, 1987.
Google Scholar
J.J. Hopfield and D.W. Tank, “neural computation of decisions in optimization problems,” Biological Cybernetics, 52: 141–152, 1985.
Google Scholar
P.J. Burt, “Fast algorithms for estimating local image properties,” Comput. Vision, Graphics, and Image Process. 21:368–382. 1983.
Google Scholar
A. Pentland and J. Williams, “Virtual Manufacturing,” NSF Engineering Design Res. Conf., Amherst, MA, pp. 301–316, June 11–14, 1989.
M. Bove, “Discrete fourier transform based depth-from-focus,” Proc. Opt. Soc. Am. Conf. Image Understanding and Machine Vision, pp. 115–119, N. Falmouth, MA, June 12–14, 1989.
M. Raibert, Legged Robots that Balance. M.I.T. Press: Cambridge, 1986.
Google Scholar
R.A. Hummel and S.W. Zucker, “On the foundations of relaxation labeling processes,” IEEE Trans. PAMI 5(3):267–287, 1983.
Google Scholar
J. Tsotsos, “A complexity level for analysis of vision,” Proc. Ist Intern. Conf. Computer Vision, London, pp. 346–355, June 8–11, 1987.
A. Pentland, “Part segmentation for object recognition,” Neural Computation, 1(1):82–91, 1989.
Google Scholar
Y.G. Leclerc, “Constructing simple stable descriptions for image partitioning,” Intern. J. Comput. Vision. (1):73–102, 1989.

Download references

Author information

Authors and Affiliations

Vision and Modeling Group, The Media Lab, Massachusetts Institute of Technology, Room E15-387, 20 Ames St., 02139, Cambridge, MA
Alex P. Pentland

Authors

Alex P. Pentland
View author publications
You can also search for this author inPubMed Google Scholar

Rights and permissions

Reprints and permissions

About this article

Cite this article

Pentland, A.P. Automatic extraction of deformable part models. Int J Comput Vision 4, 107–126 (1990). https://doi.org/10.1007/BF00127812

Download citation

Issue Date: March 1990
DOI: https://doi.org/10.1007/BF00127812

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Automatic extraction of deformable part models

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

2D-3D Pose Estimation of Heterogeneous Objects Using a Region Based Approach

Parametric Image Segmentation of Humans with Structural Shape Priors

A framework for interpreting, modeling and recognizing human body gestures through 3D eigenpostures

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Keywords

Subscribe and save

Buy Now

Automatic extraction of deformable part models

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

2D-3D Pose Estimation of Heterogeneous Objects Using a Region Based Approach

Parametric Image Segmentation of Humans with Structural Shape Priors

A framework for interpreting, modeling and recognizing human body gestures through 3D eigenpostures

Explore related subjects

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now