Abstract
We present an efficient method for estimating the pose of a three-dimensional object. Its implementation is embedded in a computer vision system which is motivated by and based on cognitive principles concerning the visual perception of three-dimensional objects. Viewpoint-invariant object recognition has been subject to controversial discussions for a long time. An important point of discussion is the nature of internal object representations. Behavioral studies with primates, which are summarized in this article, support the model of view-based object representations. We designed our computer vision system according to these findings and demonstrate that very precise estimations of the poses of real-world objects are possible even if only a few number of sample views of an object is available. The system can be used for a variety of applications.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
S. Edelman and H. H. Bülthoff. Orientation Dependence in the Recognition of Familiar and Novel Views of Three-Dimensional Objects. Vision Research, 32(12):2385–2400, 1992.
M. J. Tarr. Orientation Dependence in Three-Dimensional Object Recognition. Ph.D. Thesis, MIT, 1989.
F. Cutzu and S. Edelman. Canonical Views in Object Representation and Recognition. Vision Research, 34:3037–3056, 1994.
N. K. Logothetis, J. Pauls, H. H. Bülthoff, and Poggio T. View-Dependent Object Recognition by Monkeys. Current Biology, 4:401–414, 1994.
M. Wexler, S. M. Kosslyn, and A. Berthoz. Motor processes in mental rotation. Cognition, 68: 77–94, 1998.
N. K. Logothetis, J. Pauls, and Poggio T. Shape Representation in the Inferior Temporal Cortex of Monkeys. Current Biology, 5(5):552–563, 1995.
D. C. Burr, M. C. Morrone, and D. Spinelli. Evidence for Edge and Bar Detectors in Human Vision. Vision Research, 29(4):419–431, 1989.
J. P. Jones and L. A. Palmer. An evaluation of the two-dimensional gabor filter model of simple receptive fields in cat striate cortex. Journal of Neurophysiology, 58(6):1233–1258, 1987.
C. Eckes and J. C. Vorbrüggen. Combining Data-Driven and Model-Based Cues for Segmentation of Video Sequences. In Proc. WCNN96, pages 868–875, 1996.
T. Maurer and C. von der Malsburg. Tracking and Learning Graphs and Pose on Image Sequences of Faces. In Proc. Int. Conf. on Automatic Face-and Gesture-Recognition, pages 176–181, 1996.
V. Chvatal. A Greedy Heuristic for the Set-Covering Problem. Mathematics of Operations Research, 4(3):233–235, 1979.
R. Horaud, B. Conio, O. Leboulleux, and B. Lacolle. An Analytic Solution for the Perspective 4-Point Problem. Computer Vision, Graphics and Image Processing, 47:33–44, 1989.
M. Dhome, M. Richetin, J. Lapreste, and G. Rives. Determination of the Attitude of 3-D Objects from a Single Perspective View. IEEE Transactions on Pattern Analysis and Machine Intelligence, 11(12):1265–1278, 1989.
R. M. Haralick, C. Lee, K. Ottenberg, and M. Nölle. Analysis and Solutions of the Three Point Perspective Pose Estimation Problem. In Proc. of the IEEE Comp. Society Conf. on Computer Vision and Pattern Recognition, pages 592–598, 1991.
D. G. Lowe. Three-Dimensional Object Recognition from Single Two-Dimensional Images. Artificial Intelligence, 31:355–395, 1987.
J. Yuan. A General Photogrammetric Method for Determining Object Position and Orientation. IEEE Journal of Robotics and Automation, 5(2):129–142, 1989.
M. Pötzsch. Die Behandlung der Wavelet-Transformation von Bildern in der Nähe von Objektkanten. Technical Report IRINI 94-04, Institut für Neuroinformatik, Ruhr-Universität Bochum, Germany, 1994.
S. Ullman and R. Basri. Recognition by Linear Combinations of Models. IEEE Transactions on Pattern Analysis and Machine Intelligence, 13(10):992–1006, 1991.
G. Peters and C. von der Malsburg. View Reconstruction by Linear Combination of Sample Views. In Proc. BMVC 2001, pages 223–232, 2001.
M. Lades, J. C. Vorbrüggen, J. Buhmann, J. Lange, C. von der Malsburg, R. P. Würtz, and W. Konen. Distortion Invariant Object Recognition in the Dynamic Link Architecture. IEEE Trans. Comp., 42: 300–311, 1993.
G. Peters. A View-Based Approach to Three-Dimensional Object Perception. Ph.D. Thesis, Shaker Verlag, Aachen, Germany, 2002.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Peters, G. (2003). Efficient Pose Estimation Using View-Based Object Representations. In: Crowley, J.L., Piater, J.H., Vincze, M., Paletta, L. (eds) Computer Vision Systems. ICVS 2003. Lecture Notes in Computer Science, vol 2626. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36592-3_2
Download citation
DOI: https://doi.org/10.1007/3-540-36592-3_2
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-00921-4
Online ISBN: 978-3-540-36592-1
eBook Packages: Springer Book Archive