Model-based invariant functions and their use for recognition

Weinshall, Daphna

doi:10.1007/3-540-58240-1_19

Model-based invariant functions and their use for recognition

Daphna Weinshall¹

Recovery
Conference paper
First Online: 01 January 2005

303 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 825))

Abstract

Using three dimensional invariant representations, we address the problem of changes in appearance that result from a change in camera orientation (or change of viewpoint). This approach is based on a Euclidean invariant representation of three dimensional objects, where the metric information is kept using the Gramian of 4 basis points and the affine coordinates of the remaining points, or using the generalized inverse Gramian of all the object points. We describe functions which operate on two dimensional images of three dimensional objects, and which are invariant under changes of viewpoint. These functions can be used to improve and extend various existing recognition approaches, including alignment, linear combination, and indexing. The invariant representation can be computed with a linear algorithm from a sequence of images.

This paper describes research done at IBM T.J. Watson Res. Ctr., Hawthorne, NY.

This is a preview of subscription content, log in via an institution.

Preview

Unable to display preview. Download preview PDF.

References

J.B. Burns, R. Weiss, and E. Riseman. View variation of point-set and line segment features. In Proceedings Image Understanding Workshop, pages 650–659, April 1990.
Google Scholar
D. T. Clemens and D. W. Jacobs. Space and time bounds on indexing 3-D models from 2-D images. IEEE Transactions on Pattern Analysis and Machine Intelligence, 13(10):1007–1017, 1991.
Google Scholar
O. Faugeras. What can be seen in three dimensions with an uncalibrated stereo rig? In Proceedings of the 2nd European Conference on Computer Vision, pages 563–578, Santa Margherita Ligure, Italy, 1992. Springer-Verlag.
Google Scholar
D. Forsyth, J. L. Mundy, A. Zisserman, C. Coelho, A. Heller, and C. Rothwell. Invariant descriptors for 3-D object recognition and pose. IEEE Transactions on Pattern Analysis and Machine Intelligence, 13:971–991, 1991.
Google Scholar
D. P. Huttenlocher and S. Ullman. Object recognition using alignment. In Proceedings of the 1st International Conference on Computer Vision, pages 102–111, London, England, June 1987. IEEE, Washington, DC.
Google Scholar
J. J. Koenderink and A. J. van Doorn. Affine structure from motion. Journal of the Optical Society of America, 8(2):377–385, 1991.
PubMed Google Scholar
Y. Lamdan, J. T. Schwartz, and H. Wolfson. Object recognition by affine invariant matching. In Proceedings IEEE Conf. on Computer Vision and Pattern Recognition, pages 335–344, Ann Arbor, MI, 1988.
Google Scholar
Y. Lamdan and H. Wolfson. Geometric hashing: a general and efficient recognition scheme. In Proceedings of the 2nd International Conference on Computer Vision, pages 238–251, Tarpon Springs, FL, 1988. IEEE, Washington, DC.
Google Scholar
R. Mohan, D. Weinshall, and R. R. Sarukkai. 3D object recognition by indexing structural invariants from multiple views. In Proceedings of the 4th International Conference on Computer Vision, pages 264–268, Berlin, Germany, 1993. IEEE, Washington, DC.
Google Scholar
Y. Moses and S. Ullman. Limitations of non model-based schemes. A.I. Memo No. 1301, Artificial Intelligence Laboratory, Mass. Inst. of Tech., 1991.
Google Scholar
H. S. Sawhney, J. Oliensis, and A. R. Hanson. Description and reconstruction from image trajectories of rotational motion. In Proceedings of the 3rd International Conference on Computer Vision, pages 494–498, Osaka, Japan, 1990. IEEE, Washington, DC.
Google Scholar
A. Shashua. Projective depth: a geometric invariant for 3D reconstruction from two perspective/orthographic views and for visual recognition. In Proceedings of the 4th International Conference on Computer Vision, pages 583–590, Berlin, Germany, 1993. IEEE, Washington, DC.
Google Scholar
C. Tomasi and T. Kanade. Shape and motion from image streams under orthography: a factorization method. International Journal of Computer Vision, 9(2):137–154, 1992.
Google Scholar
S. Ullman. Computational studies in the interpretation of structure and motion: summary and extension. In J. Beck, B. Hope, and A. Rosenfeld, editors, Human and Machine Vision. Academic Press, New York, 1983.
Google Scholar
S. Ullman. Maximizing rigidity: the incremental recovery of 3D structure from rigid and rubbery motion. Perception, 13:255–274, 1984.
PubMed Google Scholar
S. Ullman and R. Basri. Recognition by linear combinations of models. IEEE Transactions on Pattern Analysis and Machine Intelligence, 13:992–1006, 1991.
Google Scholar
D. Weinshall. Model-based invariants for 3D vision. International Journal of Computer Vision, 10(1):27–42, 1993.
Google Scholar
D. Weinshall and R. Basri. Distance metric between 3d models and 2d images for recognition and classification. In Proceedings IEEE Conf. on Computer Vision and Pattern Recognition, New-York City, NY, 1993. IEEE, Washington, DC.
Google Scholar
D. Weinshall and C. Tomasi. Linear and incremental acquisition of invariant shape models from image sequences. In Proceedings of the 4th International Conference on Computer Vision, pages 675–682, Berlin, Germany, 1993. IEEE, Washington, DC.
Google Scholar
A. P. Witkin. Scale-space filtering. In Proceedings IJCAI, pages 1019–1022, 1983.
Google Scholar

Download references

Author information

Authors and Affiliations

Institute of Computer Science, The Hebrew University of Jerusalem, 91904, Jerusalem, Israel
Daphna Weinshall

Authors

Daphna Weinshall
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Joseph L. Mundy Andrew Zisserman David Forsyth

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Weinshall, D. (1994). Model-based invariant functions and their use for recognition. In: Mundy, J.L., Zisserman, A., Forsyth, D. (eds) Applications of Invariance in Computer Vision. AICV 1993. Lecture Notes in Computer Science, vol 825. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-58240-1_19

Download citation

DOI: https://doi.org/10.1007/3-540-58240-1_19
Published: 03 June 2005
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-58240-3
Online ISBN: 978-3-540-48583-4
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics