Skip to main content

Model-based invariant functions and their use for recognition

  • Recovery
  • Conference paper
  • First Online:
  • 303 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 825))

Abstract

Using three dimensional invariant representations, we address the problem of changes in appearance that result from a change in camera orientation (or change of viewpoint). This approach is based on a Euclidean invariant representation of three dimensional objects, where the metric information is kept using the Gramian of 4 basis points and the affine coordinates of the remaining points, or using the generalized inverse Gramian of all the object points. We describe functions which operate on two dimensional images of three dimensional objects, and which are invariant under changes of viewpoint. These functions can be used to improve and extend various existing recognition approaches, including alignment, linear combination, and indexing. The invariant representation can be computed with a linear algorithm from a sequence of images.

This paper describes research done at IBM T.J. Watson Res. Ctr., Hawthorne, NY.

This is a preview of subscription content, log in via an institution.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. J.B. Burns, R. Weiss, and E. Riseman. View variation of point-set and line segment features. In Proceedings Image Understanding Workshop, pages 650–659, April 1990.

    Google Scholar 

  2. D. T. Clemens and D. W. Jacobs. Space and time bounds on indexing 3-D models from 2-D images. IEEE Transactions on Pattern Analysis and Machine Intelligence, 13(10):1007–1017, 1991.

    Google Scholar 

  3. O. Faugeras. What can be seen in three dimensions with an uncalibrated stereo rig? In Proceedings of the 2nd European Conference on Computer Vision, pages 563–578, Santa Margherita Ligure, Italy, 1992. Springer-Verlag.

    Google Scholar 

  4. D. Forsyth, J. L. Mundy, A. Zisserman, C. Coelho, A. Heller, and C. Rothwell. Invariant descriptors for 3-D object recognition and pose. IEEE Transactions on Pattern Analysis and Machine Intelligence, 13:971–991, 1991.

    Google Scholar 

  5. D. P. Huttenlocher and S. Ullman. Object recognition using alignment. In Proceedings of the 1st International Conference on Computer Vision, pages 102–111, London, England, June 1987. IEEE, Washington, DC.

    Google Scholar 

  6. J. J. Koenderink and A. J. van Doorn. Affine structure from motion. Journal of the Optical Society of America, 8(2):377–385, 1991.

    PubMed  Google Scholar 

  7. Y. Lamdan, J. T. Schwartz, and H. Wolfson. Object recognition by affine invariant matching. In Proceedings IEEE Conf. on Computer Vision and Pattern Recognition, pages 335–344, Ann Arbor, MI, 1988.

    Google Scholar 

  8. Y. Lamdan and H. Wolfson. Geometric hashing: a general and efficient recognition scheme. In Proceedings of the 2nd International Conference on Computer Vision, pages 238–251, Tarpon Springs, FL, 1988. IEEE, Washington, DC.

    Google Scholar 

  9. R. Mohan, D. Weinshall, and R. R. Sarukkai. 3D object recognition by indexing structural invariants from multiple views. In Proceedings of the 4th International Conference on Computer Vision, pages 264–268, Berlin, Germany, 1993. IEEE, Washington, DC.

    Google Scholar 

  10. Y. Moses and S. Ullman. Limitations of non model-based schemes. A.I. Memo No. 1301, Artificial Intelligence Laboratory, Mass. Inst. of Tech., 1991.

    Google Scholar 

  11. H. S. Sawhney, J. Oliensis, and A. R. Hanson. Description and reconstruction from image trajectories of rotational motion. In Proceedings of the 3rd International Conference on Computer Vision, pages 494–498, Osaka, Japan, 1990. IEEE, Washington, DC.

    Google Scholar 

  12. A. Shashua. Projective depth: a geometric invariant for 3D reconstruction from two perspective/orthographic views and for visual recognition. In Proceedings of the 4th International Conference on Computer Vision, pages 583–590, Berlin, Germany, 1993. IEEE, Washington, DC.

    Google Scholar 

  13. C. Tomasi and T. Kanade. Shape and motion from image streams under orthography: a factorization method. International Journal of Computer Vision, 9(2):137–154, 1992.

    Google Scholar 

  14. S. Ullman. Computational studies in the interpretation of structure and motion: summary and extension. In J. Beck, B. Hope, and A. Rosenfeld, editors, Human and Machine Vision. Academic Press, New York, 1983.

    Google Scholar 

  15. S. Ullman. Maximizing rigidity: the incremental recovery of 3D structure from rigid and rubbery motion. Perception, 13:255–274, 1984.

    PubMed  Google Scholar 

  16. S. Ullman and R. Basri. Recognition by linear combinations of models. IEEE Transactions on Pattern Analysis and Machine Intelligence, 13:992–1006, 1991.

    Google Scholar 

  17. D. Weinshall. Model-based invariants for 3D vision. International Journal of Computer Vision, 10(1):27–42, 1993.

    Google Scholar 

  18. D. Weinshall and R. Basri. Distance metric between 3d models and 2d images for recognition and classification. In Proceedings IEEE Conf. on Computer Vision and Pattern Recognition, New-York City, NY, 1993. IEEE, Washington, DC.

    Google Scholar 

  19. D. Weinshall and C. Tomasi. Linear and incremental acquisition of invariant shape models from image sequences. In Proceedings of the 4th International Conference on Computer Vision, pages 675–682, Berlin, Germany, 1993. IEEE, Washington, DC.

    Google Scholar 

  20. A. P. Witkin. Scale-space filtering. In Proceedings IJCAI, pages 1019–1022, 1983.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Joseph L. Mundy Andrew Zisserman David Forsyth

Rights and permissions

Reprints and permissions

Copyright information

© 1994 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Weinshall, D. (1994). Model-based invariant functions and their use for recognition. In: Mundy, J.L., Zisserman, A., Forsyth, D. (eds) Applications of Invariance in Computer Vision. AICV 1993. Lecture Notes in Computer Science, vol 825. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-58240-1_19

Download citation

  • DOI: https://doi.org/10.1007/3-540-58240-1_19

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-58240-3

  • Online ISBN: 978-3-540-48583-4

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics