Subspace methods for recovering rigid motion I: Algorithm and implementation

Heeger, David J.; Jepson, Allan D.

doi:10.1007/BF00128130

Published: January 1992

International Journal of Computer Vision, Volume 7, pages 95–117 (1992)

David J. Heeger & Allan D. Jepson

Abstract

As an observer moves and explores the environment, the visual stimulation in his/her eye is constantly changing. Somehow he/she is able to perceive the spatial layout of the scene, and to discern his/her movement through space. Computational vision researchers have been trying to solve this problem for a number of years with only limited success. It is a difficult problem to solve because the optical flow field is nonlinearly related to the 3D motion and depth parameters.

Here, we show that the nonlinear equation describing the optical flow field can be split by an exact algebraic manipulation to form three sets of equations. The first set relates the flow field to only the translational component of 3D motion. Thus, depth and rotation need not be known or estimated prior to solving for translation. Once the translation has been recovered, the second set of equations can be used to solve for rotation. Finally, depth can be estimated with the third set of equations, given the recovered translation and rotation.

The algorithm applies to the general case of arbitrary motion with respect to an arbitrary scene. It is simple to compute, and it is plausible biologically. The results reported in this article demonstrate the potential of our new approach, and show that it performs favorably when compared with two other well-known algorithms.
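
The three-stage decomposition described in the abstract can be illustrated with a small numerical sketch. The code below is not the authors' implementation. It assumes the common instantaneous flow model u = p·A(x, y)·T + B(x, y)·Ω with focal length 1, where p is inverse depth, T is translation and Ω is rotation; the particular sign convention for A and B, all function names, the synthetic data, and the tiny candidate list used in place of a real search over translation directions are assumptions made for illustration. Translation is scored by how far the stacked flow lies from the column space of a matrix that depends only on the candidate T, rotation is then found by least squares after projecting out the depth-dependent direction, and inverse depth follows from the remaining translational component.

```python
import numpy as np

def A_mat(x, y):
    """Translational part of the assumed flow model at image point (x, y)."""
    return np.array([[-1.0, 0.0, x],
                     [0.0, -1.0, y]])

def B_mat(x, y):
    """Rotational part of the assumed flow model at image point (x, y)."""
    return np.array([[x * y, -(1.0 + x * x), y],
                     [1.0 + y * y, -x * y, -x]])

def synth_flow(points, T, omega, inv_depth):
    """Generate flow u_i = p_i * A_i T + B_i omega (used only to make test data)."""
    return np.array([p * (A_mat(x, y) @ T) + B_mat(x, y) @ omega
                     for (x, y), p in zip(points, inv_depth)])

def translation_residual(points, flow, T):
    """Squared distance of the stacked flow from the column space of C(T)."""
    n = len(points)
    C = np.zeros((2 * n, n + 3))
    for i, (x, y) in enumerate(points):
        C[2 * i:2 * i + 2, i] = A_mat(x, y) @ T  # one unknown inverse depth per point
        C[2 * i:2 * i + 2, n:] = B_mat(x, y)     # three shared rotation unknowns
    v = flow.ravel()
    coef = np.linalg.lstsq(C, v, rcond=None)[0]
    r = v - C @ coef
    return float(r @ r)

def estimate_translation(points, flow, candidates):
    """Pick the candidate direction with the smallest residual (depth and rotation not needed)."""
    scores = [translation_residual(points, flow, T) for T in candidates]
    return candidates[int(np.argmin(scores))]

def estimate_rotation(points, flow, T):
    """Least-squares rotation from the flow component perpendicular to A_i T."""
    rows, rhs = [], []
    for (x, y), u in zip(points, flow):
        a = A_mat(x, y) @ T
        d = np.array([-a[1], a[0]])  # perpendicular to the translational flow direction
        rows.append(d @ B_mat(x, y))
        rhs.append(d @ u)
    return np.linalg.lstsq(np.array(rows), np.array(rhs), rcond=None)[0]

def estimate_inverse_depth(points, flow, T, omega):
    """Inverse depth (relative to |T|) from the derotated flow along A_i T."""
    out = []
    for (x, y), u in zip(points, flow):
        a = A_mat(x, y) @ T
        out.append(a @ (u - B_mat(x, y) @ omega) / (a @ a))
    return np.array(out)

# Synthetic, noise-free example (all values are made up for illustration).
rng = np.random.default_rng(0)
points = rng.uniform(-1.0, 1.0, size=(20, 2))
inv_depth = rng.uniform(0.2, 1.0, size=20)
T_true = np.array([0.3, -0.2, 0.9])
T_true /= np.linalg.norm(T_true)
omega_true = np.array([0.01, -0.02, 0.015])
u_obs = synth_flow(points, T_true, omega_true, inv_depth)

candidates = [np.array([1.0, 0.0, 0.0]), np.array([0.0, 1.0, 0.0]),
              np.array([0.0, 0.0, 1.0]), T_true]
T_hat = estimate_translation(points, u_obs, candidates)
omega_hat = estimate_rotation(points, u_obs, T_hat)
p_hat = estimate_inverse_depth(points, u_obs, T_hat, omega_hat)
```

In this sketch the translation is recovered only as a direction, since the flow fixes T and inverse depth up to a common scale, and T and -T give the same residual. A practical version would replace the four-element candidate list with a dense sampling of the unit hemisphere and would need to handle noisy flow and points near the focus of expansion, where the depth estimate divides by a small number.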


Author information

Authors and Affiliations

David J. Heeger: NASA-Ames Research Center, Mail Stop 262-2, Moffett Field, CA 94035; Psychology Department, Stanford University, Stanford, CA 94305
Allan D. Jepson: Computer Science Department, University of Toronto, Toronto, Ontario M5S 1A4

About this article

Cite this article

Heeger, D.J., Jepson, A.D. Subspace methods for recovering rigid motion I: Algorithm and implementation. Int J Comput Vision 7, 95–117 (1992). https://doi.org/10.1007/BF00128130

Received: 21 September 1990
Revised: 09 July 1991
Issue Date: January 1992
DOI: https://doi.org/10.1007/BF00128130
