Abstract
Algorithms to perform point-based motion estimation under orthographic and scaled orthographic projection abound in the literature. A key limitation of many existing algorithms is that they operate on the minimum amount of data required, often requiring the selection of a suitable minimal set from the available data to serve as a “local coordinate frame”. Such approaches are extremely sensitive to errors and noise in the minimal set, and forfeit the advantages of using the full data set. Furthermore, attention is seldom paid to the statistical performance of the algorithms.
We present a new framework that allows all available features to be used in the motion computations, without the need to select a frame explicitly. This theory is derived in the context of the affine camera, which preserves parallelism and generalises the orthographic, scaled orthographic and para-perspective models. We define the affine epipolar geometry for two such cameras, giving the fundamental matrix in this case. The noise-resistant computation of the epipolar geometry is discussed, and a statistical noise model is constructed so that confidence in the results can be assessed.
The rigid motion parameters are then determined directly from the epipolar geometry, using the novel rotation representation of Koenderink and van Doorn (1991). The two-view partial motion solution comprises the scale factor between views, the projection of the 3D axis of rotation and the cyclotorsion angle, while the addition of a third view allows the true 3D rotation axis to be computed (up to a Necker reversal). The computed uncertainties in these parameters permit optimal estimates to be obtained over time by means of a linear Kalman filter. Our theory extends work by Huang and Lee (1989), Harris (1990), and Koenderink and van Doorn (1991), and results are given on both simulated and real data.
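As an illustration of why no local coordinate frame has to be chosen, the affine epipolar constraint can be written as a single linear equation in the image coordinates, so that all matches contribute equally to the fit. The symbols below (a, b, c, d, e, the stacked vector r_i and the scatter matrix W) are notation assumed here for the sketch; the abstract itself does not fix them.

```latex
% Two affine cameras viewing a 3D point X (assumed notation):
%   x = M X + t,   x' = M' X + t'
% Eliminating X leaves one linear constraint per match (x, y) <-> (x', y'):
\begin{equation}
  a\,x' + b\,y' + c\,x + d\,y + e = 0 ,
  \qquad
  \mathbf{x}'^{\top} \mathtt{F}_A\, \mathbf{x} = 0 ,
  \quad
  \mathtt{F}_A =
  \begin{bmatrix}
    0 & 0 & a \\
    0 & 0 & b \\
    c & d & e
  \end{bmatrix} ,
\end{equation}
% with x = (x, y, 1)^T and x' = (x', y', 1)^T.
%
% One noise-resistant fit over all n matches is orthogonal regression in the
% 4D space of r_i = (x'_i, y'_i, x_i, y_i)^T, with n = (a, b, c, d)^T:
\begin{equation}
  \min_{\mathbf{n},\,e}\;
  \frac{1}{\lVert \mathbf{n} \rVert^{2}}
  \sum_{i=1}^{n} \bigl( \mathbf{n}^{\top}\mathbf{r}_i + e \bigr)^{2} .
\end{equation}
% The minimiser is the eigenvector of the scatter matrix
\begin{equation}
  \mathtt{W} = \sum_{i=1}^{n}
  (\mathbf{r}_i - \bar{\mathbf{r}})(\mathbf{r}_i - \bar{\mathbf{r}})^{\top}
\end{equation}
% with the smallest eigenvalue, and e = -n^T \bar{r}, where \bar{r} is the
% centroid of the r_i.
```

Only the ratios of a, b, c, d and e matter, so the affine fundamental matrix has four degrees of freedom and four matches in general position are the minimum; additional matches simply tighten the fit.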
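A minimal sketch of two of the steps named above, under stated assumptions: the scale factor is read from the coefficients of an affine epipolar constraint written as a x' + b y' + c x + d y + e = 0 (with the primed image taken as the second view, a convention assumed here), and successive noisy estimates of a slowly varying parameter are fused with a scalar linear Kalman filter. The names `scale_from_epipolar` and `ScalarKalman` are hypothetical and chosen for illustration; this is not the authors' implementation.

```python
import math
from dataclasses import dataclass


def scale_from_epipolar(a: float, b: float, c: float, d: float) -> float:
    """Scale of the second (primed) view relative to the first.

    Assumes the constraint a*x' + b*y' + c*x + d*y + e = 0 arises from two
    scaled orthographic views, so the ratio of the two normal lengths is
    the relative scale.
    """
    return math.sqrt((c * c + d * d) / (a * a + b * b))


@dataclass
class ScalarKalman:
    """Linear Kalman filter for one parameter modelled as (nearly) constant."""

    estimate: float
    variance: float
    process_noise: float = 0.0  # assumed drift variance per time step

    def update(self, measurement: float, measurement_variance: float):
        # Predict: the state model is the identity, so only uncertainty grows.
        self.variance += self.process_noise
        # Correct: weight the new measurement by its relative confidence.
        gain = self.variance / (self.variance + measurement_variance)
        self.estimate += gain * (measurement - self.estimate)
        self.variance *= 1.0 - gain
        return self.estimate, self.variance
```

For example, starting from a vague prior (`ScalarKalman(estimate=0.0, variance=1e6)`) and feeding in per-frame scale estimates together with their computed variances yields a running estimate whose variance shrinks as views accumulate.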
References
Aloimonos, J. 1986. Detection of surface orientation from texture I: The case of planes. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR'86), Florida, pp. 584–593.
Aloimonos, J. and Bandyopadhyay, A. 1985. Perception of structure from motion: lower bound results. Tech. Report 158, Dept. Computer Science, University of Rochester.
Aloimonos, J.Y. 1992. Perspective approximations. Image and Vision Computing, 8(3):179–192.
Arnold, R.D. and Binford, T.O. 1980. Geometrical constraints in stereo vision. In Proceedings S.P.I.E., 238:281–292.
Bar-Shalom, Y. and Fortmann, T.E. 1988. Tracking and Data Association, Academic Press Inc., USA.
Barnard, S.T. and Thompson, W.B. 1980. Disparity analysis of images. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2(4):333–340.
Beardsley, P.A., Zisserman, A., and Murray, D.W. 1993. Projective structure from image sequences. Tech. Report OUEL 1985/93, Dept. Engineering Science, University of Oxford.
Beardsley, P.A., Zisserman, A., and Murray, D.W. 1994. Navigation using affine structure from motion. In J.O. Eklundh (ed.), Proceedings European Conference on Computer Vision (ECCV-94), II:85–96.
Bennett, B.M., Hoffman, D.D., Nicola, J.E., and Prakash, C. 1989. Structure from two orthographic views of rigid motion. Journal of Optical Society of America, 6(7):1052–1069.
Berger, M. 1980. Geometry I, Springer Verlag.
Bookstein, F.L. 1979. Fitting conic sections to scattered data. Computer Graphics and Image Processing, 9:56–71.
Charnley, D., Harris, C., Pike, M., Sparks, E., and Stephens, M. 1988. The DROID 3D vision system: algorithms for geometric integration. Plessey Research, Roke Manor, Technical Note 72/88/N488U.
Chen, H.H. and Huang, T.S. 1991. Using motion from orthographic views to verify 3D point matches. IEEE Transactions on Pattern Analysis and Machine Intelligence, 13(9):872–878.
Demey, S., Zisserman, A., and Beardsley, P. 1992. Affine and projective structure from motion. Proceedings British Machine Vision Conference (BMVC'92), pp. 49–58.
Durrant-Whyte, H.F. 1993. Methods and Systems in Data Fusion, in press.
Faugeras, O.D., Luong, Q.-T., and Maybank, S.J. 1992. Camera self-calibration: theory and experiments. In G. Sandini (ed.), Proceedings European Conference on Computer Vision (ECCV-92), pp. 321–334.
Faugeras, O.D. 1992. What can be seen in three dimensions with an uncalibrated stereo rig? In G. Sandini (ed.), Proceedings European Conference on Computer Vision (ECCV-92), pp. 563–578.
Gelb, M. 1974. Applied Optimal Estimation, MIT Press.
Harris, C. 1990. Structure-from-motion under orthographic projection. In Proceedings European Conference on Computer Vision (ECCV-90), pp. 118–123.
Hartley, R.I. 1992. Estimation of relative camera positions for uncalibrated cameras. In Proceedings European Conference on Computer Vision (ECCV-92), pp. 579–587.
Hollinghurst, N. and Cipolla, R. 1993. Uncalibrated stereo hand-eye coordination. In Proceedings British Machine Vision Conference (BMVC'93), Surrey, pp. 389–398.
Hu, X. and Ahuja, N. 1991. Motion estimation under orthographic projection. In IEEE Transactions on Robotics and Automation, 7(6):848–853.
Huang, T.S. and Lee, C.H. 1989. Motion and structure from orthographic projections. IEEE Trans. Pattern Anal. Machine Intell., PAMI-11(5):536–540.
Kanatani, K. 1993. Geometric Computation for Computer Vision, Oxford University Press, UK.
Koenderink, J.J. and van Doorn, A.J. 1991. Affine structure from motion. Journal of Optical Society of America, 8(2):377–385.
Lee, C. and Huang, T. 1990. Finding point correspondences and determining motion of a rigid object from two weak perspective views. Computer Vision, Graphics and Image Processing, 52:309–327.
Longuet-Higgins, H.C. 1981. A computer algorithm for reconstructing a scene from two projections. Nature, 293:133–135.
Longuet-Higgins, H.C. 1991. A method of determining the relation positions of 4 points from 3 perspective projections. In P. Mowforth (ed.), Proceedings British Machine Vision Conference (BMVC'91), pp. 86–94.
Luong, Q.-T., Deriche, R., Faugeras, O., and Papadopoulo, T. 1993. On determining the fundamental matrix: analysis of different methods and experimental results. Tech. Report 1894, INRIA (Sophia Antipolis).
Mundy, J.L. and Zisserman, A. (eds.). 1992. Geometric Invariance in Computer Vision, MIT Press, USA.
Olsen, S.I. 1992. Epipolar line estimation. In G. Sandini (ed.), Proceedings European Conference on Computer Vision (ECCV-92), pp. 307–311.
Pollard, S.B., Mayhew, J.E.W., and Frisby, J.P. 1985. PMF: A Stereo Correspondence Algorithm using a Disparity Gradient Limit. Perception, 14:449–470.
Press, W.H., Flannery, B.P., Teukolsky, S.A., and Vetterling, W.T. 1988. Numerical Recipes in C, Cambridge University Press, USA.
Quan, L. and Mohr, R. 1991. Towards structure from motion for linear features through reference points. IEEE Workshop on Visual Motion, New Jersey.
Reid, I.D. and Murray, D.W. 1993. Tracking foveated corner clusters using affine structure. In Proceedings International Conference on Computer Vision (ICCV-4), Berlin, pp. 76–83.
Reid, I.D. and Murray, D.W. 1994. Active tracking of foveated feature clusters using affine structure. To appear IJCV.
Shapiro, L.S. 1993. Affine Analysis of Image Sequences, Ph.D. thesis, Dept. Engineering Science, Oxford University. Also 1995, Cambridge University Press, UK.
Shapiro, L.S. and Brady, J.M. 1993. Rejecting outliers and estimating errors in an orthogonal regression framework. Tech. Report OUEL 1974/93, Dept. Engineering Science, University of Oxford. Also in Phil. Trans. R. Soc. Lond., A (1995) 350:407–439.
Shapiro, L.S., Wang, H., and Brady, J.M. 1992. A matching and tracking strategy for independently moving objects. In D. Hogg and R. Boyle (eds.), Proceedings British Machine Vision Conference, Leeds, Springer-Verlag, U.K., pp. 306–315.
Shapiro, L.S., Zisserman, A., and Brady, J.M. 1994. Motion from point matches using affine epipolar geometry. In J.O. Eklundh (ed.), Proceedings European Conference on Computer Vision (ECCV-94), II:73–84.
Strang, G. 1988. Linear Algebra and its Applications, 3rd ed., Harcourt Brace Jovanovich Inc., U.S.A.
Thompson, D.W. and Mundy, J.L. 1987. Three dimensional model matching from an unconstrained viewpoint. In IEEE Conference on Robotics and Automation, Raleigh, NC, pp. 208–220.
Tomasi, C. and Kanade, T. 1992. Shape and motion from image streams under orthography: a factorization method. International Journal of Computer Vision, 9(2):137–154.
Torr, P.H.S. 1993. Notes on epipole at infinity. Personal communication.
Torr, P.H.S. and Murray, D.W. 1993. Outlier detection and motion segmentation. In Schenker (ed.), Sensor Fusion VI, SPIE Vol. 2059, Boston, pp. 432–443.
Ullman, S. 1979. The Interpretation of Visual Motion, MIT Press, U.S.A.
Ullman, S. and Basri, R. 1991. Recognition by linear combinations of models. IEEE Transactions on Pattern Analysis and Machine Intelligence, 13(10):992–1006.
Wang, H. and Brady, J.M. 1992. Corner detection: some new results. IEE Colloquium Digest of System Aspects of Machine Perception and Vision, pp. 1.1–1.4. London: IEE.
Weinshall, D. and Tomasi, C. 1993. Linear and incremental acquisition of invariant shape models from image sequences. In Proceedings International Conference on Computer Vision (ICCV-4), pp. 675–682.
Weinshall, D. 1993. Model-based invariants for 3-D vision. International Journal of Computer Vision, 10(1):27–42.
Weng, J., Huang, T.S., and Ahuja, N. 1989. Motion and structure from two perspective views: algorithms, error analysis and error estimation. IEEE Trans. Pattern Anal. Machine Intell., PAMI-11(5):451–476.
Weng, J., Ahuja, N., and Huang, T.S. 1993. Optimal motion and structure estimation. IEEE Trans. Pattern Anal. Machine Intell., PAMI-15(9):864–884.
Xu, G., Nishimura, E., and Tsuji, S. 1993. Image correspondence and segmentation by epipolar lines: theory, algorithm and applications. Technical Report, Dept. Systems Engineering, Osaka University.
Zisserman, A. 1992. Notes on geometric invariance in vision: BMVC'92 tutorial, Leeds University.
Cite this article
Shapiro, L.S., Zisserman, A. & Brady, M. 3D Motion recovery via affine Epipolar geometry. Int J Comput Vision 16, 147–182 (1995). https://doi.org/10.1007/BF01539553
DOI: https://doi.org/10.1007/BF01539553