Abstract
This paper describes a system which robustly estimates motion, and the 3D structure of a rigid environment, as a stereo vision platform moves through it. The system can cope with any camera motion, and any scene structure and is successful even in the presence of large jumps in camera position between the capture of successive image pairs, and when point matching is ambiguous. The system was developed to provide robust obstacle avoidance for a partially sighted person.
The process described attempts to maximise use of the abundant information present in a stereo sequence. Key features include the use of multiple stereo match hypotheses, efficient motion computation from three images, and the use of this motion to ensure reliable matching, and to eliminate multiple stereo matches. Points are reconstructed in 3D space and tracked in a static coordinate frame with a Kalman Filter.
This results in good 3D scene reconstructions. Structure which is impossible to match with certainty is absent, rather than being incorrectly reconstructed. As a result, the system is appropriate for obstacle detection. The results of processing some indoor and outdoor scenes, are given in the paper, and practical issues are highlighted throughout.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Bouguet, J. and Perona, P. 1995. Visual navigation using a single camera. In Proc. 5th International Conference on Computer Vision, Boston MA, Los Alamitos, CA, USA. IEEE Computer Society Press, pp. 645–652.
Ferrari, F., Grosso, E., Sandini, G., and Magrassi, M. 1990. A stereo vision system for real time obstacle avoidance in unknown environment. In Proceedings of IEEE International Workshop on Intelligent Robots and Systems IROS’ 90, pp. 703–708.
Fischler, M.A. and Bolles, R.C. 1981. Random sample consensus: A paradigm for model fitting with application to image analysis and automated cartography. Commun. Assoc. Comp. Mach., 24:381–395.
Harris, C. 1987. Determination of ego-motion from matched points. In Proc. 3rd Alvey Vision Conference, Cambridge, UK, pp. 189–192.
Harris, C. and Pike, J.M. 1987. 3d positional integration from image sequences. In Proc. 3rd Alvey Vision Conference, Cambridge, UK, pp. 233–236.
Harris, C. and Stephens, M. 1988. A combined corner and edge detector. In Proc. 4th Alvey Vision Conference, Manchester, UK, pp. 147–151.
Highnam, R. and Brady, M. 1999. Mammographic Image Processing. Kluwer Academic Publishers.
Konolige, K. 1998. Small vision systems: Hardware and implementation. In Robotics Research: Proceedings of the 8th International Symposium, Springer-Verlag, pp. 203–212.
Li, F., Brady, J.M., and Hu, H. 1995. Visual guidance of an AGV. In Proceedings of ISRR’ 95, Germany.
Matthies, L. and Grandjean, P. 1994. Stochastic performance modeling and evaluation of obstacle detectability with imaging range sensors. IEEE Trans. on Robotics andAutomation, 10(6):783–792.
Matthies, L., Kelly, A., Litwin, T., and Tharp, G. 1996. Obstacle detection for unmanned ground vehicles: A progress report. In Robotics Research: Proceedings of the 7th International Symposium, G. Giralt and G. Hirzinger (Eds.). Springer-Verlag, pp. 475–486.
Matthies, S.A. and Shafer, L. 1987. Error modeling in stereo navigation. IEEE Journal on Robotics and Automation, 3(3):239–248.
Molton, N.D. 1999. Computer vision as an aid for the visually impaired. DPhil Thesis, Dept of Engineering Science, University of Oxford.
Molton, N.D. and Brady, J.M. Accepted for publication. Modelling the motion of a sensor attached to a walking person. Robotics and Autonomous Systems.
Molton, N.D., Se, S., Brady, J.M., Lee, D., and Probert, P. 1999. Robotic sensing for the partially sighted. Robotics and Autonomous Systems, 26(3):185–201.
Moravec, H.P. 1983. The stanford cart and the cmu rover. Proceedings of IEEE, 71(7):872–884.
Murray, D. and Little, J. 1998. Using real-time stereo vision for mobile robot navigation. In Proceedings IEEE Workshop on Perception for Mobile Agents, pp. 19–27.
Pollard, S.B., Mayhew, J.E.W., and Frisby, J.P. 1991. Implementation details of the pmf stereo algorithm. In 3D Model Recognition from Stereoscopic Cues, J.E.W. Mayhew and J.P. Frisby (Eds.). MIT Press, pp. 33–39.
Pollard, S.B., Porrill, J., Mayhew, J.E.W., and Frisby, J.P. 1991. Disparity gradient, Lipschitz continuity, and computing binocular correspondences. In 3D Model Recognition from Stereoscopic Cues, J.E.W. Mayhew and J.P. Frisby (Eds.). MIT Press, pp. 25–32.
Se, S. 1998. Computer vision aids for the partially sighted. DPhil Thesis, Dept of Engineering Science, University of Oxford.
Se, S. and Brady, M. 1998. Stereo vision-based obstacle detection for partially sighted people. In Proc. Third Asian Conference on Computer Vision ACCV’ 98, pp. 152–159.
Stein, G. and Shashua, A. 1998. Direct estimation of motion and extended scene structure from amoving stereo rig. In Proc. Computer Vision and Pattern Recognition CVPR'98, Santa Barbara, California, USA, June 1998, IEEE Computer Society Press, pp. 211–218.
Torr, P.H.S. and Murray, D.W. 1997. The development and comparison of robust methods for estimating the fundamental matrix. Int Journal of Computer Vision, 24(3):271–300.
Tsai, R.Y. 1987. A versatile camera calibration technique for high-accuracy 3d machine vision metrology using off-the shelf tv cameras and lenses. IEEE Journal of Robotics and Automation, 3(4):323–344.
Wang, H. and Brady, M. 1995. Real-time corner detection algorithm for motion estimation. Image and Vision Computing, 13(9):695–703.
Wolfe, W.J., Mathis, D., Sklair, C.W., and Magee, M. 1991. The perspective view of three points. IEEE Transactions on Pattern Analysis and Machine Intelligence, 13(1):66–73.
Young, G.J. and Chellappa, R. 1990. 3-d motyion estimation using a sequence of noisy stereo images: Models, estimation, and uniqueness results. IEEE Transactions on Pattern Analysis and Machine Intelligence, 12(8):735–759.
Zhang, Z. and Faugeras, O.D. 1992. Estimation of displacements from two 3-d frames obtained from stereo. IEEE Transactions on Pattern Analysis and Machine Intelligence, 14(12):1141–1155.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Molton, N., Brady, M. Practical Structure and Motion from Stereo When Motion is Unconstrained. International Journal of Computer Vision 39, 5–23 (2000). https://doi.org/10.1023/A:1008191416557
Issue Date:
DOI: https://doi.org/10.1023/A:1008191416557