Abstract
Integration of 21/2D sketches obtained at different observation stations into a consistent world (or object) representation is one of the central issues in computer vision and robotics. The resolution and accuracy of 21/2D sketches may be different from one view point to another, and inconsistent data between different observations may occur. This article presents an approach to building a spatiotemporal representation of dynamic scenes including moving objects from a sequence of range images taken by a moving observer. A range image is transformed into a height-map representation, which is segmented into the ground plane and objects on it. In order to capture the resolution and accuracy of the range information and the consistency of the height information between different height maps, we define a reliability measure of the height information for each bucket on the height map. Using this reliability, the system finds the correspondences of both static and moving objects between different observations, and successively refines the height information and its reliability with newly acquired data, dealing with inconsistent data. Final representation of the integrated height map consists of the time stamp of the last observation, region labels of static and moving objects and their spatiotemporal properties such as height information, its reliability, and the velocities of both the observer and independently moving objects. We applied the method to road scenes physically simulated by landscape toy models and show the experimental results.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
W.E.L.Grimson, From Images to Surfaces. M.I.T. Press: Cambridge, MA, 1981.
O.D. Faugeras, F. Lustman, and G. Toscani, Motion and structure from motion from point and line matches, Proc. Intern. Conf. Comput. Vis., London, pp. 24–34, 1987.
N.Ayache and O.D.Faugeras, Building, registering, and fusing noisy visual maps, Intern. J. Robot. Res. 7(6): 45–65, 1988.
L. Matthies, R. Szleski, and T. Kanade, Kalman filter-based algorithm for estimating depth from image sequences, Proc. Image Understanding Workshop, Cambridge, MA, pp. 199–213, 1988.
A.Elfes, Using occupancy grids for mobile robot perception and navigation, Computer 22(6): 46–57, 1989.
C.Thorpe, M.H.Hebert, T.Kanade, and S.A.Shafer, Vision and navigation for the Carnegie-Mellon Navlab, IEEE Trans. Patt. Anal. Mach. Intell. 10: 362–373, 1988.
M.A. Turk, D.G. Morgenthaler, K.D. Gremban, and M. Marra, Video road-following for the autonomous land vehicle, Proc. IEEE Intern. Conf. Robot. Autmat., Raleigh, NC, pp. 273–280, 1987.
I. Kweon and T. Kanade, High resolution terrain map from multiple sensory data, Proc. IEEE Intern. Workshop Intell. Robot. and Syst., Tsuchiura, Japan, pp. 127–134, 1990.
R.C.Bolles, H.H.Baker, and D.H.Mariomont, Epipolar-plane image analysis: An approach to determining structure from motion, Intern. J. Comput. Vis. 1: 7–55, 1987.
A.M. Waxman, J. LeMoigne, L.S. Davis, E. Liang, and T. Siddalingaiah, A visual navigation system, Proc. IEEE Intern. Conf. Robot. Autmat., San Francisco, CA, pp. 1600–1606, 1986.
D.DeMenthon, T.Siddalingaiah, and L.S.Davis, Production of dense range images with the CVL light-stripe range seanner, Center for Automation Research Technical Report, University of Maryland, College Park, MD, CAR-TR-337, 1987.
M. Asada, E. Ikeda, and Y. Shirai, Interpretation and integration of height maps from a range image sequence, Proc. IEEE/RSJ Intern. Workshop Intell. Robot. Syst., Tsukuba, Japan, pp. 54–61, 1989.
K.E. Olin, M.J. Daily, J.G. Harris, and F.M. Vilnrotter, Knowledge-based vision technology overview for obstacle detection and avoidance, Proc. Image Understanding Workshop, pp. 134–143, 1989.
M.Nagao and T.Matsuyama, Edge preserving smoothing, CGIP, 9: 394–407, 1979.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Asada, M., Kimura, M., Taniguchi, Y. et al. Dynamic integration of height maps into a 3D world representation from range image sequences. Int J Comput Vision 9, 31–53 (1992). https://doi.org/10.1007/BF00163582
Issue Date:
DOI: https://doi.org/10.1007/BF00163582