Abstract
A method is described of visually tracking a known three-dimensional object as it moves with six degrees of freedom. The method uses the predicted position of known features on the object to find the features in images from one or more cameras, measures the position of the features in the images, and uses these measurements to update the estimates of position, orientation, linear velocity, and angular velocity of the object model. The features usually used are brightness edges that correspond to markings or the edges of solid objects, although point features can be used. The solution for object position and orientation is a weighted least-squares adjustment that includes filtering over time, which reduces the effects of errors, allows extrapolation over times of missing data, and allows the use of stereo information from multiple-camera images that are not coincident in time. The filtering action is derived so as to be optimum if the acceleration is random. (Alternatively, random torque can be assumed for rotation.) The filter is equivalent to a Kalman filter, but for efficiency it is formulated differently in order to take advantage of the dimensionality of the observations and the state vector which occur in this problem. The method can track accurately with arbitrarily large angular velocities, as long as the angular acceleration (or torque) is small. Results are presented showing the successful tracking of partially obscured objects with clutter.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Ayache, N., and Faugeras, O.D. 1988. Building, registrating, and fusing noisy visual maps. Intern. J. Robotics Res. 7:45–65.
Brand, L. 1947. Vector and Tensor Analysis. Wiley: New York.
Broida, T.J., and Chellappa, R. 1986. Estimation of object motion parameters from noisy images. IEEE Trans. Patt. Anal. Mach. Intell. PAMI 8:90–99.
Corben, H.C., and Stehle, P. 1960. Classical Mechanics (2nd ed.). Wiley: New York.
Dickmanns, E.D., and Graefe, V. 1988a. Dynamic monocular machine vision. Mach. Vis. App. 1:223–240.
Dickmanns, E.D., and Graefe, V. 1988b. Applications of dynamic monocular machine vision. Mach. Vis. App. 1:241–261.
Eskenazi, R., and Wilf, J.M. 1979. Low-level processing for real-time image analysis. Jet Propulsion Laboratory, Pasadena, CA, JPL Publication 79–79.
Gennery, D.B. 1982. Tracking known three-dimensional objects. Proc. AAAI 2nd Natl. Conf. Artif. Intell., Pittsburgh, PA, pp. 13–17.
Gennery, D.B. 1986. Stereo vision for the acquisition and tracking of moving three-dimensional objects. In Techniques for 3-D Machine Perception (A.Rosenfeld, ed.). Elsevier: Amsterdam.
Gennery, D.B. 1990. Properties of a random-acceleration recursive filter. Jet Propulsion Laboratory, Pasadena, CA, JPL internal report D-8057.
Gennery, D.B. 1991. Camera calibration including lens distortion. Jet Propulsion Laboratory, Pasadena, CA, JPL internal report D-8580.
Gennery, D.B., Litwin, T., Wilcox, B., and Bon, B. 1987. Sensing and perception research for space telerobotics at JPL. Proc. IEEE Intern. Conf. Robot. Autom., Raleigh, NC, pp. 311–317.
Gennery, D.B., and Wilcox, B. 1985. A pipelined processor for low-level vision. Proc. IEEE Comput. Soc. Conf. Comput. Vision Patt. Recog., San Francisco, CA, pp. 608–613.
Gilbert, A.L., Giles, M.K., Flachs, G.M., Rogers, R.B., and U, Y.H. 1980. A real-time video tracking system. IEEE Trans. Patt. Anal. Mach. Intell. PAMI 2:47–56.
Goldstein, H. 1980. Classical Mechanics (2nd ed.). Addison-Wesley: Reading, MA.
Hedgley, D.R. 1982. A general solution to the hidden-line problem. Ames Research Center, Dryden Flight Research Facility, Edwards, CA, NASA Reference Publication 1085.
Kalman, R.E. 1960. A new approach to linear filtering and prediction problems. Trans. ASME, series D, J. Basic Engin. 82:35–45.
Martin, W.N., and Aggarwal, J.K. 1978. Dynamic scene analysis. Comput. Graph. Image Process. 7:356–374.
Maybeck, P.M. 1979. Stochastic Models, Estimation, and Control, vol. 1. Academic Press: New York.
Mikhail, E.M. (with contributions by F. Ackermann) 1976. Observations and Least Squares. Harper & Row: New York.
Nagel, H.-H. 1978. Analysis techniques for image sequences. Proc. 4th Intern. Conf. Patt. Recog., Tokyo, pp. 186–211.
Pinkney, H.F.L. 1978. Theory and development of an on-line 30 Hz video photogrammetry system for real-time 3-dimensional control. Proc. ISP Symp. Photogramm. Industry, Stockholm, Sweden.
Roach, J.W., and Aggarwal, J.K. 1979. Computer tracking of objects moving in space. IEEE Trans. Patt. Anal. Mach. Intell. PAMI 1:127–135.
Saund, E., Gennery, D.B., and Cunningham, R.T. 1981. Visual tracking in stereo. Joint Autom. Cont. Conf., sponsored by ASME, University of Virginia.
Verghese, V., and Dyer, C.R. 1988. Real-time, model-based tracking of three-dimensional objects. University of Wisconsin, Madison, WI, Computer Sciences Tech. Rept. #806.
Wilcox, B., Tso, K., Litwin, T., Hayati, S., and Bon, B. 1989. Autonomous sensor-based dual-arm satellite grappling. Proc. NASA Conf. Space Telerobotics, Pasadena, CA (JPL Publication 87–7), vol. III, pp. 307–316.
Wu, J.J., Rink, R.E., Caelli, T.M., and Gourishankar, V.G. 1989. Recovery of the 3-D location and motion of a rigid object through camera image (An extended Kalman filter approach). Intern. J. Comput. Vis. 2:373–394.
Wünsche, H.-J. 1986. Detection and control of mobile robot motion by real-time computer vision. In Mobile Robots. W.J. Wolfe and N. Marquina, eds., Proc. SPIE, 727, Cambridge, MA, pp. 100–109.
Yakimovsky, Y., and Cunningham, R.T. 1978. A system for extracting three-dimensional measurements from a stereo pair of TV cameras. Comput. Graph. Image Process. 7:195–210.
Young, G.-S.J., and Chellappa, R. 1990. 3-D motion estimation using a sequence of noisy sterco images: Models, estimation, and uniqueness results. IEEE Trans. Patt. Anal. Mach. Intell. PAMI 12:735–759.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Gennery, D.B. Visual tracking of known three-dimensional objects. Int J Comput Vision 7, 243–270 (1992). https://doi.org/10.1007/BF00126395
Received:
Revised:
Issue Date:
DOI: https://doi.org/10.1007/BF00126395