Abstract
We present a system to detect and track moving objects from an airborne platform. Given a global map, such as a satellite image, our approach can locate and track the targets in geo-coordinates, namely longitude and latitude obtained from geo-registration. A motion model in geo-coordinates is more physically meaningful than the one in image coordinates. We propose to use a two-step geo-registration approach to stitch images acquired by satellite and UAV cameras. Mutual information is used to find correspondences between these two very different modalities. After motion segmentation and geo-registration, tracking is performed in a hierarchical manner: at the temporally local level, moving image blobs extracted by motion segmentation are associated into tracklets; at the global level, tracklets are linked by their appearance and spatio-temporal consistency on the global map. To achieve efficient time performance, graphics processing unit techniques are applied in the geo-registration and motion detection modules, which are the bottleneck of the whole system. Experiments show that our method can efficiently deal with long term occlusion and segmented tracks even when targets fall out the field of view.
Similar content being viewed by others
References
Ali, S., Shah, M.: Cocoa—tracking in aerial imagery. In: SPIE (2006)
Babenko, P., Shah, M.: Mingpu: A minimum gpu library for computer vision. http://server.cs.ucf.edu/~vision/MinGPU/
Bar-Shalom, Y., Fortmann, T., Scheffe, M.: Joint probabilistic data association for multiple targets in clutter. In: Proceedings of Conference on Information Sciences and Systems (1980)
Brown, M., Lowe, D.G.: Recognizing panoramas. In: ICCV’03. Proceedings of Ninth IEEE International Conference on Computer Vision, pp. 1218–1225 (2003)
Fischler M.A., Bolles R.C.: Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography. Commun. ACM 24(6), 381–395 (1981)
Griesser, A., et al.: Real-time, gpu-based foreground-background segmentation. In: Vision, Modeling, and Visualization, pp. 319–326 (2005)
Hanna, K., Sawhney, H., Kumar, R., Guo, Y., Samarasekara, S.: Annotation of video by alignment to reference imagery. In: ICCV’99, pp. 253–264 (1999)
Huang, X., Sun, Y., Metaxas, D., Sauer, F., Xu, C.: Hybrid image registration based on configural matching of scale-invariant salient region features. In: CVPRW ’04: Proceedings of the 2004 Conference on Computer Vision and Pattern Recognition Workshop (CVPRW’04), vol. 11, pp. 167
Irani, M., Anandan, P., Hsu, S.: Mosaic based representations of video sequences. In: ICCV, pp. 605–611 (1995)
James Fung, C.A., Steve Mann: Openvidia: Parallel gpu computer vision. In: Proceedings of the ACM Multimedia 2005, pp. 849–852 (2005)
Javed, O., Shafique, K., Shah, M.: A hierarchical approach to robust background subtraction using color and gradient information. In: Proceedings of the Workshop on Motion and Video Computing, pp. 23–28 (2002)
Kang, J., Cohen, I., Medioni, G.: Continuous tracking within and across camera streams. In: CVPR, vol. 1, pp. 267–272 (2003)
Kang, J., Cohen, I., Medioni, G.: Object reacquisition using invariant appearance model. In: ICPR, pp. 759–762 (2004)
Kaucic, R., Perera, A.G.A., Brooksby, G., Kaufhold, J., Hoogs, A.: A unified framework for tracking through occlusions and across sensor gaps. In: CVPR, pp. 990–997 (2005)
Khan, Z., Balch, T., Dellaert, F.: MCMC-based particle filtering for tracking a variable number of interacting targets. IEEE PAMI 11, 1805–1918 (2005)
Kim, J., Kolmogorov, V., Zabih, R.: Visual correspondence using energy minimization and mutual information. In: ICCV’03, p. 1033
Kruger J., Westermann, R.: Linear algebra operators for gpu implementation of numerical algorithms. In: International Conference on Computer Graphics and Interactive Techniques, pp. 908–916 (2003)
Kuhn H.W.: The hungarian method for the assignment problem. Naval Res. Logist. Q. 2, 83–97 (1955)
Lowe D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60(2), 91–110 (2004)
Mann S., Picard R.W.: Video orbits of the projective group a simple approach to featureless estimation of parameters. IEEE Trans. Image Process. 6, 1281–1295 (1997)
Medioni, G.: Matching of a map with an aerial image. In: Proceedings of the 6th International Conference on Pattern Recognition, pp. 517–519 (1982)
Meihe, X., Srinivasan, R., Nowinski, W.L.: A fast mutual information method for multi-modal registration. In: Information Processing in Medical Imaging, pp. 466–471 (1999)
NVIDIA CUDA Programming Guide 1.1. (2007)
Oh, S., Russell, S., Sastry, S.: Markov chain monte carlo data association for general multiple-target tracking problems. In: Proceedings of the 43rd IEEE Conference on Decision and Control, pp. 735–742 (2004)
Patrick Labatut, R.K., Pons, J.-P.: A gpu implementation of level set multiview stereo. In: International Conference on Computational Science, pp. 212–219 (2006)
Prati A., Mikic I., Trivedi M.M., Cucchiara R.: Detecting moving shadows: algorithms and evaluation. PAMI 25(7), 918–923 (2003)
Sinha S.N., Frahm J.-M., Pollefeys M., Genc Y.: Gpu-based video feature tracking and matching. Technical report. Department of Computer Science, UNC, Chapel Hill (2006)
Smith, K., Gatica-Perez, D., Odobez, J.-M.: Using particles to track varying numbers of interacting people. In: CVPR, pp. 962–969 (2005)
Studholme C., Hill D.L.G., Hawkes D.J.: An ovelap invariant entropy measure of 3d medical image alignment. Pattern Recognit. 32(1), 710486 (1999)
Tang, C., Medioni, G., Lee, M.: N-dimensional tensor voting, application to epipolar geometry estimation. In: PAMI, pp. 829–844 (2001)
Viola P., William M.W.I.: Alignment by maximization of mutual information. Int. J. Comput. Vis. 24(2), 137–154 (1997)
Yalcin, H., Collins, R., Hebert, M.: Background estimation under rapid gain change in thermal imagery. In: Object Tracking and Classification in and Beyond the Visible Spectrum (2005)
Yang, R., Pollefeys, M.: Multi-resolution real-time stereo on commodity graphics hardware. In: CVPR, pp. 211–217 (2003)
Yin, Z., Collins, R.: Moving object localization in thermal imagery by forward–backward mhi. In: CVPR Workshop on Object Tracking and Classification in and Beyond the Visible Spectrum (OTCBVS) (2006)
Author information
Authors and Affiliations
Corresponding author
Electronic Supplementary Material
The Below are the Electronic Supplementary Material.
Rights and permissions
About this article
Cite this article
Lin, Y., Yu, Q. & Medioni, G. Efficient detection and tracking of moving objects in geo-coordinates. Machine Vision and Applications 22, 505–520 (2011). https://doi.org/10.1007/s00138-010-0264-1
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00138-010-0264-1