Skip to main content
Log in

Efficient detection and tracking of moving objects in geo-coordinates

  • Original Paper
  • Published:
Machine Vision and Applications Aims and scope Submit manuscript

Abstract

We present a system to detect and track moving objects from an airborne platform. Given a global map, such as a satellite image, our approach can locate and track the targets in geo-coordinates, namely longitude and latitude obtained from geo-registration. A motion model in geo-coordinates is more physically meaningful than the one in image coordinates. We propose to use a two-step geo-registration approach to stitch images acquired by satellite and UAV cameras. Mutual information is used to find correspondences between these two very different modalities. After motion segmentation and geo-registration, tracking is performed in a hierarchical manner: at the temporally local level, moving image blobs extracted by motion segmentation are associated into tracklets; at the global level, tracklets are linked by their appearance and spatio-temporal consistency on the global map. To achieve efficient time performance, graphics processing unit techniques are applied in the geo-registration and motion detection modules, which are the bottleneck of the whole system. Experiments show that our method can efficiently deal with long term occlusion and segmented tracks even when targets fall out the field of view.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Ali, S., Shah, M.: Cocoa—tracking in aerial imagery. In: SPIE (2006)

  2. Babenko, P., Shah, M.: Mingpu: A minimum gpu library for computer vision. http://server.cs.ucf.edu/~vision/MinGPU/

  3. Bar-Shalom, Y., Fortmann, T., Scheffe, M.: Joint probabilistic data association for multiple targets in clutter. In: Proceedings of Conference on Information Sciences and Systems (1980)

  4. Brown, M., Lowe, D.G.: Recognizing panoramas. In: ICCV’03. Proceedings of Ninth IEEE International Conference on Computer Vision, pp. 1218–1225 (2003)

  5. Fischler M.A., Bolles R.C.: Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography. Commun. ACM 24(6), 381–395 (1981)

    Article  MathSciNet  Google Scholar 

  6. Griesser, A., et al.: Real-time, gpu-based foreground-background segmentation. In: Vision, Modeling, and Visualization, pp. 319–326 (2005)

  7. Hanna, K., Sawhney, H., Kumar, R., Guo, Y., Samarasekara, S.: Annotation of video by alignment to reference imagery. In: ICCV’99, pp. 253–264 (1999)

  8. Huang, X., Sun, Y., Metaxas, D., Sauer, F., Xu, C.: Hybrid image registration based on configural matching of scale-invariant salient region features. In: CVPRW ’04: Proceedings of the 2004 Conference on Computer Vision and Pattern Recognition Workshop (CVPRW’04), vol. 11, pp. 167

  9. Irani, M., Anandan, P., Hsu, S.: Mosaic based representations of video sequences. In: ICCV, pp. 605–611 (1995)

  10. James Fung, C.A., Steve Mann: Openvidia: Parallel gpu computer vision. In: Proceedings of the ACM Multimedia 2005, pp. 849–852 (2005)

  11. Javed, O., Shafique, K., Shah, M.: A hierarchical approach to robust background subtraction using color and gradient information. In: Proceedings of the Workshop on Motion and Video Computing, pp. 23–28 (2002)

  12. Kang, J., Cohen, I., Medioni, G.: Continuous tracking within and across camera streams. In: CVPR, vol. 1, pp. 267–272 (2003)

  13. Kang, J., Cohen, I., Medioni, G.: Object reacquisition using invariant appearance model. In: ICPR, pp. 759–762 (2004)

  14. Kaucic, R., Perera, A.G.A., Brooksby, G., Kaufhold, J., Hoogs, A.: A unified framework for tracking through occlusions and across sensor gaps. In: CVPR, pp. 990–997 (2005)

  15. Khan, Z., Balch, T., Dellaert, F.: MCMC-based particle filtering for tracking a variable number of interacting targets. IEEE PAMI 11, 1805–1918 (2005)

    Google Scholar 

  16. Kim, J., Kolmogorov, V., Zabih, R.: Visual correspondence using energy minimization and mutual information. In: ICCV’03, p. 1033

  17. Kruger J., Westermann, R.: Linear algebra operators for gpu implementation of numerical algorithms. In: International Conference on Computer Graphics and Interactive Techniques, pp. 908–916 (2003)

  18. Kuhn H.W.: The hungarian method for the assignment problem. Naval Res. Logist. Q. 2, 83–97 (1955)

    Article  Google Scholar 

  19. Lowe D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60(2), 91–110 (2004)

    Article  Google Scholar 

  20. Mann S., Picard R.W.: Video orbits of the projective group a simple approach to featureless estimation of parameters. IEEE Trans. Image Process. 6, 1281–1295 (1997)

    Article  Google Scholar 

  21. Medioni, G.: Matching of a map with an aerial image. In: Proceedings of the 6th International Conference on Pattern Recognition, pp. 517–519 (1982)

  22. Meihe, X., Srinivasan, R., Nowinski, W.L.: A fast mutual information method for multi-modal registration. In: Information Processing in Medical Imaging, pp. 466–471 (1999)

  23. NVIDIA CUDA Programming Guide 1.1. (2007)

  24. Oh, S., Russell, S., Sastry, S.: Markov chain monte carlo data association for general multiple-target tracking problems. In: Proceedings of the 43rd IEEE Conference on Decision and Control, pp. 735–742 (2004)

  25. Patrick Labatut, R.K., Pons, J.-P.: A gpu implementation of level set multiview stereo. In: International Conference on Computational Science, pp. 212–219 (2006)

  26. Prati A., Mikic I., Trivedi M.M., Cucchiara R.: Detecting moving shadows: algorithms and evaluation. PAMI 25(7), 918–923 (2003)

    Google Scholar 

  27. Sinha S.N., Frahm J.-M., Pollefeys M., Genc Y.: Gpu-based video feature tracking and matching. Technical report. Department of Computer Science, UNC, Chapel Hill (2006)

    Google Scholar 

  28. Smith, K., Gatica-Perez, D., Odobez, J.-M.: Using particles to track varying numbers of interacting people. In: CVPR, pp. 962–969 (2005)

  29. Studholme C., Hill D.L.G., Hawkes D.J.: An ovelap invariant entropy measure of 3d medical image alignment. Pattern Recognit. 32(1), 710486 (1999)

    Article  Google Scholar 

  30. Tang, C., Medioni, G., Lee, M.: N-dimensional tensor voting, application to epipolar geometry estimation. In: PAMI, pp. 829–844 (2001)

  31. Viola P., William M.W.I.: Alignment by maximization of mutual information. Int. J. Comput. Vis. 24(2), 137–154 (1997)

    Article  Google Scholar 

  32. Yalcin, H., Collins, R., Hebert, M.: Background estimation under rapid gain change in thermal imagery. In: Object Tracking and Classification in and Beyond the Visible Spectrum (2005)

  33. Yang, R., Pollefeys, M.: Multi-resolution real-time stereo on commodity graphics hardware. In: CVPR, pp. 211–217 (2003)

  34. Yin, Z., Collins, R.: Moving object localization in thermal imagery by forward–backward mhi. In: CVPR Workshop on Object Tracking and Classification in and Beyond the Visible Spectrum (OTCBVS) (2006)

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yuping Lin.

Electronic Supplementary Material

The Below are the Electronic Supplementary Material.

ESM 1 (PDF 679 kb)

ESM 2 (PDF 1,971 kb)

ESM 3 (PDF 1,064 kb)

ESM 4 (PDF 898 kb)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Lin, Y., Yu, Q. & Medioni, G. Efficient detection and tracking of moving objects in geo-coordinates. Machine Vision and Applications 22, 505–520 (2011). https://doi.org/10.1007/s00138-010-0264-1

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00138-010-0264-1

Keywords

Navigation