Skip to main content
Log in

Object tracking with dual field-of-view switching in aerial videos

  • Research Article
  • Published:
International Journal of Automation and Computing Aims and scope Submit manuscript

Abstract

Visual object tracking plays an important role in intelligent aerial surveillance by unmanned aerial vehicles (UAV). In ordinary applications, aerial videos are captured by cameras with a fixed-focus lens or a zoom lens, for which the field-of-view (FOV) of the camera is fixed or smoothly changed. In this paper, a special application of the visual tracking in aerial videos captured by the dual FOV camera is introduced, which is different from ordinary applications since the camera quickly switches its FOV during the capturing. Firstly, the tracking process with the dual FOV camera is analyzed, and a conclusion is made that the critical part for the whole process depends on the accurate tracking of the target at the moment of FOV switching. Then, a cascade mean shift tracker is proposed to deal with the target tracking under FOV switching. The tracker utilizes kernels with multiple bandwidths to execute mean shift locating, which is able to deal with the abrupt motion of the target caused by FOV switching. The target is represented by the background weighted histogram to make it well distinguished from the background, and a modification is made to the weight value in the mean shift process to accelerate the convergence of the tracker. Experimental results show that our tracker presents a good performance on both accuracy and efficiency for the tracking. To the best of our knowledge, this paper is the first attempt to apply a visual object tracking method to the situation where the FOV of the camera switches in aerial videos.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. L. Mejias, J. F. Correa, I. Mondragón, P. Campoy. COLIBRI: A vision-guided UAV for surveillance and visual inspection. In Proceedings of IEEE International Conference on Robotics and Automation, IEEE, Roma, Italy, pp. 2760–2761, 2007.

    Google Scholar 

  2. E. M. Hemerly. Automatic Georeferencing of Images Acquired by UAV’s. International Journal of Automation and Computing, vol. 11, no. 4, pp. 357–352, 2014.

    Article  Google Scholar 

  3. D. J. Lee, I. Kaminer, V. Dobrokhodov, K. Jones. Autonomous feature following for visual surveillance using a small unmanned aerial vehicle with gimbaled camera system. International Journal of Control, Automation and Systems, vol. 8, no. 5, pp. 957–966, 2010.

    Article  Google Scholar 

  4. J. J. Xiao, C. J. Yang, F. Han, H. Cheng. Vehicle and person tracking in aerial videos. In Proceedings of International Evaluation Workshops CLEAR 2007 and RT 2007, Springer, Baltimore, USA, pp. 203–214, 2007.

    Google Scholar 

  5. S. Saripalli, J. F. Montgomery, G. Sukhatme. Visually guided landing of an unmanned aerial vehicle. IEEE Transactions on Robotics and Automation, vol. 19, no. 3, pp. 371–380, 2003.

    Article  Google Scholar 

  6. J. Prokaj, X. M. Zhao, G. Medioni. Tracking many vehicles in wide area aerial surveillance. In Proceedings of 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, IEEE, Providence, USA, pp. 37–43, 2012.

    Google Scholar 

  7. I. Szottka, M. Butenuth. Advanced particle filtering for airborne vehicle tracking in urban areas. IEEE Geoscience and Remote Sensing Letters, vol. 11, no. 3, pp. 686–690, 2014.

    Article  Google Scholar 

  8. J. J. Xiao, H. Cheng, H. Sawhney, F. Han. Vehicle detection and tracking in wide field-of-view aerial video. In Proceedings of 2010 IEEE Conference on Computer Vision and Pattern Recognition, IEEE, San Francisco, USA, pp. 679–684, 2010.

    Chapter  Google Scholar 

  9. W. S. Yu, X. D. Yin, B. Chen, J. H. Xie. Object tracking with particle filter in UAV video. In Proceedings of SPIE 8918, MIPPR 2013: Automatic Target Recognition and Navigation, SPIE, Wuhan, China, pp. 891810, 2013.

    Chapter  Google Scholar 

  10. H. Shen, S. X. Li, C. F. Zhu, H. X. Chang, J. L. Zhang. Moving object detection in aerial video based on spatiotemporal saliency. Chinese Journal of Aeronautics, vol. 26, no. 5, pp. 1211–1217, 2013.

    Article  Google Scholar 

  11. A. Yilmaz, O. Javed, M. Shah. Object tracking: A survey. ACM Computing Surveys, vol. 38, no. 4, pp. 13, 2006.

    Article  Google Scholar 

  12. Y. Wu, J. Lim, M. H. Yang. Online object tracking: A benchmark. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, IEEE, Portland, USA, pp. 2411–2418, 2013.

    Google Scholar 

  13. J. Prokaj, G. Medioni. Persistent tracking for wide area aerial surveillance. In Proceedings of 2014 IEEE Conference on Computer Vision and Pattern Recognition, IEEE, Columbus, USA, pp. 1186–1193, 2014.

    Chapter  Google Scholar 

  14. Y. C. Li, Y. J. Zhang. Design of dual field of view and zoom infrared optical system. Advanced Materials Research, vol. 403–408, pp. 2919–2922, 2012.

    Google Scholar 

  15. C. J. Huang, C. S. Tsai, B. R. Chen, J. Y. Yen, J. F. Lee, L. H. Lin, M. S. Chen. High performance FOV switching mechanism design for an infrared zoom lens. International Journal of Automation and Smart Technology, vol. 1, no. 2, pp. 111–119, 2011.

    Article  Google Scholar 

  16. P. Pérez, C. Hue, J. Vermaak, M. Gangnet. Color-based probabilistic tracking. In Proceedings of the 7th European Conference on Computer Vision Copenhagen, Springer, Copenhagen, Denmark, pp. 661–675, 2002.

    Google Scholar 

  17. J. Kwon, K. M. Lee. Wang-Landau Monte Carlo-based tracking methods for abrupt motions. IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 35, no. 4, pp. 1011–1024, 2013.

    Article  Google Scholar 

  18. D. Comaniciu, V. Ramesh, P. Meer. Kernel-based object tracking. IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 25, no. 5, pp. 564–577, 2003.

    Article  Google Scholar 

  19. G. Lupatini, C. Saraceno, R. Leonardi. Scene break detection: A comparison. In Proceedings of the 8th International Workshop on Research Issues in Data Engineering Continuous-media Databases and Applications, IEEE, Orlando, USA, pp. 34–41, 1998.

    Chapter  Google Scholar 

  20. C. H. Shen, M. J. Brooks, A. Van Den Hengel. Fast global kernel density mode seeking: Applications to localization and tracking. IEEE Transactions on Image Processing, vol. 16, no. 5, pp. 1457–1469, 2007.

    Article  MathSciNet  Google Scholar 

  21. J. Jeyakar, R. V. Babu, K. R. Ramakrishnan. Robust object tracking with background-weighted local kernels. Computer Vision and Image Understanding, vol. 112, no. 3, pp. 296–309, 2008.

    Article  Google Scholar 

  22. S. X. Li, H. X. Chang, C. F. Zhu. Adaptive pyramid mean shift for global real-time visual tracking. Image and Vision Computing, vol. 28, no. 3, pp. 424–437, 2010.

    Article  Google Scholar 

  23. Z. L. Jiang, S. F. Li, D. F. Gao. An adaptive mean shift tracking method using multiscale images. In Proceedings of IEEE International Conference on Wavelet Analysis and Pattern Recognition, IEEE, Beijing, China, vol. 3, pp. 1060–1066, 2007.

    Google Scholar 

  24. VIVID tracking evaluation. Networks, [Online], Available: http://vision.cse.psu.edu/data/vividEval/datasets/ datasets.html, March 14, 2015.

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Cheng-Fei Zhu.

Additional information

This work was supported by National Natural Science Foundation of China (Nos. 61175032, 61302154 and 61304096).

Recommended by Associate Editor Yasushi Yagi

Yi Song graduated from Hunan University, China in 2010. He is now a Ph.D. candidate in Institute of Automation, Chinese Academy of Sciences, China.

His research interests include computer vision and image analysis.

ORCID iD: 0000-0003-0932-8806

Shu-Xiao Li graduated from Xi’an Jiaotong University, China in 2003. He received the Ph. D. degree from Institute of Automation, Chinese Academy of Sciences (CASIA), China in 2008. He is currently an associate professor in CASIA.

His research interests include computer vision, image processing, and its applications.

Cheng-Fei Zhu graduated from University of Science and Technology of China in 2004. He received the Ph.D. degree from Institute of Automation, Chinese Academy of Sciences (CASIA), China in 2010. He is currently an assistant professor in CASIA.

His research interests include computer vision and image processing.

ORCID iD: 0000-0002-6484-7089

Hong-Xing Chang graduated from Beihang University in 1986. He received the M. Sc. degree from Beihang University, China in 1991. He is currently a professor in Institute of Automation, Chinese Academy of Sciences, China.

His research interests include computer vision, integrated information processing, and its applications.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Song, Y., Li, SX., Zhu, CF. et al. Object tracking with dual field-of-view switching in aerial videos. Int. J. Autom. Comput. 13, 565–573 (2016). https://doi.org/10.1007/s11633-016-0949-7

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11633-016-0949-7

Keywords

Navigation