Abstract
Visual object tracking plays an important role in intelligent aerial surveillance by unmanned aerial vehicles (UAV). In ordinary applications, aerial videos are captured by cameras with a fixed-focus lens or a zoom lens, for which the field-of-view (FOV) of the camera is fixed or smoothly changed. In this paper, a special application of the visual tracking in aerial videos captured by the dual FOV camera is introduced, which is different from ordinary applications since the camera quickly switches its FOV during the capturing. Firstly, the tracking process with the dual FOV camera is analyzed, and a conclusion is made that the critical part for the whole process depends on the accurate tracking of the target at the moment of FOV switching. Then, a cascade mean shift tracker is proposed to deal with the target tracking under FOV switching. The tracker utilizes kernels with multiple bandwidths to execute mean shift locating, which is able to deal with the abrupt motion of the target caused by FOV switching. The target is represented by the background weighted histogram to make it well distinguished from the background, and a modification is made to the weight value in the mean shift process to accelerate the convergence of the tracker. Experimental results show that our tracker presents a good performance on both accuracy and efficiency for the tracking. To the best of our knowledge, this paper is the first attempt to apply a visual object tracking method to the situation where the FOV of the camera switches in aerial videos.
Similar content being viewed by others
References
L. Mejias, J. F. Correa, I. Mondragón, P. Campoy. COLIBRI: A vision-guided UAV for surveillance and visual inspection. In Proceedings of IEEE International Conference on Robotics and Automation, IEEE, Roma, Italy, pp. 2760–2761, 2007.
E. M. Hemerly. Automatic Georeferencing of Images Acquired by UAV’s. International Journal of Automation and Computing, vol. 11, no. 4, pp. 357–352, 2014.
D. J. Lee, I. Kaminer, V. Dobrokhodov, K. Jones. Autonomous feature following for visual surveillance using a small unmanned aerial vehicle with gimbaled camera system. International Journal of Control, Automation and Systems, vol. 8, no. 5, pp. 957–966, 2010.
J. J. Xiao, C. J. Yang, F. Han, H. Cheng. Vehicle and person tracking in aerial videos. In Proceedings of International Evaluation Workshops CLEAR 2007 and RT 2007, Springer, Baltimore, USA, pp. 203–214, 2007.
S. Saripalli, J. F. Montgomery, G. Sukhatme. Visually guided landing of an unmanned aerial vehicle. IEEE Transactions on Robotics and Automation, vol. 19, no. 3, pp. 371–380, 2003.
J. Prokaj, X. M. Zhao, G. Medioni. Tracking many vehicles in wide area aerial surveillance. In Proceedings of 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, IEEE, Providence, USA, pp. 37–43, 2012.
I. Szottka, M. Butenuth. Advanced particle filtering for airborne vehicle tracking in urban areas. IEEE Geoscience and Remote Sensing Letters, vol. 11, no. 3, pp. 686–690, 2014.
J. J. Xiao, H. Cheng, H. Sawhney, F. Han. Vehicle detection and tracking in wide field-of-view aerial video. In Proceedings of 2010 IEEE Conference on Computer Vision and Pattern Recognition, IEEE, San Francisco, USA, pp. 679–684, 2010.
W. S. Yu, X. D. Yin, B. Chen, J. H. Xie. Object tracking with particle filter in UAV video. In Proceedings of SPIE 8918, MIPPR 2013: Automatic Target Recognition and Navigation, SPIE, Wuhan, China, pp. 891810, 2013.
H. Shen, S. X. Li, C. F. Zhu, H. X. Chang, J. L. Zhang. Moving object detection in aerial video based on spatiotemporal saliency. Chinese Journal of Aeronautics, vol. 26, no. 5, pp. 1211–1217, 2013.
A. Yilmaz, O. Javed, M. Shah. Object tracking: A survey. ACM Computing Surveys, vol. 38, no. 4, pp. 13, 2006.
Y. Wu, J. Lim, M. H. Yang. Online object tracking: A benchmark. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, IEEE, Portland, USA, pp. 2411–2418, 2013.
J. Prokaj, G. Medioni. Persistent tracking for wide area aerial surveillance. In Proceedings of 2014 IEEE Conference on Computer Vision and Pattern Recognition, IEEE, Columbus, USA, pp. 1186–1193, 2014.
Y. C. Li, Y. J. Zhang. Design of dual field of view and zoom infrared optical system. Advanced Materials Research, vol. 403–408, pp. 2919–2922, 2012.
C. J. Huang, C. S. Tsai, B. R. Chen, J. Y. Yen, J. F. Lee, L. H. Lin, M. S. Chen. High performance FOV switching mechanism design for an infrared zoom lens. International Journal of Automation and Smart Technology, vol. 1, no. 2, pp. 111–119, 2011.
P. Pérez, C. Hue, J. Vermaak, M. Gangnet. Color-based probabilistic tracking. In Proceedings of the 7th European Conference on Computer Vision Copenhagen, Springer, Copenhagen, Denmark, pp. 661–675, 2002.
J. Kwon, K. M. Lee. Wang-Landau Monte Carlo-based tracking methods for abrupt motions. IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 35, no. 4, pp. 1011–1024, 2013.
D. Comaniciu, V. Ramesh, P. Meer. Kernel-based object tracking. IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 25, no. 5, pp. 564–577, 2003.
G. Lupatini, C. Saraceno, R. Leonardi. Scene break detection: A comparison. In Proceedings of the 8th International Workshop on Research Issues in Data Engineering Continuous-media Databases and Applications, IEEE, Orlando, USA, pp. 34–41, 1998.
C. H. Shen, M. J. Brooks, A. Van Den Hengel. Fast global kernel density mode seeking: Applications to localization and tracking. IEEE Transactions on Image Processing, vol. 16, no. 5, pp. 1457–1469, 2007.
J. Jeyakar, R. V. Babu, K. R. Ramakrishnan. Robust object tracking with background-weighted local kernels. Computer Vision and Image Understanding, vol. 112, no. 3, pp. 296–309, 2008.
S. X. Li, H. X. Chang, C. F. Zhu. Adaptive pyramid mean shift for global real-time visual tracking. Image and Vision Computing, vol. 28, no. 3, pp. 424–437, 2010.
Z. L. Jiang, S. F. Li, D. F. Gao. An adaptive mean shift tracking method using multiscale images. In Proceedings of IEEE International Conference on Wavelet Analysis and Pattern Recognition, IEEE, Beijing, China, vol. 3, pp. 1060–1066, 2007.
VIVID tracking evaluation. Networks, [Online], Available: http://vision.cse.psu.edu/data/vividEval/datasets/ datasets.html, March 14, 2015.
Author information
Authors and Affiliations
Corresponding author
Additional information
This work was supported by National Natural Science Foundation of China (Nos. 61175032, 61302154 and 61304096).
Recommended by Associate Editor Yasushi Yagi
Yi Song graduated from Hunan University, China in 2010. He is now a Ph.D. candidate in Institute of Automation, Chinese Academy of Sciences, China.
His research interests include computer vision and image analysis.
ORCID iD: 0000-0003-0932-8806
Shu-Xiao Li graduated from Xi’an Jiaotong University, China in 2003. He received the Ph. D. degree from Institute of Automation, Chinese Academy of Sciences (CASIA), China in 2008. He is currently an associate professor in CASIA.
His research interests include computer vision, image processing, and its applications.
Cheng-Fei Zhu graduated from University of Science and Technology of China in 2004. He received the Ph.D. degree from Institute of Automation, Chinese Academy of Sciences (CASIA), China in 2010. He is currently an assistant professor in CASIA.
His research interests include computer vision and image processing.
ORCID iD: 0000-0002-6484-7089
Hong-Xing Chang graduated from Beihang University in 1986. He received the M. Sc. degree from Beihang University, China in 1991. He is currently a professor in Institute of Automation, Chinese Academy of Sciences, China.
His research interests include computer vision, integrated information processing, and its applications.
Rights and permissions
About this article
Cite this article
Song, Y., Li, SX., Zhu, CF. et al. Object tracking with dual field-of-view switching in aerial videos. Int. J. Autom. Comput. 13, 565–573 (2016). https://doi.org/10.1007/s11633-016-0949-7
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11633-016-0949-7