Skip to main content
Log in

Real-time manifold regularized context-aware correlation tracking

  • Research Article
  • Published:
Frontiers of Computer Science Aims and scope Submit manuscript

Abstract

Despite the demonstrated success of numerous correlation filter (CF) based tracking approaches, their assumption of circulant structure of samples introduces significant redundancy to learn an effective classifier. In this paper, we develop a fast manifold regularized context-aware correlation tracking algorithm that mines the local manifold structure information of different types of samples. First, different from the traditional CF based tracking that only uses one base sample, we employ a set of contextual samples near to the base sample, and impose a manifold structure assumption on them. Afterwards, to take into account the manifold structure among these samples, we introduce a linear graph Laplacian regularized term into the objective of CF learning. Fortunately, the optimization can be efficiently solved in a closed form with fast Fourier transforms (FFTs), which contributes to a highly efficient implementation. Extensive evaluations on the OTB100 and VOT2016 datasets demonstrate that the proposed tracker performs favorably against several state-of-the-art algorithms in terms of accuracy and robustness. Especially, our tracker is able to run in real-time with 28 fps on a single CPU.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Li X, Hu W M, Shen C H, Zhang Z F, Dick A, Hengel A V D. A survey of appearance models in visual object tracking. ACM Transactions on Intelligent Systems and Technology, 2013, 4(4): 1–13

    Article  Google Scholar 

  • Wang H J, Ge H J. Visual tracking using discriminative representation with l 2 regularization. Frontiers of Computer Science, 2018, 12(1): 1–13

    Article  Google Scholar 

  • Ali A, Jalil A, Niu J W, Zhao X K, Rathore S, Ahmed J, Iftikhar M A. Visual object tracking-classical and contemporary approaches. Frontiers of Computer Science, 2016, 10(1): 167–188

    Article  Google Scholar 

  • Zhang K H, Liu Q S, Ynag J, Yang M H. Visual tracking via boolean map representations. Pattern Recognition, 2018, 81: 147–160

    Article  Google Scholar 

  • Zhang K H, Li X J, Song H H, Liu Q S, Lian W. Visual tracking using spatio-temporally nonlocally regularized correlation filter. Pattern Recognition, 2083, 185: 185–195

    Google Scholar 

  • Bolme D S, Beveridge J R, Draper B A, Lui Y M. Visual object tracking using adaptive correlation filters. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2010, 2544–2550

    Google Scholar 

  • Henriques J F, Caseiro R, Martins P, Batista J. High-speed tracking with kernelized correlation filters. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2015, 37(3): 583–596

    Article  Google Scholar 

  • Zhang K H, Zhang L, Liu Q S, Zhang D, Yang M H. Fast visual tracking via dense spatio-temporal context learning. In: Proceedings of European Conference on Computer Vision. 2014, 127–141

    Google Scholar 

  • Bertinetto L, Valmadre J, Golodetz S, Miksik O, Torr P H S. Staple: complementary learners for real-time tracking. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2016, 1401–1409

    Google Scholar 

  • Zhang K H, Liu Q S, Wu Y, Yang M H. Robust visual tracking via convolutional networks without training. IEEE Transactions on Image Processing, 2016, 25(4): 1779–1792

    MathSciNet  MATH  Google Scholar 

  • Ma C, Xu Y, Ni B B, Yang X K. When correlation filters meet convolutional neural networks for visual tracking. IEEE Signal Processing Letters, 2016, 23(10): 1454–1458

    Article  Google Scholar 

  • Danelljan M, Häger G, Khan F, Felsberg M. Accurate scale estimation for robust visual tracking. In: Proceedings of the British Machine Vision Conference. 2014

    Google Scholar 

  • Kristan M, Leonardis A, Matas J. The visual object tracking VOT2017 challenge results. In: Proceedings of the IEEE International Conference on Computer Vision Workshop. 2017, 1949–1972

    Google Scholar 

  • Mueller M, Smith N, Ghanem B. Context-aware correlation filter tracking. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2017, 1396–1404

    Google Scholar 

  • Galoogahi K H, Fagg A, Lucey S. Learning background-aware correlation filters for visual tracking. In: Proceedings of the IEEE International Conference on Computer Vision. 2017, 1135–1143

    Google Scholar 

  • Yan Y, Nie F, Li W, Gao C Q, Yang Y, Xu D. Image classification by cross-media active learning with privileged information. IEEE Transactions on Multimedia, 2016, 18(12): 2494–2502

    Article  Google Scholar 

  • Yang Y, Ma Z G, Nie F P, Chang X J, Hauptmann A G. Multi-class active learning by uncertainty sampling with diversity maximization. International Journal of Computer Vision, 2015, 113(2): 113–127

    Article  MathSciNet  Google Scholar 

  • Yang Y, Nie F P, Xu D, Luo J B, Zhuang Y T, Pan Y H. A multimedia retrieval framework based on semi-supervised ranking and relevance feedback. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2012, 34(4): 723–742

    Article  Google Scholar 

  • Wu Y, Lim J W, Yang M H. Object tracking benchmark. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2015, 37(9): 1834–1848

    Article  Google Scholar 

  • Danelljan M, Shahbaz K F, Felsberg M, Joost V W. Adaptive color attributes for real-time visual tracking. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2014, 1090–1097

    Google Scholar 

  • Ma C, Huang J B, Yang X K, Yang M H. Hierarchical convolutional features for visual tracking. In: Proceedings of the IEEE International Conference on Computer Vision. 2015, 3074–3082

    Google Scholar 

  • Danelljan M, Hager G, Khan F S, Felsberg M. Convolutional features for correlation filter based visual tracking. In: Proceedings of the IEEE International Conference on Computer Vision Workshops. 2015, 58–66

    Google Scholar 

  • Danelljan M, Robinson A, Khan F S, Felsberg M. Beyond correlation filters: learning continuous convolution operators for visual tracking. In: Proceedings of European Conference on Computer Vision. 2016, 472–488

    Google Scholar 

  • Danelljan M, Bhat G, Khan F S, Felsberg M. ECO: rfficient convolution operators for tracking. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2017, 21–26

    Google Scholar 

  • Liu S, Zhang T Z, Cao X C, Xu C S. Structural correlation filter for robust visual tracking. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2016, 4312–4320

    Google Scholar 

  • Lukezic A, Vojír T, Zajc L C, Matas J, Kristan M. Discriminative correlation filter with channel and spatial reliability. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2017, 6309–6318

    Google Scholar 

  • Danelljan M, Hager G, Shahbaz K F, Felsberg M. Learning spatially regularized correlation filters for visual tracking. In: Proceedings of European Conference on Computer Vision. 2015, 4310–4318

    Google Scholar 

  • Belkin M, Niyogi P, Sindhwani V. Manifold regularization: a geometric framework for learning from labeled and unlabeled examples. Journal of Machine Learning Research, 2006, 7(Nov): 2399–2434

    MathSciNet  MATH  Google Scholar 

  • Chang X J, Yang Y. Semisupervised feature analysis by mining correlations among multiple tasks. IEEE Transactions on Neural Networks and Learning Systems, 2017, 28(10): 2294–2305

    Article  MathSciNet  Google Scholar 

  • Yu S, Yang Y, Hauptmann A. Harry potter’s marauder’s map: localizing and tracking multiple persons-of-interest by nonnegative discretization. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2013, 3714–3720

    Google Scholar 

  • Bai Y C, Tang M. Robust tracking via weakly supervised ranking SVM. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2012, 1854–1861

    Google Scholar 

  • Hu H W, Ma B, Shen J B, Shao L. Manifold regularized correlation object tracking. IEEE Transactions on Neural Networks and Learning Systems, 2018, 29(5): 1786–1795

    Article  MathSciNet  Google Scholar 

  • Zhuang B, Lu H C, Xiao Z Y, Wang D. Visual tracking via discriminative sparse similarity map. IEEE Transactions on Image Processing, 2014, 23(4): 1872–1881

    Article  MathSciNet  MATH  Google Scholar 

  • Belkin M, Niyogi P. Laplacian eigenmaps and spectral techniques for embedding and clustering. Advances in Neural Processing Systems, 2002, 585: 585–591

    Google Scholar 

  • Ma C, Yang X K, Zhang C Y, Yang M H. Long-term correlation tracking. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2015, 5388–5396

    Google Scholar 

  • Zhang J M, Ma S G, Sclaroff S. MEEM: robust tracking via multiple experts using entropy minimization. In: Proceedings of European Conference on Computer Vision. 2014, 188–203

    Google Scholar 

  • Hare S, Golodetz S, Saffari A, Vineet V, Cheng M M, Hicks S L, Torr P H S. Struck: structured output tracking with kernels. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2016, 38(10): 2096–2109

    Article  Google Scholar 

  • Wu Y, Lim J W, Yang M H. Online object tracking: a benchmark. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2013, 2411–2418

    Google Scholar 

  • Song Y B, Ma C, Gong L J, Zhang J W, Lau R W H, Yang M H. Crest: convolutional residual learning for visual tracking. In: Proceedings of the IEEE International Conference on Computer Vision. 2017, 2574–2583

    Google Scholar 

  • Zhu G, Porikli F, Li H D. Beyond local search: tracking objects everywhere with instance-specific proposals. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2016, 943–951

    Google Scholar 

Download references

Acknowledgments

This work was supported in part by NSF of Jiangsu province (BK20170040), in part by the National Natural Science Foundation of China (Grant Nos. 61872189, 61876088, 61605083), in part by the NSF of Jiangsu Higher Education Institutions of China (16KJB510023), in part by Postgraduate Research & Practice Innovation Program of Jiangsu Province (KYCX17_0903).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Kaihua Zhang.

Additional information

Jiaqing Fan recieved Bachelor degree from the School of Information and Control in Nanjing University of Information Science and Technology, China. He is currently pursuing the MS degree with the School of Information and Control, Nanjing University of Information Science and Technology, China. His current research interests include visual object tracking algorithms.

Huihui Song is a professor with the Jiangsu Key Laboratory of Big Data Analysis Technology, Nanjing University of Information Science and Technology, China. She received her BS degree in technology and science of electronic information from Ocean University of China in 2008, master’s degree in communication and information system from University of Science and Technology of China (USTC) in 2011, and PhD degree in geography and resource management from the Chinese University of Hong Kong, China in 2014. Her research interests include remote sensing image processing and image fusion.

Kaihua Zhang is a professor in the School of Information and Control, Nanjing University of Information Science and Technology, China. He received the BS degree in Technology and Science of Electronic Information from Ocean University of China (OUC), China in 2006, the MS degree in Signal and Information Processing from the University of Science and Technology of China (USTC), China in 2009 and PhD degree from the Department of Computing in the Hong Kong Polytechnic University, China in 2013. From August 2009 to August 2010, he worked as a Research Assistant in the Department of Computing, The Hong Kong Polytechnic University, China. His research interests include image segmentation, level sets, and visual tracking.

Qingshan Liu is a professor with the School of Information and Control, Nanjing University of Information Science and Technology, China. He received the PhD degree from the National Laboratory of Pattern Recognition, Chinese Academic of Science, China in 2003, and the MS degree from the Department of Auto Control, Southeast University, China in 2000. He was an Assistant Research Professor with the Department of Computer Science, Computational Biomedicine Imaging and Modeling Center, Rutgers, The State University of New Jersey, New Brunswick, NJ, USA, from 2010 to 2011. Before he joined Rutgers University, he was an Associate Professor with the National Laboratory of Pattern Recognition, Chinese Academic of Science, and an Associate Researcher with the Multimedia Laboratory, Chinese University of Hong Kong, China, from 2004 and 2005. He was a recipient of the President Scholarship of the Chinese Academy of Sciences, China in 2003. His current research interests are image and vision analysis, including face image analysis, graph and hypergraph-based image and video understanding, medical image analysis, and event-based video analysis.

Fei Yan is a PhD, lecturer. He graduated from the University of Chinese Academy of Sciences in the electrical and systems specialty, China, mainly engaged in image processing, information display technology. Wei Lian received the BS degree in automation from Taiyuan University of Technology, China in 2000, and the MS and PhD degrees in automatic control theory and engineering from Northwestern Polytechnical University, China in 2003 and 2007, respectively. He is an associate professor in the Department of Computer Science, Changzhi University, China. He was a research fellow in the Department of Computing, The Hong Kong Polytechnic University, China in 2018.

Electronic supplementary material

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Fan, J., Song, H., Zhang, K. et al. Real-time manifold regularized context-aware correlation tracking. Front. Comput. Sci. 14, 334–348 (2020). https://doi.org/10.1007/s11704-018-8104-y

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11704-018-8104-y

Keywords

Navigation