Abstract
When objects undergo large pose change, illumination variation or partial occlusion, most existing visual tracking algorithms tend to drift away from targets and even fail to track them. To address the issue, in this paper we propose a multi-scale patch-based appearance model with sparse representation and provide an efficient scheme involving the collaboration between multi-scale patches encoded by sparse coefficients. The key idea of our method is to model the appearance of an object by different scale patches, which are represented by sparse coefficients with different scale dictionaries. The model exploits both partial and spatial information of targets based on multi-scale patches. Afterwards, a similarity score of one candidate target is input into a particle filter framework to estimate the target state sequentially over time in visual tracking. Additionally, to decrease the visual drift caused by frequently updating model, we present a novel two-step object tracking method which exploits both the ground truth information of the target labeled in the first frame and the target obtained online with the multi-scale patch information. Experiments on some publicly available benchmarks of video sequences showed that the similarity involving complementary information can locate targets more accurately and the proposed tracker is more robust and effective than others.
Similar content being viewed by others
References
Yilmaz, A., Javed, O., Shah, M.: Object tracking: a survey. ACM Comput. Survey 38(4), 1–45 (2006)
Wright, J., Ma, Y., Mairal, J., Sapiro, G., Huang, T., Yan, S.: Sparse representation for computer vision and pattern recognition. Proc. IEEE. 98(6), 1031–1044 (2010)
Candès, E., Romberg, J., Tao, T.: Stable signal recovery from incomplete and inaccurate measurements. Comm. Pure Appl. Math. 59(8), 1207–1223 (2006)
Donoho, D.: Compressed sensing. IEEE Trans. Inf. Theory 52(4), 1289–1306 (2006)
Mei, X., Ling, H.: Robust visual tracking using \(l_{1}\) minimization, ICCV, pp. 1436–1443 (2009)
Li, H.X., Shen, C.: Robust real-time visual tracking using compressive sensing. CVPR, pp. 1305–1312 (2011)
Han, Z., Jiao, J., Zhang, B., Ye, Q., Liu, J.: Visual object tracking via sample-based adaptive sparse representation (AdaSR). Pattern Recogn. 44(9), 2170–2183 (2011)
Bai, T., Li, Y.F.: Robust visual tracking with structured sparse representation appearance model. Pattern Recogn. 45(6), 2390–2404 (2012)
Liu, B., Huang, J. et al.: Robust tracking using local sparse appearance model and K-selection. CVPR, pp. 1313–1320 (2011)
Jia, X., Lu, H., Yang, M.H.: Visual tracking via adaptive structural local sparse appearance model. CVPR, pp. 1822–1829 (2012)
Matthews, I., Ishikawa, T., Baker, S.: The template update problem, PAMI. 810–815 (2004)
Wang, Q., Chen, F., Xu, W., Yang, M.-H.: Object tracking via partial least squares analysis. IEEE Trans. Image Process. 21(10), 4454–4465 (2012)
Pérez, P., Hue, C. Vermaak, J., Gangnet, M.: Color-based probabilistic tracking. In: Proceedings of European conference on computer vision, pp. 661–675 (2002)
Comaniciu, D., Ramesh, V., Meer, P.: Kernel-based object tracking. IEEE Trans. Pattern Anal. Mach. Intell. 25(5), 564–575 (2003)
Ross, D., Lim, J., Lin, R.S., Yang, M.-H.: Incremental learning for robust visual tracking. IJCV 77(1), 125–141 (2008)
Kwon, J., Lee, K. M.: Visual tracking decomposition. CVPR, pp. 1269–1276 (2010)
Kwon, J., Lee, K.M.: Tracking by sampling trackers. ICCV, pp. 1195–1202 (2011)
Collins, R., Liu, Y., Leordeanu, M.: Online selection of discriminative tracking features. IEEE Trans. Pattern Anal. Mach. Intell. 27(10), 1631–1643 (2005)
Ramirez-Paredes, J.-P., Sanchez-Yanez, R.E., Ayala-Ramirez, V.: A fuzzy inference approach to template-based visual tracking. Mach. Vis. Appl. 23(3), 427–439 (2012)
Wang, D., Lu, H.C., Yang, M.-H.: Online Object Tracking with sparse prototypes. IEEE Trans. Image Process. 22(1), 314–325 (2013)
Zhong, W., Lu, H.C., Yang, M.-H.: Robust object tracking via sparsity-based collaborative model. In CVPR, pp. 1838–1845 (2012)
Wu, Y., Cheng, J., Wang, J.Q., Lu, H.Q., Wang, J., Ling, H.B., Blasch, E., Bai, L.: Real-time probabilistic covariance tracking with efficient model update. IEEE Trans. Image Process. 21(5), 2824–2837 (2012)
Avidan, S.: Ensemble tracking. PAMI 29(2), 261–271 (2007)
Viola, P., Platt, J.C., Zhang, C.: Multiple instance boosting for object detection. In: NIPS. 1417–1426 (2005)
Avidan, S.: Support vector tracking. In: CVPR. pp. 184-191 (2001)
Grabner, H., Bischof, H.: On-line boosting and vision. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, pp. 260–267 (2006)
Parag, T., Porikli, F., Elgammal, A.: Boosting adaptive linear weak classifiers for online learning and tracking. CVPR. pp. 1–8 (2008)
Carvalho, P., Oliveira, T., Ciobanu, L., Gaspar, F., et al.: Analysis of object description methods in a video object tracking environment. Mach. Vis. Appl. 24(6), 1149–1165 (2011)
Babenko, B., Yang, M.-H., Belongie, S.: Robust object tracking with online multiple instance learning. IEEE Trans. Pattern Anal. Mach. Intell. 33(8), 1619–1632 (2011)
Wright, J., Yang, A., Ganesh, A., Sastry, S., Ma, Y.: Robust face recognition via sparse representation. IEEE Trans. Pattern Anal. Mach. Intell. 31(2), 210–227 (2008)
Chen, F., Wang, Q., Wang, S., Zhang, W., Xu, W.: Object tracking via appearance modeling and sparse representation. Image Vis. Comput. 21(11), 787–796 (2011)
Wang, Q., Chen, F., Xu, W., Yang, M.-H.: Online discriminative object tracking with local sparse representation’, WACV ’12 Proceedings of the 2012 IEEE Workshop on the Applications of Computer Vision, pp. 425–432 (2012)
Doucet, A., de Freitas, N., Gordon, N.: Sequential Monte Carlo Methods in Practice. Springer, New York (2001)
Arulampalam, M.S., Maskell, S., Gordon, N., Clapp, T.: A tutorial on particle filters for online nonlinear/non-gaussian bayesian tracking. IEEE Trans. Signal Process. 50(2), 174–188 (2002)
Zhang K.H., Zhang L., Yang M.H.: Real-time compressive tracking. ECCV, pp. 864–877 (2012)
Tropp, J.A., Gilbert, A.C.: Signal recovery from random measurements via orthogonal matching pursuit. IEEE Trans. Inf. Theory 53(12), 4655–4666 (2007)
Everingham, M., Gool, L.V., Williams, C., Winn, J., Zisserman, A.: The pascal visual object classes (VOC) challenge. IJCV 88(2), 303–338 (2010)
Acknowledgments
This work was supported by the NSFC-Guangdong Joint Foundation Key Project under Grant (No. U1135003), the National Nature Science Foundation of China (Nos. 61070227, 61300058 and 41302261).
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Xie, C., Tan, J., Chen, P. et al. Multi-scale patch-based sparse appearance model for robust object tracking. Machine Vision and Applications 25, 1859–1876 (2014). https://doi.org/10.1007/s00138-014-0632-3
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00138-014-0632-3