Abstract
Object tracking across multiple cameras is a very challenge issue in vision based monitoring applications. The selection of features is the first step to realize a reliable tracking algorithm.
In this work we analyse TLD and Struck, which are two of the most cited real-time visual trackers proposed in the literature in last years. They use two different feature extraction methodologies, Fern and Haar, respectively. The idea of this work is to compare performance of these well known visual tracking algorithms replacing their original feature characterization methods with local feature-based visual representations.
We test the improvement in terms of object detection and tracking performance grafting different features characterization into two completely different online tracker frameworks.
The used feature extraction methods are based on Local Binary Pattern (LBP), Local Gradient Pattern (LGP) and Histogram of Oriented Gradients (HOG). LGP is a novel detection methodology which is insensitive to global intensity variations like other representations such as local binary patterns (LBP).
The experimental results on well known benchmark sequences show as the feature extraction replacing improve the overall performances of the considered real-time visual trackers.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsNotes
- 1.
- 2.
- 3.
- 4.
For LGP source code: pierluigi.carcagni@ino.it.
- 5.
References
Ahonen, T., Hadid, A., Pietikainen, M.: Face description with local binary patterns: application to face recognition. IEEE Trans. Pattern Anal. Mach. Intell. 28(12), 2037–2041 (2006)
Brehar, R., Nedevschi, S.: Local information statistics of LBP and HOG for pedestrian detection. In: 2013 IEEE International Conference on Intelligent Computer Communication and Processing (ICCP), pp. 117–122 (2013)
Chuan-xu, W., Zuo-yong, L.: A new face tracking algorithm based on local binary pattern and skin color information. In: International Symposium on Computer Science and Computational Technology 2008, ISCSCT ’08, vol. 2, pp. 657–660 (2008)
Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition 2005, CVPR 2005, vol. 1, pp. 886–893. IEEE (2005)
Davis, J., Goadrich, M.: The relationship between precision-recall and roc curves. In: Proceedings of the 23rd International Conference on Machine Learning, pp. 233–240. ACM (2006)
Dollar, P., Belongie, S., Perona, P.: The fastest pedestrian detector in the west. In: Proceedings of BMVC, pp. 68.1–68.11 (2010). doi:10.5244/C.24.68
D’Orazio, T., Leo, M., Mosca, N., Spagnolo, P., Mazzeo, P.L.: A semi-automatic system for ground truth generation of soccer video sequences. In: Sixth IEEE International Conference on Advanced Video and Signal Based Surveillance 2009, AVSS’09, pp. 559–564. IEEE (2009)
Felzenszwalb, P., Girshick, R., McAllester, D., Ramanan, D.: Object detection with discriminatively trained part-based models. IEEE Trans. Pattern Anal. Mach. Intell. 32(9), 1627–1645 (2010)
Hare, S., Saffari, A., Torr, P.H.: Struck: structured output tracking with kernels. In: 2011 IEEE International Conference on Computer Vision (ICCV), pp. 263–270. IEEE (2011)
Hemery, B., Laurent, H., Rosenberger, C.: Comparative study of metrics for evaluation of object localisation by bounding boxes. In: Fourth International Conference on Image and Graphics 2007, ICIG 2007, pp. 459–464 (2007)
Jun, B., Choi, I., Kim, D.: Local transform features and hybridization for accurate face and human detection. IEEE Trans. Pattern Anal. Mach. Intell. 35(6), 1423–1436 (2013)
Kalal, Z., Matas, J., Mikolajczyk, K.: P-N learning: Bootstrapping binary classifiers by structural constraints. In: 2010 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 49–56. IEEE (2010)
Kalal, Z., Mikolajczyk, K., Matas, J.: Tracking-learning-detection. IEEE Trans. Pattern Anal. Machine Intell. 34(7), 1409–1422 (2012)
Khan, Z., Gu, I.H., Backhouse, A.: Robust visual object tracking using multi-mode anisotropic mean shift and particle filters. IEEE Trans. Circuits Syst. Video Technol. 21(1), 74–87 (2011)
Kuo, C.-H., Huang, C., Nevatia, R.: Inter-camera association of multi-target tracks by on-line learned appearance affinity models. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part I. LNCS, vol. 6311, pp. 383–396. Springer, Heidelberg (2010)
Mazzon, R., Cavallaro, A.: Multi-camera tracking using a multi-goal social force model. Neurocomputing 100, 41–50 (2013)
Ojala, T., Pietikäinen, M., Harwood, D.: A comparative study of texture measures with classification based on featured distributions. Pattern Recogn. 29(1), 51–59 (1996)
Ozuysal, M., Fua, P., Lepetit, V.: Fast keypoint recognition in ten lines of code. In: IEEE Conference on Computer Vision and Pattern Recognition 2007, CVPR’07, pp. 1–8. IEEE (2007)
Rami, H., Hamri, M., Masmoudi, L.: Article: objects tracking in images sequence using local binary pattern (LBP). Int. J. Comput. Appl. 63(20), 19–23 (2013). (Published by Foundation of Computer Science, New York, USA)
Shan, C., Gong, S., McOwan, P.W.: Facial expression recognition based on local binary patterns: a comprehensive study. Image Vis. Comput. 27(6), 803–816 (2009)
Shi, H., Lin, Z., Tang, W., Liao, B., Wang, J., Zheng, L.: A robust hand tracking approach based on modified tracking-learning-detection algorithm. In: Park, J.J.J.H., Chen, S.-C., Gil, J.-M., Yen, N.Y. (eds.) Multimedia and Ubiquitous Engineering. LNEE, vol. 308, pp. 9–15. Springer, Heidelberg (2014). http://dx.doi.org/10.1007/978-3-642-54900-7_2
Sun, N., Zheng, W., Sun, C., Zou, C., Zhao, L.: Gender classification based on boosting local binary pattern. In: Wang, J., Yi, Z., Żurada, J.M., Lu, B.-L., Yin, H. (eds.) ISNN 2006. LNCS, vol. 3972, pp. 194–201. Springer, Heidelberg (2006)
Sun, S., Guo, Q., Dong, F., Lei, B.: On-line boosting based real-time tracking with efficient hog. In: 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 2297–2301 (2013)
Ojala, T., Pietikäinen, M., Mäenpää, T.: Multiresolution gray-scale and rotation invariant texture classification with local binary patterns. IEEE Trans. Pattern Anal. Mach. Intell. 24(7), 971–987 (2002)
Viola, P., Jones, M.: Rapid object detection using a boosted cascade of simple features. In: Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition 2001, CVPR 2001, vol. 1, pp. I-511. IEEE (2001)
Wu, Y., Lim, J., Yang, M.H.: Online object tracking: a benchmark. In: 2013 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2411–2418. IEEE (2013)
Xu, F., Gao, M.: Human detection and tracking based on hog and particle filter. In: 2010 3rd International Congress on Image and Signal Processing (CISP), vol. 3, pp. 1503–1507 (2010)
Yi, S., Yao, Z., Liu, J., Chen, J., Liu, W.: Robust tracking using on-line selection of multiple features. In: 2012 Spring Congress on Engineering and Technology (S-CET), pp. 1–5 (2012)
Yilmaz, A., Javed, O., Shah, M.: Object tracking: a survey. ACM Comput. Surv. (CSUR) 38(4), 13 (2006)
Zhang, L., Chu, R.F., Xiang, S., Liao, S.C., Li, S.Z.: Face detection based on multi-block LBP representation. In: Lee, S.-W., Li, S.Z. (eds.) ICB 2007. LNCS, vol. 4642, pp. 11–18. Springer, Heidelberg (2007)
Acknowledgment
This work has been supported by the “2007–2013 NOP for Research and Competitiveness for the Convergence Regions (Calabria, Campania, Puglia and Sicilia)” with code PON04a3_00201 and in part by the PON Baitah, with code PON01_980.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Adamo, F., Carcagnì, P., Mazzeo, P.L., Distante, C., Spagnolo, P. (2014). TLD and Struck: A Feature Descriptors Comparative Study. In: Mazzeo, P., Spagnolo, P., Moeslund, T. (eds) Activity Monitoring by Multiple Distributed Sensing. AMMDS 2014. Lecture Notes in Computer Science(), vol 8703. Springer, Cham. https://doi.org/10.1007/978-3-319-13323-2_5
Download citation
DOI: https://doi.org/10.1007/978-3-319-13323-2_5
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-13322-5
Online ISBN: 978-3-319-13323-2
eBook Packages: Computer ScienceComputer Science (R0)