Abstract
For natural interaction with augmented reality (AR) applications, good tracking technology is key. But unlike dense stereo, optical flow or multi-view stereo, template-based tracking which is most commonly used for AR applications lacks benchmark datasets allowing a fair comparison between state-of-the-art algorithms. Until now, in order to evaluate objectively and quantitatively the performance and the robustness of template-based tracking algorithms, mainly synthetically generated image sequences were used. The evaluation is therefore often intrinsically biased. In this paper, we describe the process we carried out to perform the acquisition of real-scene image sequences with very precise and accurate ground truth poses using an industrial camera rigidly mounted on the end effector of a high-precision robotic measurement arm. For the acquisition, we considered most of the critical parameters that influence the tracking results such as: the texture richness and the texture repeatability of the objects to be tracked, the camera motion and speed, and the changes of the object scale in the images and variations of the lighting conditions over time. We designed an evaluation scheme for object detection and interframe tracking algorithms suited for AR and other computer vision applications and used the image sequences to apply this scheme to several state-of-the-art algorithms. The image sequences are freely available for testing, submitting and evaluating new template-based tracking algorithms, i.e. algorithms that detect or track a planar object in an image sequence given only one image of the object (called the template).
Similar content being viewed by others
References
Baker S, Matthews I (2004) Lucas-kanade 20 years on: a unifying framework. IJCV 56(3):221–255
Baker S, Scharstein D, Lewis J, Roth S, Black MJ, Szeliski R (2007) A database and evaluation methodology for optical flow. In: ICCV, pp 1–8
Bay H, Tuytelaars T, Gool LJV (2006) Surf: speeded up robust features. In: ECCV, pp 404–417
Benhimane S, Malis E (2007) Homography-based 2d visual tracking and servoing. Int J Rob Res 26(7):661–676
Heeger DJ (1987) Model for the extraction of image flow. J Opt Soc Am A 4(8):1455–1471
Klein G, Murray D (2007) Parallel tracking and mapping for small ar workspaces. In: ISMAR, pp 225–234
Lieberknecht S, Benhimane S, Meier P, Navab N (2009) A dataset and evaluation methodology for template-based tracking algorithms. In: ISMAR
Lowe DG (2004) Distinctive image features from scale-invariant keypoints. IJCV 60(2):91–110
Mikolajczyk K, Schmid C (2004) Scale and affine invariant interest point detectors. IJCV 60(1):63–86
Moreels P, Perona P (2007) Evaluation of features detectors and descriptors based on 3d objects. IJCV 73(3):263–284
Özuysal M, Fua P, Lepetit V (2007) Fast keypoint recognition in ten lines of code. In: CVPR, pp 1–8
Pentenrieder K, Meier P, Klinker G (2006) Analysis of tracking accuracy for single-camera square-marker-based tracking. In: Proceedings of Dritter Workshop Virtuelle und Erweiterte Realität der GI-Fachgruppe VR/AR
Price CJ, Coope ID, Byatt D (2002) A convergent variant of the nelder-mead algorithm. J Optim Theory Appl 113(1):5–19
Seitz SM, Curless B, Diebel J, Scharstein D, Szeliski R (2006) A comparison and evaluation of multi-view stereo reconstruction algorithms. In: CVPR, pp 519–528
Szeliski R, Zabih R, Scharstein D, Veksler O, Kolmogorov V, Agarwala A, Tappen M, Rother C (2008) A comparative study of energy minimization methods for markov random fields with smoothness-based priors. PAMI 30(6):1068–1080
Tsai RY, Lenz RK (1989) A new technique for fully autonomous and efficient 3d robotics hand/eye calibration. IEEE Trans Rob Autom 5(3):345–358
Vedaldi A, Fulkerson B (2008) VLFeat: an open and portable library of computer vision algorithms. http://www.vlfeat.org
Wagner D, Reitmayr G, Mulloni A, Drummond T, Schmalstieg D (2008) Pose tracking from natural features on mobile phones. In: ISMAR, pp 125–134
Zimmerman K, Matas J, Svoboda T (2009) Tracking by an optimal sequence of linear predictors. PAMI 31(4):677–692
Acknowledgments
This work was partially supported by BMBF grant Avilus/01 IM08001 P.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Lieberknecht, S., Benhimane, S., Meier, P. et al. Benchmarking template-based tracking algorithms. Virtual Reality 15, 99–108 (2011). https://doi.org/10.1007/s10055-010-0185-3
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10055-010-0185-3