Skip to main content
Log in

Benchmarking template-based tracking algorithms

  • SI: Augmented Reality
  • Published:
Virtual Reality Aims and scope Submit manuscript

Abstract

For natural interaction with augmented reality (AR) applications, good tracking technology is key. But unlike dense stereo, optical flow or multi-view stereo, template-based tracking which is most commonly used for AR applications lacks benchmark datasets allowing a fair comparison between state-of-the-art algorithms. Until now, in order to evaluate objectively and quantitatively the performance and the robustness of template-based tracking algorithms, mainly synthetically generated image sequences were used. The evaluation is therefore often intrinsically biased. In this paper, we describe the process we carried out to perform the acquisition of real-scene image sequences with very precise and accurate ground truth poses using an industrial camera rigidly mounted on the end effector of a high-precision robotic measurement arm. For the acquisition, we considered most of the critical parameters that influence the tracking results such as: the texture richness and the texture repeatability of the objects to be tracked, the camera motion and speed, and the changes of the object scale in the images and variations of the lighting conditions over time. We designed an evaluation scheme for object detection and interframe tracking algorithms suited for AR and other computer vision applications and used the image sequences to apply this scheme to several state-of-the-art algorithms. The image sequences are freely available for testing, submitting and evaluating new template-based tracking algorithms, i.e. algorithms that detect or track a planar object in an image sequence given only one image of the object (called the template).

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8

Similar content being viewed by others

References

  • Baker S, Matthews I (2004) Lucas-kanade 20 years on: a unifying framework. IJCV 56(3):221–255

    Article  Google Scholar 

  • Baker S, Scharstein D, Lewis J, Roth S, Black MJ, Szeliski R (2007) A database and evaluation methodology for optical flow. In: ICCV, pp 1–8

  • Bay H, Tuytelaars T, Gool LJV (2006) Surf: speeded up robust features. In: ECCV, pp 404–417

  • Benhimane S, Malis E (2007) Homography-based 2d visual tracking and servoing. Int J Rob Res 26(7):661–676

    Article  Google Scholar 

  • Heeger DJ (1987) Model for the extraction of image flow. J Opt Soc Am A 4(8):1455–1471

    Article  Google Scholar 

  • Klein G, Murray D (2007) Parallel tracking and mapping for small ar workspaces. In: ISMAR, pp 225–234

  • Lieberknecht S, Benhimane S, Meier P, Navab N (2009) A dataset and evaluation methodology for template-based tracking algorithms. In: ISMAR

  • Lowe DG (2004) Distinctive image features from scale-invariant keypoints. IJCV 60(2):91–110

    Article  Google Scholar 

  • Mikolajczyk K, Schmid C (2004) Scale and affine invariant interest point detectors. IJCV 60(1):63–86

    Article  Google Scholar 

  • Moreels P, Perona P (2007) Evaluation of features detectors and descriptors based on 3d objects. IJCV 73(3):263–284

    Article  Google Scholar 

  • Özuysal M, Fua P, Lepetit V (2007) Fast keypoint recognition in ten lines of code. In: CVPR, pp 1–8

  • Pentenrieder K, Meier P, Klinker G (2006) Analysis of tracking accuracy for single-camera square-marker-based tracking. In: Proceedings of Dritter Workshop Virtuelle und Erweiterte Realität der GI-Fachgruppe VR/AR

  • Price CJ, Coope ID, Byatt D (2002) A convergent variant of the nelder-mead algorithm. J Optim Theory Appl 113(1):5–19

    Article  MathSciNet  MATH  Google Scholar 

  • Seitz SM, Curless B, Diebel J, Scharstein D, Szeliski R (2006) A comparison and evaluation of multi-view stereo reconstruction algorithms. In: CVPR, pp 519–528

  • Szeliski R, Zabih R, Scharstein D, Veksler O, Kolmogorov V, Agarwala A, Tappen M, Rother C (2008) A comparative study of energy minimization methods for markov random fields with smoothness-based priors. PAMI 30(6):1068–1080

    Article  Google Scholar 

  • Tsai RY, Lenz RK (1989) A new technique for fully autonomous and efficient 3d robotics hand/eye calibration. IEEE Trans Rob Autom 5(3):345–358

    Article  Google Scholar 

  • Vedaldi A, Fulkerson B (2008) VLFeat: an open and portable library of computer vision algorithms. http://www.vlfeat.org

  • Wagner D, Reitmayr G, Mulloni A, Drummond T, Schmalstieg D (2008) Pose tracking from natural features on mobile phones. In: ISMAR, pp 125–134

  • Zimmerman K, Matas J, Svoboda T (2009) Tracking by an optimal sequence of linear predictors. PAMI 31(4):677–692

    Article  Google Scholar 

Download references

Acknowledgments

This work was partially supported by BMBF grant Avilus/01 IM08001 P.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sebastian Lieberknecht.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Lieberknecht, S., Benhimane, S., Meier, P. et al. Benchmarking template-based tracking algorithms. Virtual Reality 15, 99–108 (2011). https://doi.org/10.1007/s10055-010-0185-3

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10055-010-0185-3

Keywords

Navigation