A Framework for Model-Based Tracking Experiments in Image Sequences

Dahlkamp, Hendrik; Nagel, Hans-Hellmut; Ottlik, Artur; Reuter, Paul

doi:10.1007/s11263-006-9786-4

A Framework for Model-Based Tracking Experiments in Image Sequences

Published: 04 September 2006

Volume 73, pages 139–157, (2007)
Cite this article

International Journal of Computer Vision Aims and scope Submit manuscript

Hendrik Dahlkamp¹,
Hans-Hellmut Nagel¹,
Artur Ottlik¹ &
…
Paul Reuter¹

329 Accesses
18 Citations
Explore all metrics

Abstract

Motris, an integrated system for model-based tracking research, has been designed modularly to study the effects of algorithmic variations on tracking results. Motris attempts to avoid introducing bias into the relative assessment of alternative approaches. Such a bias may be caused by differences of implementation and parameterization if the component approaches are evaluated in separate testing environments. Tracking results are evaluated automatically on a significant test sample in order to quantify the effects of different combinations of alternatives. The Motris system environment thus allows an in-depth comparison between the so-called ‘Edge-Element Association’ approach documented in Haag and Nagel (1999) and the more recent ‘Expectation-Maximization’ approach reported by Pece and Worrall (2002).

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Guided Search 6.0: An updated model of visual search

Article 05 February 2021

HOTA: A Higher Order Metric for Evaluating Multi-object Tracking

Article Open access 08 October 2020

An Overview to Visual Odometry and Visual SLAM: Applications to Mobile Robotics

Article 13 November 2015

References

Badenas, J., Sanchiz, J.M., and Pla, F. 2001. Motion-based segmentation and region tracking in image sequences. Pattern Recognition, 34(3):661–670.
Article Google Scholar
Brand, M. and Kettnaker, V. 2000. Discovery and segmentation of activities in video. IEEE Transactions on Pattern Analysis and Machine Intelligence, 22(8):844–851.
Article Google Scholar
Collins, R.T., Lipton, A.J., and Kanade, T. 2000. Introduction to the special section on video surveillance. IEEE Transactions on Pattern Analysis and Machine Intelligence, 22(8):745–746.
Article Google Scholar
Cortelazzo, G.M. and Guerra, C. 2004. Model-based and Image-based 3D scene representation for interactive visualization. Computer Vision and Image Understanding, 96(3):269–273.
Article Google Scholar
Cremers, D. and Schnörr, C. 2003. Statistical shape knowledge in variational motion segmentation. Image and Vision Computing, 21(1):77–86.
Article Google Scholar
Cucchiara, R., Grana, C., Piccardi, M., and Prati, A. 2003. Detecting moving objects, ghosts, and shadows in video streams. IEEE Transactions on Pattern Analysis and Machine Intelligence, 25(10):1337–1342.
Article Google Scholar
Dahlkamp, H., Ottlik, A., Reuter, P., and Nagel, H.-H. 2004a. Model Based TRacking in Image Sequences—A framework for experiments. Technical report, Institut für Algorithmen und Kognitive Systeme, Fakultät für Informatik der Universität Karlsruhe (TH), 76128 Karlsruhe, Germany. See, too, Motris Homepage.
Dahlkamp, H., Pece, A.E.C., Ottlik, A., and Nagel, H.-H. 2004b. Differential analysis of two model-based vehicle tracking approaches. In Pattern Recognition, Proc. 26th DAGM-Symposium (DAGM’04), C.E. Rasmussen, H.H. Bülthoff, M.A. Giese, and B. Schölkopf (Eds.), vol. 3175 of Lecture Notes in Computer Science, pp. 71–78. Tübingen, Germany. Springer: Berlin, Heidelberg, New York, NY.
Dahlkamp, H., Ottlik, A., and Nagel, H.-H. 2006. Comparison of edge-driven algorithms for model-based motion estimation. In Proc. First International Workshop on Spatial Coherency for Visual Motion Analysis (SCVMA 2004), W.J. MacLean (Ed.), Prague, Czech Republic, vol. 3667 of Lecture Notes in Computer Science, pp. 38–50. Springer: Berlin, Heidelberg, New York, NY.
DeCarlo, D. and Metaxas, D. 1996. The integration of optical flow and deformable models with applications to human face shape and motion estimation. In Proc. IEEE Conference on Computer Vision and Pattern Recognition, San Francisco, CA, pp. 231–238. IEEE Computer Society: Los Alamitos, CA.
DeCarlo, D. and Metaxas, D. 1999. Combining information using hard constraints. In Proc. IEEE Conference on Computer Vision and Pattern Recognition, Fort Collins, CO, pp. II:132–138. IEEE Computer Society: Los Alamitos, CA.
Duric, Z., Goldenberg, R., Rivlin, E., and Rosenfeld, A. 2002. Estimating relative vehicle motions in traffic scenes. Pattern Recognition, 35(6):1339–1353.
Article Google Scholar
Elgammal, A., Duraiswami, R., Harwood, D., and Davis, L.S. 2002. Background and foreground modeling using nonparametric kernel density estimation for visual surveillance. Proceedings of the IEEE, 90(7):1151–1163.
Google Scholar
Ferryman, J.M. (Ed.) 2003. Proc. Joint IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance (VS–PETS), Nice, France. The University of Reading. ISBN 0-7695-2022-7.
Ferryman, J.M., Maybank, S.J., and Worrall, A.D. 1998. Visual surveillance for moving vehicles. In 1998 IEEE Workshop on Visual Surveillance, S.J. Maybank and T. Tan (Eds.), Bombay, India, pp. 73–80. IEEE Computer Society: Los Alamitos, CA.
Ferryman, J.M., Maybank, S.J., and Worrall, A.D. 2000. Visual surveillance for moving vehicles. International Journal of Computer Vision, 37(2):187–197.
Article Google Scholar
Fitzgibbon, A.W. and Zisserman, A. 2000. Multibody structure and motion: 3-D reconstruction of independently moving objects. In Vernon (2000), pp. II:891–906.
Fua, P. 2000. Regularized bundle-adjustment to model heads from image sequences without calibration data. International Journal of Computer Vision, 38(2):153–171.
Article Google Scholar
Gavrila, D.M. 1999. The visual analysis of human movement: A survey. Computer Vision and Image Understanding, 73(1):82–98.
Article Google Scholar
Haag, M. 1998. Bildfolgenauswertung zur Erkennung der Absichten von Straßenverkehrsteilnehmern, vol. 193 of Dissertationen zur Künstlichen Intelligenz (DISKI). infix–Verlag, St. Augustin/ Germany, Dissertation, Fakultät für Informatik der Universität Karlsruhe (TH), Juli 1998 (in German).
Haag, M. and Nagel, H.-H. 1999. Combination of edge element and optical flow estimates for 3D-model-based vehicle tracking in traffic image sequences. International Journal of Computer Vision, 35(3):295–319.
Article Google Scholar
Horn, B.K.P. 1986. Robot Vision. The MIT Press: Cambridge, MA, London, UK.
Google Scholar
IAKS Image Sequences: http://kogs.iaks.uni-karls-ruhe.de/image_sequences/.
ICCV-2001: Eighth International Conference on Computer Vision, volume I + II, Vancouver, BC, Canada. IEEE Computer Society: Los Alamitos, CA.
Jang, D.-S. and Choi, H.-I. 2000. Active models for tracking moving objects. Pattern Recognition, 33(7):1135–1146.
Article Google Scholar
Jurie, F. and Dhome, M. 2002. Real time tracking of 3D objects: An efficient and robust approach. Pattern Recognition, 35(2):317–328.
Article Google Scholar
Kakadiaris, I. and Metaxas, D. 2000. Model-based estimation of 3D human motion. IEEE Transactions on Pattern Analysis and Machine Intelligence, 22(12):1453–1459.
Article Google Scholar
Kang, D.J., Ha, J.E., and Kweon, I.S. 2003. Fast object recognition using dynamic programming from combination of salient line groups. Pattern Recognition, 36(1):79–90.
Article Google Scholar
Kang, H.-G. and Kim, D. 2005. Real-time multiple people tracking using competitive condensation. Pattern Recognition, 38(7):1045–1058.
Article MathSciNet Google Scholar
Kastrinaki, V., Zervakis, M., and Kalaitzakis, K. 2003. A survey of video processing techniques for traffic applications. Image and Vision Computing, 21(4):359–381.
Article Google Scholar
Kato, J., Watanabe, T., Joga, S., Rittscher, J., and Blake, A. 2002. An HMM-based segmentation method for traffic monitoring movies. IEEE Transactions on Pattern Analysis and Machine Intelligence, 24(9):1291–1296.
Article Google Scholar
Kim, Z. and Malik, J. 2003. Fast vehicle detection with probabilistic feature grouping and its application to vehicle tracking. In Proc. Ninth International Conference on Computer Vision, vol. I+II, Nice, France, pp. I:524–531. IEEE Computer Society: Los Alamitos, CA.
Kollnig, H. and Nagel, H.-H. 1995. 3D pose estimation by fitting image gradients directly to polyhedral models. In Proc. Fifth International Conference on Computer Vision, Cambridge, MA, pp. 569–574, IEEE Computer Society: Los Alamitos, CA.
Kollnig, H. and Nagel, H.-H. 1996. Matching object models to segments from an optical flow field. In Proc. 4th European Conference on Computer Vision, B. Buxton and R. Cipolla (Eds.), vol. 1064–1065 of Lecture Notes in Computer Science (LNCS), pp. II:388–399, Cambridge, UK. Springer-Verlag: Berlin, Heidelberg, New York, NY.
Kumar, P., Ranganath, S., Sengupta, K., and Weimin, H. 2004. Co-operative multi-target tracking and classification. In Pajdla and Matas (2004), pp. I:376–389.
Lee, L., Romano, R., and Stein, G. 2000. Monitoring activities from multiple video streams: Establishing a common coordinate frame. IEEE Transactions on Pattern Analysis and Machine Intelligence, 22(8):758–767.
Article Google Scholar
Lerdsudwichai, C., Abdel-Mottaleb, M., and Ansari, A.-N. 2005. Tracking multiple people with recovery from partial and total occlusion. Pattern Recognition, 38(7):1059–1070.
Article Google Scholar
Liu, Q., Lou, J., Hu, W., and Tan, T. 2003. Model based pose determination using bayes classification error. In Ferryman (2003), pp. 38–45, ISBN 0-7695-2022-7.
Magee, D.R. 2004. Tracking multiple vehicles using foreground, background and motion models. Image and Vision Computing, 22(2):143–155.
Article MathSciNet Google Scholar
Mansuri, A.-R. 2002. Region tracking via level set PDEs without motion computation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 24(7):947–961.
Article Google Scholar
Middendorf, M. 2004. Zur Auswertung lokaler Grauwertstrukturen. (in German, ISBN 3-8334-1175-9).
Middendorf, M. and Nagel, H.-H. 2001. Estimation and interpretation of discontinuities in optical flow fields. In ICCV-2001, pp. I:178–183.
Moeslund, T.B. and Granum, E. 2001. A survey of computer vision-based human motion capture. Computer Vision and Image Understanding, 81(3):231–268.
Article Google Scholar
Motamed, C. and Wallart, O. 2003. A cooperative distributed vision algorithm for wide area vehicle tracking. In Ferryman (2003), pp. 86–93, ISBN 0-7695-2022-7.
Motris Homepage: http://i21www.iaks.uni-karls-ruhe.de/motris/.
Nagel, H.-H. 2004. Steps toward a cognitive vision system. AI-Magazine, 25(2):31–50.
MathSciNet Google Scholar
Ning, H., Tan, T., Wang, L., and Hu, W. 2004. People tracking based on motion model and motion constraints with automatic initialization. Pattern Recognition, 37(7):1423–1440.
Article Google Scholar
Nummiaro, K., Koller-Meier, E., and Van Gool, L. 2003. An adaptive color-based particle filter. Image and Vision Computing, 21(1):99–110.
Article Google Scholar
Pai, C.-H., Tyan, H.-R., Liang, Y.-M., Liao, H.-Y.M., and Chen, S.-W. 2004. Pedestrian detection and tracking at crossroads. Pattern Recognition, 37(5):1025–1034.
Article Google Scholar
Pajdla, T. and Matas, J. (Eds.) 2004. Proc. 8th European Conference on Computer Vision (ECCV 2004, Parts I–IV), vol. 3021–3024 of Lecture Notes in Computer Science (LNCS), Prague, Czech Republic. Springer-Verlag: Berlin, Heidelberg, New York, NY.
Pece, A.E.C. 2003a. Generative model based vision. Image and Vision Computing, 21(1):1–3.
Article Google Scholar
Pece, A.E.C. 2003b. The Kalman-EM contour tracker. In Proceedings of the 3rd Workshop on Statistical and Computational Theories of Vision (SCTV 2003), Nice, France.
Pece, A.E.C. and Worrall, A.D. 2002. Tracking with the EM contour algorithm. In Proc. 7th European Conference on Computer Vision (ECCV 2002, Parts I–IV), A. Heyden, G. Sparr, M. Nielsen, and P. Johansen (Eds.), vol. 2350–2353 of Lecture Notes in Computer Science (LNCS), pp. I:3–17, Copenhagen, Denmark. Springer-Verlag: Berlin, Heidelberg, New York, NY.
Pei, S.-C. and Liou, L.-G. 1998. Vehicle-type motion estimation by the fusion of image point and line features. Pattern Recognition, 31(3):333–344.
Article Google Scholar
Polat, E., Yeasin, M., and Sharma, R. 2003. A 2D/3D model-based object tracking framework. Pattern Recognition, 36(9):2127–2141.
Article Google Scholar
Prati, A., Mikic, I., Trivedi, M.M., and Cucchiara, R. 2003. Detecting moving shadows: Algorithms and evaluation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 25(7):918–923.
Article Google Scholar
Remagnino, P., Shihab, A.I., and Jones, G.A. 2004. Distributed intelligence for multi-camera visual surveillance. Pattern Recognition, 37(4):675–689.
Article Google Scholar
Ricquebourg, Y. and Bouthemy, P. 2000. Real-time tracking of moving persons by exploiting spatio-temporal image slices. IEEE Transactions on Pattern Analysis and Machine Intelligence, 22(8):797–808.
Article Google Scholar
Shashua, A. and Levin, A. 2001. Multi-frame infinitesimal motion model for the reconstruction of (dynamic) scenes with multiple linearly moving objects. In ICCV-2001, pp. II:592–599.
Sidenbladh, H., Black, M.J., and Fleet, D.J. 2000. Stochastic tracking of 3D human figures using 2D image motion. In Vernon (2000), pp. II:702–718.
Sivic, J., Schaffalitzky, F., and Zisserman, A. 2004. Object level grouping for video shots. In Pajdla and Matas (2004), pp. II:85–98.
Sminchisescu, C., Metaxas, D., and Dickinson, S. 2005. Incremental model-based estimation using geometric constraints. IEEE Transactions on Pattern Analysis and Machine Intelligence, 27(5):727–738.
Article Google Scholar
Stauffer, C. and Grimson, W.E.L. 2000. Learning patterns of activity using real-time tracking. IEEE Transactions on Pattern Analysis and Machine Intelligence, 22(8):747–757.
Article Google Scholar
Tan, T.N., Sullivan, G.D., and Baker, K.D. 1998. Model-based localisation and recognition of road vehicles. International Journal of Computer Vision, 27(1):5–25.
Article Google Scholar
Tissainayagam, P. and Suter, D. 2005. Object tracking in image sequences using point features. Pattern Recognition, 38(1):105–113.
Article Google Scholar
Tsap, L.V., Goldgof, D.B., and Sarkar, S. 2000. Nonrigid motion analysis based on dynamic refinement of finite element models. IEEE Transactions on Pattern Analysis and Machine Intelligence, 22(5):526–543.
Article Google Scholar
Vacchetti, L., Lepetit, V., and Fua, P. 2004. Stable real-time 3D tracking using online and offline information. IEEE Transactions on Pattern Analysis and Machine Intelligence, 26(10):1385–1391.
Article Google Scholar
Veenman, C.J., Reinders, M.J.T., and Backer, E. 2003. Motion tracking as a constrained optimization problem. Pattern Recognition, 36(9):2049–2067.
Article Google Scholar
Vernon, D. (Ed.) 2000. Proc. 6th European Conference on Computer Vision (ECCV 2000, Parts I–II), vol. 1842–1843 of Lecture Notes in Computer Science (LNCS), Dublin, Ireland. Springer-Verlag: Berlin, Heidelberg, New York, NY.
Wang, L., Hu, W., and Tan, T. 2003. Recent developments in human motion analysis. Pattern Recognition, 36(3):585–601.
Article Google Scholar
Westphal, H. and Nagel, H.-H. 1986. Towards the derivation of 3-D descriptions from image sequences for non-convex moving objects. Computer Vision, Graphics, and Image Processing, 34(3):302–320.
Article Google Scholar
Xiao, J. and Shah, M. 2004. Tri-view morphing. Computer Vision and Image Understanding, 96(3):294–326.
Article Google Scholar
Yam, C.Y., Nixon, M.S., and Carter, J.N. 2004. Automated person recognition by walking and running via model-based approaches. Pattern Recognition, 37(5):1057–1072.
Article Google Scholar
Zhang, Y., Rosenhahn, B., and Sommer, G. 2000. Extended Kalman filter design for motion estimation by point and line observations. In Proc. 2nd International Workshop on Algebraic Frames for the Perception-Action Cycle, G. Sommer and Y.Y. Zeevi (Eds.), vol. 1888 of Lecture Notes in Computer Science, Kiel, Germany. pp. 339–348, Springer-Verlag: Berlin, Heidelberg, New York, NY.
Zhu, Z. and Hanson, A.R. 2004. LAMP: 3D layered, adaptive-resolution, and multi-perspective panorama—A new scene representation. Computer Vision and Image Understanding, 96(3):294–326.
Article Google Scholar

Download references

Author information

Authors and Affiliations

Institut für Algorithmen und Kognitive Systeme, Universität Karlsruhe (TH), Postfach 6980, 76128, Karlsruhe, Germany
Hendrik Dahlkamp, Hans-Hellmut Nagel, Artur Ottlik & Paul Reuter

Authors

Hendrik Dahlkamp
View author publications
You can also search for this author in PubMed Google Scholar
Hans-Hellmut Nagel
View author publications
You can also search for this author in PubMed Google Scholar
Artur Ottlik
View author publications
You can also search for this author in PubMed Google Scholar
Paul Reuter
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Hans-Hellmut Nagel.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Dahlkamp, H., Nagel, HH., Ottlik, A. et al. A Framework for Model-Based Tracking Experiments in Image Sequences. Int J Comput Vision 73, 139–157 (2007). https://doi.org/10.1007/s11263-006-9786-4

Download citation

Received: 19 May 2006
Revised: 06 June 2006
Accepted: 06 June 2006
Published: 04 September 2006
Issue Date: June 2007
DOI: https://doi.org/10.1007/s11263-006-9786-4

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A Framework for Model-Based Tracking Experiments in Image Sequences

Abstract

Access this article

Similar content being viewed by others

Guided Search 6.0: An updated model of visual search

HOTA: A Higher Order Metric for Evaluating Multi-object Tracking

An Overview to Visual Odometry and Visual SLAM: Applications to Mobile Robotics

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

A Framework for Model-Based Tracking Experiments in Image Sequences

Abstract

Access this article

Similar content being viewed by others

Guided Search 6.0: An updated model of visual search

HOTA: A Higher Order Metric for Evaluating Multi-object Tracking

An Overview to Visual Odometry and Visual SLAM: Applications to Mobile Robotics

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation