Skip to main content
Log in

A Framework for Model-Based Tracking Experiments in Image Sequences

  • Published:
International Journal of Computer Vision Aims and scope Submit manuscript

Abstract

Motris, an integrated system for model-based tracking research, has been designed modularly to study the effects of algorithmic variations on tracking results. Motris attempts to avoid introducing bias into the relative assessment of alternative approaches. Such a bias may be caused by differences of implementation and parameterization if the component approaches are evaluated in separate testing environments. Tracking results are evaluated automatically on a significant test sample in order to quantify the effects of different combinations of alternatives. The Motris system environment thus allows an in-depth comparison between the so-called ‘Edge-Element Association’ approach documented in Haag and Nagel (1999) and the more recent ‘Expectation-Maximization’ approach reported by Pece and Worrall (2002).

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Badenas, J., Sanchiz, J.M., and Pla, F. 2001. Motion-based segmentation and region tracking in image sequences. Pattern Recognition, 34(3):661–670.

    Article  Google Scholar 

  • Brand, M. and Kettnaker, V. 2000. Discovery and segmentation of activities in video. IEEE Transactions on Pattern Analysis and Machine Intelligence, 22(8):844–851.

    Article  Google Scholar 

  • Collins, R.T., Lipton, A.J., and Kanade, T. 2000. Introduction to the special section on video surveillance. IEEE Transactions on Pattern Analysis and Machine Intelligence, 22(8):745–746.

    Article  Google Scholar 

  • Cortelazzo, G.M. and Guerra, C. 2004. Model-based and Image-based 3D scene representation for interactive visualization. Computer Vision and Image Understanding, 96(3):269–273.

    Article  Google Scholar 

  • Cremers, D. and Schnörr, C. 2003. Statistical shape knowledge in variational motion segmentation. Image and Vision Computing, 21(1):77–86.

    Article  Google Scholar 

  • Cucchiara, R., Grana, C., Piccardi, M., and Prati, A. 2003. Detecting moving objects, ghosts, and shadows in video streams. IEEE Transactions on Pattern Analysis and Machine Intelligence, 25(10):1337–1342.

    Article  Google Scholar 

  • Dahlkamp, H., Ottlik, A., Reuter, P., and Nagel, H.-H. 2004a. Model Based TRacking in Image Sequences—A framework for experiments. Technical report, Institut für Algorithmen und Kognitive Systeme, Fakultät für Informatik der Universität Karlsruhe (TH), 76128 Karlsruhe, Germany. See, too, Motris Homepage.

  • Dahlkamp, H., Pece, A.E.C., Ottlik, A., and Nagel, H.-H. 2004b. Differential analysis of two model-based vehicle tracking approaches. In Pattern Recognition, Proc. 26th DAGM-Symposium (DAGM’04), C.E. Rasmussen, H.H. Bülthoff, M.A. Giese, and B. Schölkopf (Eds.), vol. 3175 of Lecture Notes in Computer Science, pp. 71–78. Tübingen, Germany. Springer: Berlin, Heidelberg, New York, NY.

  • Dahlkamp, H., Ottlik, A., and Nagel, H.-H. 2006. Comparison of edge-driven algorithms for model-based motion estimation. In Proc. First International Workshop on Spatial Coherency for Visual Motion Analysis (SCVMA 2004), W.J. MacLean (Ed.), Prague, Czech Republic, vol. 3667 of Lecture Notes in Computer Science, pp. 38–50. Springer: Berlin, Heidelberg, New York, NY.

  • DeCarlo, D. and Metaxas, D. 1996. The integration of optical flow and deformable models with applications to human face shape and motion estimation. In Proc. IEEE Conference on Computer Vision and Pattern Recognition, San Francisco, CA, pp. 231–238. IEEE Computer Society: Los Alamitos, CA.

  • DeCarlo, D. and Metaxas, D. 1999. Combining information using hard constraints. In Proc. IEEE Conference on Computer Vision and Pattern Recognition, Fort Collins, CO, pp. II:132–138. IEEE Computer Society: Los Alamitos, CA.

  • Duric, Z., Goldenberg, R., Rivlin, E., and Rosenfeld, A. 2002. Estimating relative vehicle motions in traffic scenes. Pattern Recognition, 35(6):1339–1353.

    Article  Google Scholar 

  • Elgammal, A., Duraiswami, R., Harwood, D., and Davis, L.S. 2002. Background and foreground modeling using nonparametric kernel density estimation for visual surveillance. Proceedings of the IEEE, 90(7):1151–1163.

    Google Scholar 

  • Ferryman, J.M. (Ed.) 2003. Proc. Joint IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance (VS–PETS), Nice, France. The University of Reading. ISBN 0-7695-2022-7.

  • Ferryman, J.M., Maybank, S.J., and Worrall, A.D. 1998. Visual surveillance for moving vehicles. In 1998 IEEE Workshop on Visual Surveillance, S.J. Maybank and T. Tan (Eds.), Bombay, India, pp. 73–80. IEEE Computer Society: Los Alamitos, CA.

  • Ferryman, J.M., Maybank, S.J., and Worrall, A.D. 2000. Visual surveillance for moving vehicles. International Journal of Computer Vision, 37(2):187–197.

    Article  Google Scholar 

  • Fitzgibbon, A.W. and Zisserman, A. 2000. Multibody structure and motion: 3-D reconstruction of independently moving objects. In Vernon (2000), pp. II:891–906.

  • Fua, P. 2000. Regularized bundle-adjustment to model heads from image sequences without calibration data. International Journal of Computer Vision, 38(2):153–171.

    Article  Google Scholar 

  • Gavrila, D.M. 1999. The visual analysis of human movement: A survey. Computer Vision and Image Understanding, 73(1):82–98.

    Article  Google Scholar 

  • Haag, M. 1998. Bildfolgenauswertung zur Erkennung der Absichten von Straßenverkehrsteilnehmern, vol. 193 of Dissertationen zur Künstlichen Intelligenz (DISKI). infix–Verlag, St. Augustin/ Germany, Dissertation, Fakultät für Informatik der Universität Karlsruhe (TH), Juli 1998 (in German).

  • Haag, M. and Nagel, H.-H. 1999. Combination of edge element and optical flow estimates for 3D-model-based vehicle tracking in traffic image sequences. International Journal of Computer Vision, 35(3):295–319.

    Article  Google Scholar 

  • Horn, B.K.P. 1986. Robot Vision. The MIT Press: Cambridge, MA, London, UK.

    Google Scholar 

  • IAKS Image Sequences: http://kogs.iaks.uni-karls-ruhe.de/image_sequences/.

  • ICCV-2001: Eighth International Conference on Computer Vision, volume I + II, Vancouver, BC, Canada. IEEE Computer Society: Los Alamitos, CA.

  • Jang, D.-S. and Choi, H.-I. 2000. Active models for tracking moving objects. Pattern Recognition, 33(7):1135–1146.

    Article  Google Scholar 

  • Jurie, F. and Dhome, M. 2002. Real time tracking of 3D objects: An efficient and robust approach. Pattern Recognition, 35(2):317–328.

    Article  Google Scholar 

  • Kakadiaris, I. and Metaxas, D. 2000. Model-based estimation of 3D human motion. IEEE Transactions on Pattern Analysis and Machine Intelligence, 22(12):1453–1459.

    Article  Google Scholar 

  • Kang, D.J., Ha, J.E., and Kweon, I.S. 2003. Fast object recognition using dynamic programming from combination of salient line groups. Pattern Recognition, 36(1):79–90.

    Article  Google Scholar 

  • Kang, H.-G. and Kim, D. 2005. Real-time multiple people tracking using competitive condensation. Pattern Recognition, 38(7):1045–1058.

    Article  MathSciNet  Google Scholar 

  • Kastrinaki, V., Zervakis, M., and Kalaitzakis, K. 2003. A survey of video processing techniques for traffic applications. Image and Vision Computing, 21(4):359–381.

    Article  Google Scholar 

  • Kato, J., Watanabe, T., Joga, S., Rittscher, J., and Blake, A. 2002. An HMM-based segmentation method for traffic monitoring movies. IEEE Transactions on Pattern Analysis and Machine Intelligence, 24(9):1291–1296.

    Article  Google Scholar 

  • Kim, Z. and Malik, J. 2003. Fast vehicle detection with probabilistic feature grouping and its application to vehicle tracking. In Proc. Ninth International Conference on Computer Vision, vol. I+II, Nice, France, pp. I:524–531. IEEE Computer Society: Los Alamitos, CA.

  • Kollnig, H. and Nagel, H.-H. 1995. 3D pose estimation by fitting image gradients directly to polyhedral models. In Proc. Fifth International Conference on Computer Vision, Cambridge, MA, pp. 569–574, IEEE Computer Society: Los Alamitos, CA.

  • Kollnig, H. and Nagel, H.-H. 1996. Matching object models to segments from an optical flow field. In Proc. 4th European Conference on Computer Vision, B. Buxton and R. Cipolla (Eds.), vol. 1064–1065 of Lecture Notes in Computer Science (LNCS), pp. II:388–399, Cambridge, UK. Springer-Verlag: Berlin, Heidelberg, New York, NY.

  • Kumar, P., Ranganath, S., Sengupta, K., and Weimin, H. 2004. Co-operative multi-target tracking and classification. In Pajdla and Matas (2004), pp. I:376–389.

  • Lee, L., Romano, R., and Stein, G. 2000. Monitoring activities from multiple video streams: Establishing a common coordinate frame. IEEE Transactions on Pattern Analysis and Machine Intelligence, 22(8):758–767.

    Article  Google Scholar 

  • Lerdsudwichai, C., Abdel-Mottaleb, M., and Ansari, A.-N. 2005. Tracking multiple people with recovery from partial and total occlusion. Pattern Recognition, 38(7):1059–1070.

    Article  Google Scholar 

  • Liu, Q., Lou, J., Hu, W., and Tan, T. 2003. Model based pose determination using bayes classification error. In Ferryman (2003), pp. 38–45, ISBN 0-7695-2022-7.

  • Magee, D.R. 2004. Tracking multiple vehicles using foreground, background and motion models. Image and Vision Computing, 22(2):143–155.

    Article  MathSciNet  Google Scholar 

  • Mansuri, A.-R. 2002. Region tracking via level set PDEs without motion computation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 24(7):947–961.

    Article  Google Scholar 

  • Middendorf, M. 2004. Zur Auswertung lokaler Grauwertstrukturen. (in German, ISBN 3-8334-1175-9).

  • Middendorf, M. and Nagel, H.-H. 2001. Estimation and interpretation of discontinuities in optical flow fields. In ICCV-2001, pp. I:178–183.

  • Moeslund, T.B. and Granum, E. 2001. A survey of computer vision-based human motion capture. Computer Vision and Image Understanding, 81(3):231–268.

    Article  Google Scholar 

  • Motamed, C. and Wallart, O. 2003. A cooperative distributed vision algorithm for wide area vehicle tracking. In Ferryman (2003), pp. 86–93, ISBN 0-7695-2022-7.

  • Motris Homepage: http://i21www.iaks.uni-karls-ruhe.de/motris/.

  • Nagel, H.-H. 2004. Steps toward a cognitive vision system. AI-Magazine, 25(2):31–50.

    MathSciNet  Google Scholar 

  • Ning, H., Tan, T., Wang, L., and Hu, W. 2004. People tracking based on motion model and motion constraints with automatic initialization. Pattern Recognition, 37(7):1423–1440.

    Article  Google Scholar 

  • Nummiaro, K., Koller-Meier, E., and Van Gool, L. 2003. An adaptive color-based particle filter. Image and Vision Computing, 21(1):99–110.

    Article  Google Scholar 

  • Pai, C.-H., Tyan, H.-R., Liang, Y.-M., Liao, H.-Y.M., and Chen, S.-W. 2004. Pedestrian detection and tracking at crossroads. Pattern Recognition, 37(5):1025–1034.

    Article  Google Scholar 

  • Pajdla, T. and Matas, J. (Eds.) 2004. Proc. 8th European Conference on Computer Vision (ECCV 2004, Parts I–IV), vol. 3021–3024 of Lecture Notes in Computer Science (LNCS), Prague, Czech Republic. Springer-Verlag: Berlin, Heidelberg, New York, NY.

  • Pece, A.E.C. 2003a. Generative model based vision. Image and Vision Computing, 21(1):1–3.

    Article  Google Scholar 

  • Pece, A.E.C. 2003b. The Kalman-EM contour tracker. In Proceedings of the 3rd Workshop on Statistical and Computational Theories of Vision (SCTV 2003), Nice, France.

  • Pece, A.E.C. and Worrall, A.D. 2002. Tracking with the EM contour algorithm. In Proc. 7th European Conference on Computer Vision (ECCV 2002, Parts I–IV), A. Heyden, G. Sparr, M. Nielsen, and P. Johansen (Eds.), vol. 2350–2353 of Lecture Notes in Computer Science (LNCS), pp. I:3–17, Copenhagen, Denmark. Springer-Verlag: Berlin, Heidelberg, New York, NY.

  • Pei, S.-C. and Liou, L.-G. 1998. Vehicle-type motion estimation by the fusion of image point and line features. Pattern Recognition, 31(3):333–344.

    Article  Google Scholar 

  • Polat, E., Yeasin, M., and Sharma, R. 2003. A 2D/3D model-based object tracking framework. Pattern Recognition, 36(9):2127–2141.

    Article  Google Scholar 

  • Prati, A., Mikic, I., Trivedi, M.M., and Cucchiara, R. 2003. Detecting moving shadows: Algorithms and evaluation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 25(7):918–923.

    Article  Google Scholar 

  • Remagnino, P., Shihab, A.I., and Jones, G.A. 2004. Distributed intelligence for multi-camera visual surveillance. Pattern Recognition, 37(4):675–689.

    Article  Google Scholar 

  • Ricquebourg, Y. and Bouthemy, P. 2000. Real-time tracking of moving persons by exploiting spatio-temporal image slices. IEEE Transactions on Pattern Analysis and Machine Intelligence, 22(8):797–808.

    Article  Google Scholar 

  • Shashua, A. and Levin, A. 2001. Multi-frame infinitesimal motion model for the reconstruction of (dynamic) scenes with multiple linearly moving objects. In ICCV-2001, pp. II:592–599.

  • Sidenbladh, H., Black, M.J., and Fleet, D.J. 2000. Stochastic tracking of 3D human figures using 2D image motion. In Vernon (2000), pp. II:702–718.

  • Sivic, J., Schaffalitzky, F., and Zisserman, A. 2004. Object level grouping for video shots. In Pajdla and Matas (2004), pp. II:85–98.

  • Sminchisescu, C., Metaxas, D., and Dickinson, S. 2005. Incremental model-based estimation using geometric constraints. IEEE Transactions on Pattern Analysis and Machine Intelligence, 27(5):727–738.

    Article  Google Scholar 

  • Stauffer, C. and Grimson, W.E.L. 2000. Learning patterns of activity using real-time tracking. IEEE Transactions on Pattern Analysis and Machine Intelligence, 22(8):747–757.

    Article  Google Scholar 

  • Tan, T.N., Sullivan, G.D., and Baker, K.D. 1998. Model-based localisation and recognition of road vehicles. International Journal of Computer Vision, 27(1):5–25.

    Article  Google Scholar 

  • Tissainayagam, P. and Suter, D. 2005. Object tracking in image sequences using point features. Pattern Recognition, 38(1):105–113.

    Article  Google Scholar 

  • Tsap, L.V., Goldgof, D.B., and Sarkar, S. 2000. Nonrigid motion analysis based on dynamic refinement of finite element models. IEEE Transactions on Pattern Analysis and Machine Intelligence, 22(5):526–543.

    Article  Google Scholar 

  • Vacchetti, L., Lepetit, V., and Fua, P. 2004. Stable real-time 3D tracking using online and offline information. IEEE Transactions on Pattern Analysis and Machine Intelligence, 26(10):1385–1391.

    Article  Google Scholar 

  • Veenman, C.J., Reinders, M.J.T., and Backer, E. 2003. Motion tracking as a constrained optimization problem. Pattern Recognition, 36(9):2049–2067.

    Article  Google Scholar 

  • Vernon, D. (Ed.) 2000. Proc. 6th European Conference on Computer Vision (ECCV 2000, Parts I–II), vol. 1842–1843 of Lecture Notes in Computer Science (LNCS), Dublin, Ireland. Springer-Verlag: Berlin, Heidelberg, New York, NY.

  • Wang, L., Hu, W., and Tan, T. 2003. Recent developments in human motion analysis. Pattern Recognition, 36(3):585–601.

    Article  Google Scholar 

  • Westphal, H. and Nagel, H.-H. 1986. Towards the derivation of 3-D descriptions from image sequences for non-convex moving objects. Computer Vision, Graphics, and Image Processing, 34(3):302–320.

    Article  Google Scholar 

  • Xiao, J. and Shah, M. 2004. Tri-view morphing. Computer Vision and Image Understanding, 96(3):294–326.

    Article  Google Scholar 

  • Yam, C.Y., Nixon, M.S., and Carter, J.N. 2004. Automated person recognition by walking and running via model-based approaches. Pattern Recognition, 37(5):1057–1072.

    Article  Google Scholar 

  • Zhang, Y., Rosenhahn, B., and Sommer, G. 2000. Extended Kalman filter design for motion estimation by point and line observations. In Proc. 2nd International Workshop on Algebraic Frames for the Perception-Action Cycle, G. Sommer and Y.Y. Zeevi (Eds.), vol. 1888 of Lecture Notes in Computer Science, Kiel, Germany. pp. 339–348, Springer-Verlag: Berlin, Heidelberg, New York, NY.

  • Zhu, Z. and Hanson, A.R. 2004. LAMP: 3D layered, adaptive-resolution, and multi-perspective panorama—A new scene representation. Computer Vision and Image Understanding, 96(3):294–326.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hans-Hellmut Nagel.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Dahlkamp, H., Nagel, HH., Ottlik, A. et al. A Framework for Model-Based Tracking Experiments in Image Sequences. Int J Comput Vision 73, 139–157 (2007). https://doi.org/10.1007/s11263-006-9786-4

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11263-006-9786-4

Keywords

Navigation