Active Structured Learning for High-Speed Object Detection

Lampert, Christoph H.; Peters, Jan

doi:10.1007/978-3-642-03798-6_23

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 5748)

Included in the following conference series:

Joint Pattern Recognition Symposium

Abstract

High-speed, smooth, and accurate visual tracking of objects in arbitrary, unstructured environments is essential for robotics and human motion analysis. However, building a system that can adapt to arbitrary objects and a wide range of lighting conditions is a challenging problem, especially if hard real-time constraints apply, as in robotics scenarios. In this work, we introduce a method for learning a discriminative object tracking system based on the recent structured regression framework for object localization. Using a kernel function that allows fast evaluation on the GPU, the resulting system can process video streams at speeds of 100 frames per second or more.

Consecutive frames in high-speed video sequences are typically very redundant, and for training an object detection system, it is sufficient to have training labels for only a subset of all images. We propose an active learning method that selects training examples in a data-driven way, thereby minimizing the number of training labels required. Experiments on realistic data show that active learning is superior to previously used methods for dataset subsampling for this task.

Author information

Authors and Affiliations

Max Planck Institute for Biological Cybernetics, Tübingen, Germany
Christoph H. Lampert & Jan Peters

Authors

Christoph H. Lampert
Jan Peters

Editor information

Editors and Affiliations

Digitale Bildverarbeitung, Universität Jena, Ernst-Abbe-Platz 2, 07743, Jena, Germany
Joachim Denzler & Herbert Süße
Fraunhofer-Institut für Angewandte Optik und Feinmechanik, Albert-Einstein-Str. 7, 07745, Jena, Germany
Gunther Notni

About this paper

Cite this paper

Lampert, C.H., Peters, J. (2009). Active Structured Learning for High-Speed Object Detection. In: Denzler, J., Notni, G., Süße, H. (eds) Pattern Recognition. DAGM 2009. Lecture Notes in Computer Science, vol 5748. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-03798-6_23

DOI: https://doi.org/10.1007/978-3-642-03798-6_23
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-03797-9
Online ISBN: 978-3-642-03798-6
eBook Packages: Computer Science, Computer Science (R0)
