The 2005 PASCAL Visual Object Classes Challenge

Everingham, Mark; Zisserman, Andrew; Williams, Christopher K. I.; Van Gool, Luc; Allan, Moray; Bishop, Christopher M.; Chapelle, Olivier; Dalal, Navneet; Deselaers, Thomas; Dorkó, Gyuri; Duffner, Stefan; Eichhorn, Jan; Farquhar, Jason D. R.; Fritz, Mario; Garcia, Christophe; Griffiths, Tom; Jurie, Frederic; Keysers, Daniel; Koskela, Markus; Laaksonen, Jorma; Larlus, Diane; Leibe, Bastian; Meng, Hongying; Ney, Hermann; Schiele, Bernt; Schmid, Cordelia; Seemann, Edgar; Shawe-Taylor, John; Storkey, Amos; Szedmak, Sandor; Triggs, Bill; Ulusoy, Ilkay; Viitaniemi, Ville; Zhang, Jianguo

doi:10.1007/11736790_8

Mark Everingham²²,
Andrew Zisserman²²,
Christopher K. I. Williams²³,
Luc Van Gool²⁴,
Moray Allan²³,
Christopher M. Bishop³¹,
Olivier Chapelle³²,
Navneet Dalal²⁹,
Thomas Deselaers²⁵,
Gyuri Dorkó²⁹,
Stefan Duffner²⁷,
Jan Eichhorn³²,
Jason D. R. Farquhar³³,
Mario Fritz²⁶,
Christophe Garcia²⁷,
Tom Griffiths²³,
Frederic Jurie²⁹,
Daniel Keysers²⁵,
Markus Koskela²⁸,
Jorma Laaksonen²⁸,
Diane Larlus²⁹,
Bastian Leibe²⁶,
Hongying Meng³³,
Hermann Ney²⁵,
Bernt Schiele²⁶,
Cordelia Schmid²⁹,
Edgar Seemann²⁶,
John Shawe-Taylor³³,
Amos Storkey²³,
Sandor Szedmak³³,
Bill Triggs²⁹,
Ilkay Ulusoy³⁰,
Ville Viitaniemi²⁸ &
…
Jianguo Zhang²⁹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3944))

Included in the following conference series:

Machine Learning Challenges Workshop

2810 Accesses

Abstract

The PASCAL Visual Object Classes Challenge ran from February to March 2005. The goal of the challenge was to recognize objects from a number of visual object classes in realistic scenes (i.e. not pre-segmented objects). Four object classes were selected: motorbikes, bicycles, cars and people. Twelve teams entered the challenge. In this chapter we provide details of the datasets, algorithms used by the teams, evaluation criteria, and results achieved.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Visual Object Class Recognition

Detecting Twenty-Thousand Classes Using Image-Level Supervision

Enhancing Object Detection Capabilities: A Comprehensive Exploration and Fine-Tuning of YOLOv5 Algorithm Across Diverse Datasets

References

Agarwal, S., Awan, A., Roth, D.: Learning to detect objects in images via a sparse, part-based representation. IEEE Transactions on Pattern Analysis and Machine Intelligence 20(11), 1475–1490 (2004)
Article Google Scholar
Barker, M., Rayens, W.: Partial least squares for discrimination. Journal of Chemometrics 17, 166–173 (2003)
Article Google Scholar
Barnard, K., Duygulu, P., Forsyth, D., Freitas, N., Blei, D., Jordan, M.I.: Matching words and pictures. Journal of Machine Learning Research 3, 1107–1135 (2003)
MATH Google Scholar
Bishop, C.M.: Neural Networks for Pattern Recognition. Oxford University Press, Oxford (1995)
MATH Google Scholar
Borenstein, E., Ullman, S.: Class-specific, top-down segmentation. In: Heyden, A., Sparr, G., Nielsen, M., Johansen, P. (eds.) ECCV 2002. LNCS, vol. 2351, pp. 109–122. Springer, Heidelberg (2002)
Chapter Google Scholar
Chang, C.-C., Lin, C.-J.: LIBSVM: a library for support vector machines (2001), Software available at: http://www.csie.ntu.edu.tw/~cjlin/libsvm
Chapelle, O., Haffner, P., Vapnik, V.: Support vector machines for histogrambased image classification. IEEE Transactions on Neural Networks 10(5), 1055–1064 (1999)
Article Google Scholar
Comaniciu, D., Meer, P.: Distribution free decomposition of multivariate data. Pattern Analysis and Applications 2, 22–30 (1999)
Article MATH Google Scholar
Comaniciu, D., Ramesh, V., Meer, P.: The variable bandwidth mean shift and data-driven scale selection. In: Proceedings of the 8th IEEE International Conference on Computer Vision, Vancouver, Canada, July 2001, vol. 1, pp. 438–445 (2001)
Google Scholar
Csurka, G., Dance, C., Fan, L., Williamowski, J., Bray, C.: Visual categorization with bags of keypoints. In: ECCV 2004 Workshop on Statistical Learning in Computer Vision, pp. 59–74 (2004)
Google Scholar
Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition, San Diego, CA, USA, June 2005, pp. 886–893 (2005)
Google Scholar
Deselaers, T., Keysers, D., Ney, H.: Discriminative training for object recognition using image patches. In: Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition, San Diego, CA, USA, June 2005, vol. 2, pp. 157–162 (2005)
Google Scholar
Deselaers, T., Keysers, D., Ney, H.: Improving a discriminative approach to object recognition using image patches. In: Kropatsch, W.G., Sablatnig, R., Hanbury, A. (eds.) DAGM 2005. LNCS, vol. 3663, pp. 326–333. Springer, Heidelberg (2005)
Chapter Google Scholar
Dorko, G., Schmid, C.: Selection of scale-invariant parts for object class recognition. In: Proceedings of the 9th IEEE International Conference on Computer Vision, Nice, France, October 2003, pp. 634–640 (2003)
Google Scholar
Dorkó, G., Schmid, C.: Object class recognition using discriminative local features. Technical report, INRIA (February 2005)
Google Scholar
Eichhorn, J., Chapelle, O.: Object categorization with SVM: kernels for local features. Technical report, Max Planck Institute for Biological Cybernetics (July 2004)
Google Scholar
Fei-Fei, L., Fergus, R., Perona, P.: Learning generative visual models from few training examples: an incremental bayesian approach tested on 101 object categories. In: Proceedings of the Workshop on Generative-Model Based Vision, Washington, DC, USA (June 2004)
Google Scholar
Fergus, R., Perona, P., Zisserman, A.: Object class recognition by unsupervised scale-invariant learning. In: Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition, Madison, Wisconsin, USA (June 2003)
Google Scholar
Fritz, M., Leibe, B., Caputo, B., Schiele, B.: Integrating representative and discriminant models for object category detection. In: Proceedings of the 10th IEEE International Conference on Computer Vision, Beijing, China (October 2005)
Google Scholar
Garcia, C., Delakis, M.: Convolutional face finder: A neural architecture for fast and robust face detection. IEEE Transactions on Pattern Analysis and Machine Intelligence 26(11), 1408–1423 (2004)
Article Google Scholar
Harris, C., Stephens, M.: A combined corner and edge detector. In: Proceedings of the 4th Alvey Vision Conference, pp. 147–151 (1988)
Google Scholar
Joachims, T.: Text categorization with support vector machines: Learning with many relevant features. In: Nédellec, C., Rouveirol, C. (eds.) Proceedings of the 10th European Conference on Machine Learning, Chemnitz, Germany, pp. 137–142. Springer, Heidelberg (1998)
Google Scholar
Joachims, T.: Making large-scale SVM learning practical. In: Schölkopf, B., Burges, C., Smola, A. (eds.) Advances in Kernel Methods - Support Vector Learning. The MIT Press, Cambridge (1999)
Google Scholar
Jurie, F., Triggs, W.: Creating efficient codebooks for visual recognition. In: Proceedings of the 10th IEEE International Conference on Computer Vision, Beijing, China (2005)
Google Scholar
Kondor, R., Jebara, T.: A kernel between sets of vectors. In: Proceedings of the 20th International Conference on Machine Learning, Washingon, DC, USA (2003)
Google Scholar
Laaksonen, J., Koskela, M., Oja, E.: PicSOM—Self-organizing image retrieval with MPEG-7 content descriptions. IEEE Transactions on Neural Networks, Special Issue on Intelligent Multimedia Processing 13(4), 841–853 (2002)
Article MATH Google Scholar
Larlus, D.: Creation de vocabulaires visuels efficaces pour la categorization d’images. Master’s thesis, Image Vision Robotic, INPG and UJF (June 2005)
Google Scholar
Leibe, B., Leonardis, A., Schiele, B.: Combined object categorization and segmentation with an implicit shape model. In: ECCV 2004 Workshop on Statistical Learning in Computer Vision, Prague, Czech Republic, May 2004, pp. 17–32 (2004)
Google Scholar
Leibe, B., Schiele, B.: Scale invariant object categorization using a scale-adaptive mean-shift search. In: Proceedings of the 26th DAGM Annual Pattern Recognition Symposium, Tuebingen, Germany (August 2004)
Google Scholar
Leibe, B., Seemann, E., Schiele, B.: Pedestrian detection in crowded scenes. In: Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition, San Diego, CA, USA (June 2005)
Google Scholar
Lindeberg, T.: Feature detection with automatic scale selection. International Journal of Computer Vision 30(2), 79–116 (1998)
Article Google Scholar
Lowe, D.G.: Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision 60(2), 91–110 (2004)
Article Google Scholar
Meng, H., Shawe-Taylor, J., Szedmak, S., Farquhar, J.R.D.: Support vector machine to synthesise kernels. In: Proceedings of the Sheffield Machine Learning Workshop, Sheffield, UK (2004)
Google Scholar
Mettu, R.R., Plaxton, C.G.: The online median problem. In: Proceedings of the 41st Annual Symposium on Foundations of Computer Science, p. 339. IEEE Computer Society, Los Alamitos (2000)
Chapter Google Scholar
Mikolajczyk, K., Leibe, B., Schiele, B.: Local features for object class recognition. In: Proceedings of the 10th IEEE International Conference on Computer Vision, Beijing, China (October 2005)
Google Scholar
Mikolajczyk, K., Schmid, C.: A performance evaluation of local descriptors. In: Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition, Madison, Wisconsin, USA, June 2003, vol. 2, pp. 257–263 (2003)
Google Scholar
Mikolajczyk, K., Schmid, C.: Scale and affine invariant interest point detectors. International Journal of Computer Vision 60, 63–86 (2004)
Article Google Scholar
Mikolajczyk, K., Schmid, C.: A performance evaluation of local descriptors. IEEE Transactions on Pattern Analysis and Machine Intelligence 27(10), 1615–1630 (2005)
Article Google Scholar
Opelt, A., Fussenegger, A., Pinz, A., Auer, P.: Weak hypotheses and boosting for generic object detection and recognition. In: Proceedings of the 8th European Conference on Computer Vision, Prague, Czech Republic, vol. 2, pp. 71–84 (2004)
Google Scholar
Schölkopf, B., Smola, A.: Learning with Kernels: Support Vector Machines, Regularization, Optimization and Beyond. The MIT Press, Cambridge (2002)
Google Scholar
Seemann, E., Leibe, B., Mikolajczyk, K., Schiele, B.: An evaluation of local shape-based features for pedestrian detection. In: Proceedings of the 16th British Machine Vision Conference, Oxford, UK (2005)
Google Scholar
Shawe-Taylor, J., Cristianini, N.: Kernel Methods for Pattern Analysis. Cambridge University Press, Cambridge (2004)
Book MATH Google Scholar
Weber, M., Welling, M., Perona, P.: Unsupervised learning of models for recognition. In: Proceedings of the 6th European Conference on Computer Vision, Dublin, Ireland, pp. 18–32 (2000)
Google Scholar
Zhang, J., Marszalek, M., Lazebnik, S., Schmid, C.: Local features and kernels for classification of texture and object categories: An in-depth study. Technical report, INRIA (2005)
Google Scholar

Download references

Author information

Authors and Affiliations

University of Oxford, Oxford, UK
Mark Everingham & Andrew Zisserman
University of Edinburgh, Edinburgh, UK
Christopher K. I. Williams, Moray Allan, Tom Griffiths & Amos Storkey
ETH Zentrum, Zurich, Switzerland
Luc Van Gool
RWTH Aachen University, Aachen, Germany
Thomas Deselaers, Daniel Keysers & Hermann Ney
TU-Darmstadt, Darmstadt, Germany
Mario Fritz, Bastian Leibe, Bernt Schiele & Edgar Seemann
France Télécom, Cesson Sévigné, France
Stefan Duffner & Christophe Garcia
Helsinki University of Technology, Helsinki, Finland
Markus Koskela, Jorma Laaksonen & Ville Viitaniemi
INRIA Rhône-Alpes, Montbonnot, France
Navneet Dalal, Gyuri Dorkó, Frederic Jurie, Diane Larlus, Cordelia Schmid, Bill Triggs & Jianguo Zhang
Middle East Technical University, Ankara, Turkey
Ilkay Ulusoy
Microsoft Research, Cambridge, UK
Christopher M. Bishop
Max Planck Institute for Biological Cybernetics, Tübingen, Germany
Olivier Chapelle & Jan Eichhorn
University of Southampton, Southampton, UK
Jason D. R. Farquhar, Hongying Meng, John Shawe-Taylor & Sandor Szedmak

Authors

Mark Everingham
View author publications
You can also search for this author in PubMed Google Scholar
Andrew Zisserman
View author publications
You can also search for this author in PubMed Google Scholar
Christopher K. I. Williams
View author publications
You can also search for this author in PubMed Google Scholar
Luc Van Gool
View author publications
You can also search for this author in PubMed Google Scholar
Moray Allan
View author publications
You can also search for this author in PubMed Google Scholar
Christopher M. Bishop
View author publications
You can also search for this author in PubMed Google Scholar
Olivier Chapelle
View author publications
You can also search for this author in PubMed Google Scholar
Navneet Dalal
View author publications
You can also search for this author in PubMed Google Scholar
Thomas Deselaers
View author publications
You can also search for this author in PubMed Google Scholar
Gyuri Dorkó
View author publications
You can also search for this author in PubMed Google Scholar
Stefan Duffner
View author publications
You can also search for this author in PubMed Google Scholar
Jan Eichhorn
View author publications
You can also search for this author in PubMed Google Scholar
Jason D. R. Farquhar
View author publications
You can also search for this author in PubMed Google Scholar
Mario Fritz
View author publications
You can also search for this author in PubMed Google Scholar
Christophe Garcia
View author publications
You can also search for this author in PubMed Google Scholar
Tom Griffiths
View author publications
You can also search for this author in PubMed Google Scholar
Frederic Jurie
View author publications
You can also search for this author in PubMed Google Scholar
Daniel Keysers
View author publications
You can also search for this author in PubMed Google Scholar
Markus Koskela
View author publications
You can also search for this author in PubMed Google Scholar
Jorma Laaksonen
View author publications
You can also search for this author in PubMed Google Scholar
Diane Larlus
View author publications
You can also search for this author in PubMed Google Scholar
Bastian Leibe
View author publications
You can also search for this author in PubMed Google Scholar
Hongying Meng
View author publications
You can also search for this author in PubMed Google Scholar
Hermann Ney
View author publications
You can also search for this author in PubMed Google Scholar
Bernt Schiele
View author publications
You can also search for this author in PubMed Google Scholar
Cordelia Schmid
View author publications
You can also search for this author in PubMed Google Scholar
Edgar Seemann
View author publications
You can also search for this author in PubMed Google Scholar
John Shawe-Taylor
View author publications
You can also search for this author in PubMed Google Scholar
Amos Storkey
View author publications
You can also search for this author in PubMed Google Scholar
Sandor Szedmak
View author publications
You can also search for this author in PubMed Google Scholar
Bill Triggs
View author publications
You can also search for this author in PubMed Google Scholar
Ilkay Ulusoy
View author publications
You can also search for this author in PubMed Google Scholar
Ville Viitaniemi
View author publications
You can also search for this author in PubMed Google Scholar
Jianguo Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Max Planck Institute for Biological Cybernetics, Spemannstr. 38, Tübingen, Germany
Joaquin Quiñonero-Candela
Bar Ilan University, 52900, Ramat Gan, Israel
Ido Dagan
ITC-IRST, Trento, Italy
Bernardo Magnini
Université d’Evry-Val d’Essonne, IBISC CNRS FRE 2873 and GENPOLE, 523, Place des terrasses, 91000, Evry, France
Florence d’Alché-Buc

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Everingham, M. et al. (2006). The 2005 PASCAL Visual Object Classes Challenge. In: Quiñonero-Candela, J., Dagan, I., Magnini, B., d’Alché-Buc, F. (eds) Machine Learning Challenges. Evaluating Predictive Uncertainty, Visual Object Classification, and Recognising Tectual Entailment. MLCW 2005. Lecture Notes in Computer Science(), vol 3944. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11736790_8

Download citation

DOI: https://doi.org/10.1007/11736790_8
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-33427-9
Online ISBN: 978-3-540-33428-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics