Class Representative Visual Words for Category-Level Object Recognition

López Sastre, Roberto Javier; Tuytelaars, Tinne; Maldonado Bascón, Saturnino

doi:10.1007/978-3-642-02172-5_25

Roberto Javier López Sastre²⁰,
Tinne Tuytelaars²¹ &
Saturnino Maldonado Bascón²⁰

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 5524))

Included in the following conference series:

Iberian Conference on Pattern Recognition and Image Analysis

1812 Accesses

Abstract

Recent works in object recognition often use visual words, i.e. vector quantized local descriptors extracted from the images. In this paper we present a novel method to build such a codebook with class representative vectors. This method, coined Cluster Precision Maximization (CPM), is based on a new measure of the cluster precision and on an optimization procedure that leads any clustering algorithm towards class representative visual words. We compare our procedure with other measures of cluster precision and present the integration of a Reciprocal Nearest Neighbor (RNN) clustering algorithm in the CPM method. In the experiments, on a subset of the the Caltech101 database, we analyze several vocabularies obtained with different local descriptors and different clustering algorithms, and we show that the vocabularies obtained with the CPM process perform best in a category-level object recognition system using a Support Vector Machine (SVM).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Vector Quantization Enhancement for Computer Vision Tasks

Incremental Estimation of Visual Vocabulary Size for Image Retrieval

Multiple Instance Classification in the Image Domain

References

Csurka, G., Dance, C.R., Fan, L., Willamowski, J., Bray, C.: Visual categorization with bags of keypoints. In: Proceedings of the ECCV (2004)
Google Scholar
Lowe, D.: Object recognition from local scale-invariant features. In: ICCV (1999)
Google Scholar
Felzenszwalb, P., McAllester, D., Ramanan, D.: A discriminatively trained, multiscale, deformable part model. In: Proceedings of the CVPR (2008)
Google Scholar
van de Sande, K., Gevers, T., Snoek, C.: Evaluation of color descriptors for object and scene recognition. In: Proceedings of the CVPR (2008)
Google Scholar
Everingham, M., et al.: The PASCAL voc 2008 Results (2008), http://www.pascal-network.org/challenges/VOC/voc2008/workshop/index.html
Tuytelaars, T., Mikolajczyk, K.: Local invariant feature detectors: A survey. Foundations and Trends in Computer Graphics and Vision 3(3), 177–280 (2008)
Article Google Scholar
Mikolajczyk, K., Schmid, C.: A performance evaluation of local descriptors. IEEE Transactions on PAMI 27(10), 1615–1630 (2005)
Article Google Scholar
Sivic, J., Zisserman, A.: Video data mining using configurations of viewpoint invariant regions. In: Proceedings of the CVPR, pp. 488–495 (2004)
Google Scholar
Quack, T., Ferrari, V., Leibe, B., Van Gool, L.: Efficient mining of frequent and distinctive feature configurations. In: Proceedings of the ICCV (2007)
Google Scholar
Yuan, J., Wu, Y.: Context-aware clustering. In: Proceedings of the CVPR (2008)
Google Scholar
Lazebnik, S., Schmid, C., Ponce, J.: Semi-local affine parts for object recognition. In: Proceedings of the BMVC (2004)
Google Scholar
Leibe, B., Ettlin, A., Schiele, B.: Learning semantic object parts for object categorization. Image and Vision Computing 26(1), 15–26 (2008)
Article Google Scholar
Perronnin, P., Dance, C., Csurka, G., Bressan, M.: Adapted vocabularies for generic visual categorization. In: Leonardis, A., Bischof, H., Pinz, A. (eds.) ECCV 2006. LNCS, vol. 3954, pp. 464–475. Springer, Heidelberg (2006)
Chapter Google Scholar
Winn, J., Criminisi, A., Minka, A.: Object categorization by learned universal visual dictionary. In: Proceedings of the ICCV (2005)
Google Scholar
Moosmann, F., Triggs, B., Jurie, F.: Fast discriminative visual codebooks using randomized clustering forests. In: Advances in NIPS (2006)
Google Scholar
Perronnin, F., Dance, C.: Fisher kernels on visual vocabularies for image categorization. In: Proceedings of the CVPR (2007)
Google Scholar
Mikolajczyk, K., Leibe, B., Schiele, B.: Local features for object class recognition. In: Proceedings of the ICCV (2005)
Google Scholar
Stark, M., Schiele, B.: How good are local features for classes of geometric objects. In: Proceedings of the ICCV, pp. 1–8 (2007)
Google Scholar
Belongie, S., Malik, J., Puzicha, J.: Shape matching and object recognition using shape contexts. IEEE Transactions on PAMI 24(24), 509–522 (2002)
Article Google Scholar
Leibe, B., Mikolajczyk, K., Schiele, B.: Efficient clustering and matching for object class recognition. In: Proceedings of the BMVC (2006)
Google Scholar
Fei-Fei, L., Fergus, R., Perona, P.: Learning generative visual models from few training examples: an incremental bayesian approach tested on 101 object categories. In: Proceedings of the CVPR (2004)
Google Scholar

Download references

Author information

Authors and Affiliations

University of Alcalá, GRAM, Spain
Roberto Javier López Sastre & Saturnino Maldonado Bascón
K.U. Leuven, ESAT-PSI, Belgium
Tinne Tuytelaars

Authors

Roberto Javier López Sastre
View author publications
You can also search for this author in PubMed Google Scholar
Tinne Tuytelaars
View author publications
You can also search for this author in PubMed Google Scholar
Saturnino Maldonado Bascón
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Institute of Systems and Robotics, Dept. of Electrical and Computer Eng.-Polo II, University of Coimbra, 3030-290, Coimbra, Portugal
Helder Araujo
Institute of Biomedical Engineering, Faculty of Engineering, University of Porto, Rua Dr. Roberto Frias, 4200-465, Porto, Portugal
Ana Maria Mendonça
Dept. de Electrónica e Telecomunicações / IEETA, Universidade de Aveiro, Signal Processing Lab, DETI/IEETA, University of Aveiro, 3810–193, Aveiro, Portugal
Armando J. Pinho
Departamento de Electricidad y Electrónica, Fac. Ciencia y Tecnología - UPV/EHU, Universidad del País Vasco, Apartado 644, 48080, Bilbao, Spain
María Inés Torres

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

López Sastre, R.J., Tuytelaars, T., Maldonado Bascón, S. (2009). Class Representative Visual Words for Category-Level Object Recognition. In: Araujo, H., Mendonça, A.M., Pinho, A.J., Torres, M.I. (eds) Pattern Recognition and Image Analysis. IbPRIA 2009. Lecture Notes in Computer Science, vol 5524. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-02172-5_25

Download citation

DOI: https://doi.org/10.1007/978-3-642-02172-5_25
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-02171-8
Online ISBN: 978-3-642-02172-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics