Skip to main content

Supervised Visual Vocabulary with Category Information

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 6915))

Abstract

The bag-of-words model has been widely employed in image classification and object detection tasks. The performance of bag-of-words methods depends fundamentally on the visual vocabulary that is applied to quantize the image features into visual words. Traditional vocabulary construction methods (e.g. k-means) are unable to capture the semantic relationship between image features. In order to increase the discriminative power of the visual vocabulary, this paper proposes a technique to construct a supervised visual vocabulary by jointly considering image features and their class labels. The method uses a novel cost function in which a simple and effective dissimilarity measure is adopted to deal with category information. And, we adopt a prototype-based approach which tries to find prototypes for clusters instead of using the means in k-means algorithm. The proposed method works as the k-means algorithm by efficiently minimizing a clustering cost function. The experiments on different datasets show that the proposed vocabulary construction method is effective for image classification.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Sivic, J., Zisserman, A.: Video google: A text retrieval approach to object matching in videos. In: Proc. ICCV, vol. 2, pp. 1470–1477 (2003)

    Google Scholar 

  2. Lowe, G.: Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision 60(2), 91–110 (2004)

    Article  Google Scholar 

  3. Jurie, F., Triggs, B.: Creating efficient codebooks for visual recognition. In: Proc. ICCV (2005)

    Google Scholar 

  4. Winn, J., Criminisi, A., Minka, T.: Object categorization by learned universal visual dictionary. In: Proc. ICCV, pp. 1800–1807 (2005)

    Google Scholar 

  5. Lazebnik, S., Raginsky, M.: Supervised Learning of Quantizer Codebooks by Information Loss Minimization. IEEE Transactions on Pattern Analysis and Machine Intelligence 31(7), 1294–1309 (2009)

    Article  Google Scholar 

  6. Moosmann, F., Nowak, E., Jurie, F.: Randomized clustering forests for image classification. IEEE Transactions on Pattern Analysis and Machine Intelligence 30(9), 1632–1646 (2008)

    Article  Google Scholar 

  7. Perronnin, F.: Universal and Adapted Vocabularies for Generic Visual Categorization. IEEE Transactions on Pattern Analysis and Machine Intelligence 30(7), 1243–1256 (2008)

    Article  Google Scholar 

  8. Yang, L., Jin, R., Sukthankar, R., Jurie, F.: Unifying discriminative visual codebook generation with classifier training for object category recognition. In: Proc. CVPR (2008)

    Google Scholar 

  9. Zhang, C., Liu, J., Ouyang, Y., Tian, Q., Lu, H., Ma, S.: Category sensitive codebook construction for object category recognition. In: Proc. ICIP (2009)

    Google Scholar 

  10. Lian, X., Li, Z., Wang, C., Lv, B., Zhang, L.: Probabilistic models for supervised dictionary learning. In: Proc. CVPR (2010)

    Google Scholar 

  11. Jian, A., Dubes, R.: Algorithms for clustering data. Prentice Hall, Englewood Cliffs (1988)

    MATH  Google Scholar 

  12. Huang, Z.: Extensions to the k-Means Algorithm for Clustering Large Data Sets with Categorical Values. Data Mining and Knowledge Discovery, 283–304 (1998)

    Google Scholar 

  13. Oliva, A., Torralba, A.: Modeling the shape of the scene: a holistic representation of the spatial envelope. International Journal of Computer Vision 42(3), 145–175 (2001)

    Article  MATH  Google Scholar 

  14. Bosch, A., Zisserman, A., Munoz, X.: Scene classification using a hybrid generative/discriminative approach. IEEE Transactions on Pattern Analysis and Machine Intelligence 30(4), 712–727 (2008)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Liu, Y., Caselles, V. (2011). Supervised Visual Vocabulary with Category Information. In: Blanc-Talon, J., Kleihorst, R., Philips, W., Popescu, D., Scheunders, P. (eds) Advanced Concepts for Intelligent Vision Systems. ACIVS 2011. Lecture Notes in Computer Science, vol 6915. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23687-7_2

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-23687-7_2

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-23686-0

  • Online ISBN: 978-3-642-23687-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics