An Approach to the Compact and Efficient Visual Codebook Based on SIFT Descriptor

Wang, Zhe; Liu, Guizhong; Qian, Xueming; Guo, Danping

doi:10.1007/978-3-642-15702-8_42


Part of the book series: Lecture Notes in Computer Science (LNISA, volume 6297)

Included in the following conference series:

Pacific-Rim Conference on Multimedia


Abstract

The Bag-of-Words (BoW) representation derived from local keypoints has been widely applied in visual information research such as image search, video retrieval, object categorization, and computer vision. Constructing a visual codebook is the well-known and predominant method for building the BoW representation. However, a visual codebook usually has a high dimension, which results in high computational complexity. In this paper, an approach is presented for constructing a compact visual codebook. Two important parameters, namely the likelihood ratio and the significance level, are proposed to estimate the discriminative capability of each codeword. Codewords with higher discriminative capability are retained, and the others are removed. Experiments show that the proposed compact codebook not only reduces computational complexity but also improves object classification performance.
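
The abstract does not give the formulas behind the two parameters, so the following is only a minimal sketch of how codeword selection of this kind might look, not the authors' method. It assumes a two-class setting, takes the likelihood ratio of a codeword to be the ratio of the fractions of positive and negative images that contain it, and stands in a two-proportion z-statistic for the significance level. The function names, the thresholds `lr_min` and `z_min`, and the toy histograms are all hypothetical.

```python
import math

def codeword_scores(hist_pos, hist_neg, eps=1e-9):
    """Return one (likelihood_ratio, z_statistic) pair per codeword.

    hist_pos / hist_neg: lists of BoW count vectors, one per image,
    for the positive and negative class (assumed input layout).
    """
    n_pos, n_neg = len(hist_pos), len(hist_neg)
    scores = []
    for w in range(len(hist_pos[0])):
        # Number of images in each class that contain codeword w.
        k_pos = sum(1 for h in hist_pos if h[w] > 0)
        k_neg = sum(1 for h in hist_neg if h[w] > 0)
        p_pos, p_neg = k_pos / n_pos, k_neg / n_neg
        # Assumed form of the likelihood ratio: P(w | pos) / P(w | neg).
        lr = (p_pos + eps) / (p_neg + eps)
        # Assumed significance measure: two-proportion z-statistic.
        p_pool = (k_pos + k_neg) / (n_pos + n_neg)
        se = math.sqrt(p_pool * (1 - p_pool) * (1 / n_pos + 1 / n_neg))
        z = abs(p_pos - p_neg) / max(se, eps)
        scores.append((lr, z))
    return scores

def select_codewords(hist_pos, hist_neg, lr_min=2.0, z_min=1.96):
    """Indices of codewords that pass both (hypothetical) thresholds."""
    keep = []
    for w, (lr, z) in enumerate(codeword_scores(hist_pos, hist_neg)):
        # A codeword is discriminative if it favours either class strongly.
        if max(lr, 1.0 / lr) >= lr_min and z >= z_min:
            keep.append(w)
    return keep

def project(hist, keep):
    """Reduce each BoW vector to the retained codewords only."""
    return [[h[w] for w in keep] for h in hist]

# Toy data: codeword 0 occurs only in positive images,
# codewords 1 and 2 occur equally often in both classes.
hist_pos = [[3, 1, 1], [2, 1, 0]] * 10
hist_neg = [[0, 1, 1], [0, 2, 0]] * 10
kept = select_codewords(hist_pos, hist_neg)
compact_pos = project(hist_pos, kept)
```

Requiring both conditions means a codeword is kept only when the class difference is large and unlikely to be a sampling accident; the shortened vectors in `compact_pos` are what a downstream classifier would then consume.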



Author information

Authors and Affiliations

School of Electronics and Information Engineering, Xi’an Jiaotong University, 710049, China
Zhe Wang, Guizhong Liu, Xueming Qian & Danping Guo


Editor information

Editors and Affiliations

School of Computer Science, University of Nottingham, Jubilee Campus, NG8 1BB, Nottingham, UK
Guoping Qiu
The Centre for Multimedia Signal Processing, The Hong Kong Polytechnic University, Hong Kong, China
Kin Man Lam
Faculty of System Design, Tokyo Metropolitan University, 6-6, Asahigaoka, 191-0065, Hino-city, Tokyo
Hitoshi Kiya
Shanghai Key Laboratory of Intelligent Information Processing, Department of Computer Science & Engineering, Fudan University, Shanghai, China
Xiang-Yang Xue
Department of Electrical Engineering, University of Southern California, 90089-2564, Los Angeles, CA
C.-C. Jay Kuo
LIACS Media Lab, Leiden University,
Michael S. Lew


About this paper

Cite this paper

Wang, Z., Liu, G., Qian, X., Guo, D. (2010). An Approach to the Compact and Efficient Visual Codebook Based on SIFT Descriptor. In: Qiu, G., Lam, K.M., Kiya, H., Xue, XY., Kuo, CC.J., Lew, M.S. (eds) Advances in Multimedia Information Processing - PCM 2010. PCM 2010. Lecture Notes in Computer Science, vol 6297. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15702-8_42


DOI: https://doi.org/10.1007/978-3-642-15702-8_42
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-15701-1
Online ISBN: 978-3-642-15702-8
eBook Packages: Computer Science, Computer Science (R0)
