Collaborative Dictionary Learning and Soft Assignment for Sparse Coding of Image Features

Liu, Jie; Tang, Sheng; Li, Yu

doi:10.1007/978-3-319-51811-4_36

Jie Liu¹⁸,
Sheng Tang¹⁹ &
Yu Li¹⁹

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 10132))

Included in the following conference series:

International Conference on Multimedia Modeling

3311 Accesses

Abstract

In computer vision, the bag-of-words (BoW) model has been widely applied to image related tasks, such as large scale image retrieval, image classification, and object categorization. The sparse coding (SC) method which leverages SC as a means of feature coding can guarantee both sparsity of coding vector and lower reconstruction error in the BoW model. Thus it can achieve better performance than the traditional vector quantization method. However, it suffers from the side effect introduced by the non-smooth sparsity regularizer that quite different words may be selected for similar patches to favor sparsity, resulting in the loss of correlation between the corresponding coding vectors. To address this problem, in this paper, we propose a novel soft assignment method based on index combination of top-2 large sparse codes of local descriptors to make the SC-based BoW tolerate the case of different word selection for similar patches. To further ensure similar patches select same words to generate similar coding vectors, we propose a collaborative dictionary learning method through imposing the sparse code similarity regularization factor along with the row sparsity regularization across data instances on top of group sparse coding. Experiments on the well-known public Oxford dataset demonstrate the effectiveness of our proposed methods.

This work was supported by National Nature Science Foundation of China (61371194, 61672361, 61572472), Beijing Natural Science Foundation (4152050, 4152012), Beijing Advanced Innovation Center for Imaging Technology (BAICIT-2016009).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Hierarchical BoW with segmental sparse coding for large scale image classification and retrieval

Article 05 May 2018

Discriminative sparse neighbor coding

Article 07 October 2015

Image Retrieval Based on Optimized Visual Dictionary and Adaptive Soft Assignment

References

Xie, H., Gao, K., Zhang, Y., Tang, S., Li, J., Liu, Y.: Efficient feature detection and effective post-verification for large scale near-duplicate image search. IEEE Trans. Multimedia 13(6), 1319–1332 (2011)
Article Google Scholar
Nie, L., Yan, S., Wang, M., Hong, R., Chua, T.-S.: Harvesting visual concepts for image search with complex queries. In: Proceedings of ACM Multimedia 2012 Conference, October 2012
Google Scholar
Tang, S., Li, J.-T., Li, M., Xie, C., Liu, Y.-Z., Tao, K., Xu, S.-X.: TRECVID 2008 high-level feature extraction by MCG-ICT-CAS. In: Proceedings of TRECVID 2008 Workshop, November 2008
Google Scholar
Tang, S., Zheng, Y.-T., Wang, Y., Chua, T.-S.: Sparse ensemble learning for concept detection. IEEE Trans. Multimedia 14(1), 43–54 (2012)
Article Google Scholar
Li, P., Lu, X., Wang, Q.: From dictionary of visual words to subspaces: locality-constrained affine subspace coding. In: 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2348–2357, June 2015
Google Scholar
Mikulik, A., Perdoch, M., Chum, O., Matas, J.: Learning vocabularies over a fine quantization. Int. J. Comput. Vision 103(1), 163–175 (2013)
Article MathSciNet Google Scholar
Sivic, J., Zisserman, A.: Video Google: a text retrieval approach to object matching in videos. In: Proceedings of ICCV, pp. 1470–1477 (2003)
Google Scholar
Jegou, H., Douze, M., Schmid, C.: Improving bag-of-features for large scale image search. Int. J. Comput. Vis. 87, 316–336 (2010)
Article Google Scholar
Tang, S., Chen, H., Lv, K., Zhang, Y.D.: Large visual words for large scale image classification. In: 2015 IEEE International Conference on Image Processing (ICIP), pp. 1170–1174, September 2015
Google Scholar
Nister, D., Stewenius, H.: Scalable recognition with a vocabulary tree. In: Proceedings of CVPR, pp. 2161–2168 (2006)
Google Scholar
Philbin, J., Chum, O., Isard, M., Sivic, J., Zisserman, A.: Object retrieval with large vocabularies and fast spatial matching. In: Proceedings of CVPR, pp. 1–8 (2007)
Google Scholar
Li, D., Yang, L., Hua, X.S., Zhang, H.J.: Large-scale robust visual codebook construction. In: ACM Multimedia 2010 (2010)
Google Scholar
Avrithis, Y., Kalantidis, Y.: Approximate Gaussian mixtures for large scale vocabularies. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012. LNCS, vol. 7574, pp. 15–28. Springer, Heidelberg (2012). doi:10.1007/978-3-642-33712-3_2
Chapter Google Scholar
Tang, S., Zhang, Y.D., Chen, H.: Scalable logo recognition based on compact sparse dictionary for mobile devices. In: 2015 IEEE 17th International Workshop on Multimedia Signal Processing (MMSP), pp. 1–6, October 2015
Google Scholar
Yang, J., Yu, K., Gong, Y., Huang, T.: Linear spatial pyramid matching using sparse coding for image classification. In: CVPR (2009)
Google Scholar
Jiang, Y.-G., Ngo, C.-W., Yang, J.: Towards optimal bag-of-features for object categorization and semantic video retrieval. In: Proceedings of ACM International Conference on Image and Video Retrieval (2007)
Google Scholar
Philbin, J., Chum, O., Isard, M., Sivic, J., Zisserman, A.: Lost in quantization: improving particular object retrieval in large scale image databases. In: Proceedings of CVPR (2008)
Google Scholar
Strelow, D., Bengio, S., Pereira, F., Singer, Y.: Group sparse coding. In: Neural Information Processing Systems - NIPS (2009)
Google Scholar
Petitcolas, F.A.P.: Watermarking schemes evaluation. IEEE. Sig. Process. 17(5), 117–128 (2000)
Google Scholar
Deng, J., Dong, W., Socher, R., Li, L.-J., Li, K., Fei-fei, L.: Imagenet: a large-scale hierarchical image database. In: Proceedings of CVPR (2009). http://image-net.org/
Muja, M., Lowe, D.G.: Scalable nearest neighbor algorithms for high dimensional data. IEEE Trans. Pattern Anal. Mach. Intell. 36, 2227–2240 (2014)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Beijing Advanced Innovation Center for Imaging Technology, College of Information and Engineering, Capital Normal University, Beijing, 100048, People’s Republic of China
Jie Liu
Key Lab of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, CAS, Beijing, 100190, People’s Republic of China
Sheng Tang & Yu Li

Authors

Jie Liu
View author publications
You can also search for this author in PubMed Google Scholar
Sheng Tang
View author publications
You can also search for this author in PubMed Google Scholar
Yu Li
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Sheng Tang .

Editor information

Editors and Affiliations

CNRS–IRISA, Rennes, France
Laurent Amsaleg
Reykjavík University, Reykjavik, Iceland
Gylfi Þór Guðmundsson
Dublin City University, Dublin, Ireland
Cathal Gurrin
Reykjavik University, Reykjavik, Ireland
Björn Þór Jónsson
National Institute of Informatics, Tokyo, Japan
Shin’ichi Satoh

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Liu, J., Tang, S., Li, Y. (2017). Collaborative Dictionary Learning and Soft Assignment for Sparse Coding of Image Features. In: Amsaleg, L., Guðmundsson, G., Gurrin, C., Jónsson, B., Satoh, S. (eds) MultiMedia Modeling. MMM 2017. Lecture Notes in Computer Science(), vol 10132. Springer, Cham. https://doi.org/10.1007/978-3-319-51811-4_36

Download citation

DOI: https://doi.org/10.1007/978-3-319-51811-4_36
Published: 31 December 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-51810-7
Online ISBN: 978-3-319-51811-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics