Elastic net regularized dictionary learning for image classification

Multimedia Tools and Applications

Abstract

Dictionary learning plays a key role in image representation for classification. A multi-modal dictionary is usually learned from feature samples across different classes and shared in the feature encoding process. Ideally, each atom in the dictionary corresponds to a single class of images, while each class of images corresponds to a certain group of atoms. Image features are encoded as linear combinations of selected atoms in a given dictionary. We propose to use the elastic net as a regularizer to select atoms in feature coding and the related dictionary learning process, which not only benefits from sparsity similar to that of the ℓ1 penalty but also encourages a grouping effect that helps improve image representation. Experimental results of image classification on benchmark datasets show that the dictionary learned in the proposed way outperforms state-of-the-art dictionary learning algorithms.
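
The coding step described in the abstract can be illustrated with a minimal sketch. It assumes the standard elastic net objective, 0.5‖x − Ds‖² + λ₁‖s‖₁ + 0.5λ₂‖s‖², and solves it with plain proximal gradient descent (ISTA). The exact formulation and solver used in the article are not visible in this preview, so the function names (`soft_threshold`, `elastic_net_code`), the parameters (`lam1`, `lam2`, `n_iter`), and the toy dictionary are all illustrative assumptions, not the authors' method.

```python
import numpy as np

def soft_threshold(v, t):
    """Elementwise soft-thresholding: the proximal operator of t * ||.||_1."""
    return np.sign(v) * np.maximum(np.abs(v) - t, 0.0)

def objective(x, D, s, lam1, lam2):
    """Assumed elastic net coding objective (hypothetical helper)."""
    residual = x - D @ s
    return (0.5 * residual @ residual
            + lam1 * np.abs(s).sum()
            + 0.5 * lam2 * s @ s)

def elastic_net_code(x, D, lam1=0.1, lam2=0.1, n_iter=500):
    """Encode feature x over a fixed dictionary D (atoms are columns).

    Uses ISTA, chosen here for brevity; it is not claimed to be the
    solver used in the article.
    """
    # Lipschitz constant of the gradient of the smooth part:
    # largest singular value of D squared, plus the ridge weight.
    L = np.linalg.norm(D, 2) ** 2 + lam2
    s = np.zeros(D.shape[1])
    for _ in range(n_iter):
        grad = D.T @ (D @ s - x) + lam2 * s      # gradient of smooth terms
        s = soft_threshold(s - grad / L, lam1 / L)  # l1 proximal step
    return s

# Toy dictionary: atoms 0 and 1 are identical, atom 2 is orthogonal to them.
D = np.array([[1.0, 1.0, 0.0],
              [0.0, 0.0, 1.0]])
x = np.array([1.0, 0.0])

s = elastic_net_code(x, D, lam1=0.1, lam2=0.1)
print(s)
```

In this toy case the two identical atoms receive equal coefficients while the unrelated atom stays at zero, which is the grouping effect the abstract refers to. A pure ℓ1 penalty has no unique solution for duplicated atoms and may place all the weight on just one of them.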

Notes

  1. All the results of OCSVM and HIKVQ are based on step size 8 and without concatenated Sobel images.

Acknowledgments

This work was supported by the National Natural Science Foundation of P.R. China (No. 61402535), Qingdao Science and Technology Project (No. 14-2-4-111-jch), the Fundamental Research Funds for the Central Universities (No. R1405012A), and the Talent Acquisition Project (No. Y1305024).

Author information

Corresponding authors

Correspondence to Bin Shen or Bao-Di Liu.

Cite this article

Shen, B., Liu, BD. & Wang, Q. Elastic net regularized dictionary learning for image classification. Multimed Tools Appl 75, 8861–8874 (2016). https://doi.org/10.1007/s11042-014-2257-y

  • DOI: https://doi.org/10.1007/s11042-014-2257-y
