Abstract
Dictionary learning plays a key role in image representation for classification. A multi-modal dictionary is usually learned from feature samples across different classes and shared in the feature encoding process. Ideally each atom in dictionary corresponds to a single class of images, while each class of images corresponds to a certain group of atoms. Image features are encoded as linear combinations of selected atoms in a given dictionary. We propose to use elastic net as regularizer to select atoms in feature coding and related dictionary learning process, which not only benefits from the sparsity similar as ℓ 1 penalty but also encourages a grouping effect that helps improve image representation. Experimental results of image classification on benchmark datasets show that with dictionary learned in the proposed way outperforms state-of-the-art dictionary learning algorithms.







Similar content being viewed by others
Notes
All the results of OCSVM and HIKVQ are based on step size 8 and without concatenated Sobel images.
References
Bo L, Sminchisescu C (2009) Efficient match kernel between sets of features for visual recognition. In: Proceedings of Advances in neural information processing systems, pp. 135–143
Bertsekas DP (1999) Nonlinear programming. Athena Scientific, Belmont
Chang C-C, Lin C-J (2011) LIBSVM: a library for support vector machines. ACM Trans Intell Systs Technol 2(3):27:1–27:27
Gao S, Tsang IW-H, Chia L-T (2010) Kernel sparse representation for image classification and face recognition. In: Proceedings of the 11th ECCV. Springer, pp 1–14
Gao S, Tsang IWH, Chia LT (2013) Laplacian sparse coding, hypergraph laplacian sparse coding, and applications. IEEE Trans Pattern Anal Mach Intell 35(1):92–104
Gao S, Tsang IW, Chia L-T, Zhao P (2010) Local features are not lonely–laplacian sparse coding for image classification. In: Proceedings of the 23rd CVPR. IEEE, pp 3555–3561
Gao Y, Wang M, Tao D, Ji R, Dai Q (2012) 3-d object retrieval and recognition with hypergraph analysis. IEEE Trans Image Process 21(9):4290–4303
Gao Y, Wang M, Zha Z-J, Shen J, Li X, Wu X (2013) Visual-textual joint relevance learning for tag-based social image search. IEEE Trans Image Process 22(1):363–376
Gao Y, Wang M, Zha Z-J, Tian Q, Dai Q, Zhang N (2011) Less is more: efficient 3-d object retrieval with query view selection. IEEE Trans Multimed 13(5):1007–1018
Lazebnik S, Schmid C, Ponce J (2006) Beyond bags of features: spatial pyramid matching for recognizing natural scene categories. In: Proceedings of the 19th CVPR, pp 2169–2178
Lee H, Battle A, Raina R, Ng AY (2006) Efficient sparse coding algorithms. In: Proceedings of advances in neural information processing systems, pp 801–808
Li F-F, Fergus R, Perona P (2004) Learning generative visual models from few training examples: an incremental bayesian approach tested on 101 object categories. In: Workshop of the 17th CVPR, vol. 12, p 178
Li L-J, Fei-Fei L (2007) What, where and who? Classifying events by scene and object recognition. In: Proceedings of the 11th ICCV. IEEE, pp 1–8
Liu B-D, Wang Y-X, Zhang Y-J, Shen B (2013) Learning dictionary on manifolds for image classification. Pattern Recog 46(7):1879–1890
Liu B-D, Wang Y-X, Shen B, Zhang Y-J, Hebert M (2014) Self-explanatory sparse representation for image classification. In: Proceedings of the 13th ECCV. Springer, pp 600–616
Liu B-D, Wang Y-X, Shen B, Zhang Y-J, Wang Y-J, Liu W-F (2013) Self-explanatory convex sparse representation for image classification. In: 2013 IEEE international conference on systems, man, and cybernetics (SMC). IEEE, 2120–2125
Liu B-D, Wang Y-X, Shen B, Zhang Y-J, Wang Y-J (2014) Blockwise coordinate descent schemes for sparse representation. In: Proceedings of the 39th ICASSP. IEEE, pp 5267–5271
Liu B-D, Wang Y-X, Zhang Y-J, Zheng Y (2012) Discriminant sparse coding for image classification. In: 2012 IEEE international conference on acoustics, speech and signal processing (ICASSP). IEEE, pp 2193–2196
Lu Z, Peng Y (2011) Latent semantic learning by efficient sparse coding with hypergraph regularization. In: Proceedings of the 25th AAAI, pp 411–416
Ramamurthy KN, Thiagarajan JJ, Spanias A (2011) Improved sparse coding using manifold projections. In: Proceedings of the 18th ICIP. IEEE, pp 1237–1240
Roweis ST, Saul LK (2000) Nonlinear dimensionality reduction by locally linear embedding. Science 290(5500):2323–2326
Shen B, Liu B-D, Allebach J (2014) Tisvm: Large margin classifier for misaligned image classification. In: Proceedings of the 21st ICIP. IEEE
Shen B, Liu B-D, Wang Q, Ji R (2014) Robust nonnegative matrix factorization via l1 norm regularization by multiplicative updating rules. In: Proceedings of the 21st ICIP. IEEE
Shen B, Si L (2010) Non-negative matrix factorization clustering on multiple manifolds. In: Proceedings of the 24th AAAI, pp 575–580
Shen B, Wei H, Zhang Y, Zhang Y-J (2009) Image inpainting via sparse representation. In: IEEE international conference on acoustics, speech and signal processing, 2009. ICASSP 2009. IEEE, pp 697–700
Van Nguyen H, Patel VM, Nasrabadi NM, Chellappa R (2012) Kernel dictionary learning. In: Proceedings of the 37th ICASSP. IEEE, pp 2021–2024
van Gemert JC, Veenman CJ, Smeulders AWM, Geusebroek J-M (2010) Visual word ambiguity. IEEE Trans Pattern Anal Mach Intell 32(7):1271–1283
Wang J, Yang J, Yu K, Lv F, Huang T, Y Gong (2010) Locality-constrained linear coding for image classification. In: Proceedings of the 23rd CVPR. IEEE, pp 3360–3367
Wu J, Rehg JM (2009) Beyond the euclidean distance: creating effective visual codebooks using the histogram intersection kernel. In: Proceedings of the 12th ICCV, pp 630–637
Yang J, Kai Y, Gong Y, Huang TS (2009) Linear spatial pyramid matching using sparse coding for image classification. In: Proceedings of the 22nd CVPR, pp 1794–1801
Yi W, Shen B, Ling H (2014) Visual tracking via online non-negative matrix factorization. IEEE Trans Circ Syst Video Technol 24(3):374–383
Yu K, Zhang T, Gong Y (2009) Nonlinear learning using local coordinate coding. In: Proceedings of advances in neural information processing systems, pp 2223–2231
Zhang D, Yang M, Feng X (2011) Sparse representation or collaborative representation: which helps face recognition? In: Proceedings of the 13th ICCV. IEEE, pp 471–478
Zheng M, Bu J, Chen C, Wang C, Zhang L, Qiu G, Cai D (2011) Graph regularized sparse coding for image representation. IEEE Trans Image Process 20(5):1327–1336
Zou H, Hastie T (2005) Regularization and variable selection via the elastic net. J R Stat Soc: Ser B (Stat Methodol) 67(2):301–320
Acknowledgments
This work was supported by the National Natural Science Foundation of P.R. China (No. 61402535), Qingdao Science and Technology Project (No. 14-2-4-111-jch), the Fundamental Research Funds for the Central Universities (No. R1405012A), and the Talent Acquisition Project (No.Y1305024).
Author information
Authors and Affiliations
Corresponding authors
Rights and permissions
About this article
Cite this article
Shen, B., Liu, BD. & Wang, Q. Elastic net regularized dictionary learning for image classification. Multimed Tools Appl 75, 8861–8874 (2016). https://doi.org/10.1007/s11042-014-2257-y
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-014-2257-y