Abstract
Convolutional Sparse Coding (CSC) is a popular model in the signal and image processing communities, resolving some limitations of the traditional patch-based sparse representations. However, most existing CSC algorithms are suited for image restoration. Also, in some CSC-based classification methods, the CSC model is only used as a feature extractor and so other classifiers are needed for classification. In this paper, we present a novel discriminative model based on CSC for image classification. The proposed method, discriminative local block coordinate descent (D-LoBCoD), is based on extending the LoBCoD algorithm by incorporating the classification error into the objective function that considers the performance of a linear classifier and the representational power of the filters, simultaneously. Thus, in the training phase, in each iteration, after updating the sparse coefficients and convolutional filters, we minimize the classification error by updating the parameters of the classifier according to the class label information of the training samples. Also, in the test phase, the label of the query image is determined by the trained classifier. To demonstrate the performance of the proposed method, we conduct extensive experiments on image data sets in comparison with state-of-the-art classification methods. The experimental results show that our method outperforms other competing methods in most cases. Further, we demonstrate that our proposed method is less dependent on the number of training samples because of capturing more representative information from the corresponding images. Thus our proposed method can work better than other methods on all small databases that have fewer samples.
Similar content being viewed by others
References
An F (2020) Image classification algorithm based on stacked sparse coding deep learning model-optimized kernel function nonnegative sparse representation. Soft Comput 24(22):16967–16981
Annunziata R, Trucco E (2016) Accelerating convolutional sparse coding for curvilinear structures segmentation by refining scird-ts filter banks. IEEE Trans Med Imaging 35(11):2381–2392
Boyd S, Parikh N, Chu E (2011) Distributed optimization and statistical learning via the alternating direction method of multipliers. Now Publishers Inc
Chang H, Han J, Zhong C, Snijders A M, Mao J-H (2017) Unsupervised transfer learning via multi-scale convolutional sparse coding for biomedical applications. IEEE Trans Pattern Anal Mach Intell 40(5):1182–1194
Chen B, Li J, Ma B, Wei G (2016) Convolutional sparse coding classification model for image classification. In: 2016 IEEE international conference on image processing (ICIP). IEEE, pp 1918–1922
Chen S S, Donoho D L, Saunders M A (2001) Atomic decomposition by basis pursuit. SIAM Rev 43(1):129–159
Chen S, Billings S A, Luo W (1989) Orthogonal least squares methods and their application to non-linear system identification. Int J Control 50 (5):1873–1896
Cogliati A, Duan Z, Wohlberg B (2016) Context-dependent piano music transcription with convolutional sparse coding. IEEE/ACM Trans Audio Speech Lang Process 24(12):2218–2230
Degraux K, Kamilov U S, Boufounos P T, Liu D (2017) Online convolutional dictionary learning for multimodal imaging. In: 2017 IEEE International Conference on Image Processing (ICIP). IEEE, pp 1617–1621
Deng C, Chen Z, Liu X, Gao X, Tao D (2018) Triplet-based deep hashing network for cross-modal retrieval. IEEE Trans Image Process 27 (8):3893–3903
Ding Z, Shao M, Fu Y (2016) Deep robust encoder through locality preserving low-rank dictionary. In: European Conference on Computer Vision. Springer, pp 567–582
Foroughi H, Ray N, Zhang H (2017) Object classification with joint projection and low-rank dictionary learning. IEEE Trans Image Process 27(2):806–821
Georghiades A S, Belhumeur P N, Kriegman D J (2001) From few to many: Illumination cone models for face recognition under variable lighting and pose. IEEE Trans Pattern Anal Mach Intell 23(6):643–660
Goel N, Bebis G, Nefian A (2005) Face recognition experiments with random projection. In: Biometric Technology for Human Identification II, vol 5779. International Society for Optics and Photonics, pp 426–437
Gou J, Wang L, Yi Z, Yuan Y, Ou W, Mao Q (2020) Weighted discriminative collaborative competitive representation for robust image classification. Neural Netw 125:104–120
Gu S, Zuo W, Xie Q, Meng D, Feng X, Zhang L (2015) Convolutional sparse coding for image super-resolution. In: Proceedings of the IEEE International Conference on Computer Vision, pp 1823–1831
Guo Y, Lu C, Allebach J P, Bouman C A (2017) Model-based iterative restoration for binary document image compression with dictionary learning. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 5984–5993
He J, Yu L, Liu Z, Yang W (2021) Image super-resolution by learning weighted convolutional sparse coding. SIViP:1–9
Heide F, Heidrich W, Wetzstein G (2015) Fast and flexible convolutional sparse coding. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 5135–5143
Hu J, Tan Y-P (2018) Nonlinear dictionary learning with application to image classification. Pattern Recogn 75:282–291
Jiang Z, Lin Z, Davis L S (2013) Label consistent k-svd: Learning a discriminative dictionary for recognition. IEEE Trans Pattern Anal Mach Intell 35(11):2651–2664
Jin J, Chen CLP (2017) Convolutional sparse coding for face recognition. In: 2017 4th International Conference on Information, Cybernetics and Computational Social Systems (ICCSS). IEEE, pp 137–141
Kavukcuoglu K, Sermanet P, Boureau Y-L, Gregor K, Mathieu M, Cun Y, et al. (2010) Learning convolutional feature hierarchies for visual recognition. Adv Neural Inf Process Syst 23:1090–1098
Li Z, Zhang Z, Qin J, Zhang Z, Shao L (2019) Discriminative fisher embedding dictionary learning algorithm for object recognition. IEEE Trans Neural Netw Learn Syst 31(3):786–800
Liao H-W, Su L (2018) Monaural source separation using ramanujan subspace dictionaries. IEEE Signal Process Lett 25(8):1156–1160
Liu J, Garcia-Cardona C, Wohlberg B, Yin W (2017) Online convolutional dictionary learning. In: 2017 IEEE International Conference on Image Processing (ICIP). IEEE, pp 1707–1711
Liu Z, Wu X-J, Shu Z (2019) Sparsity augmented discriminative sparse representation for face recognition. Pattern Anal Appl 22(4):1527–1535
Martinez A, Benavente R (1998) The ar face database: Cvc technical report, 24
Nozaripour A, Soltanizadeh H (2021) Robust vein recognition against rotation using kernel sparse representation. Journal of AI and Data Mining
Pan H, Jing Z, Qiao L, Li M (2017) Discriminative structured dictionary learning on grassmann manifolds and its application on image restoration. IEEE Trans Cybern 48(10):2875–2886
Papyan V, Romano Y, Sulam J, Elad M (2017) Convolutional dictionary learning via local processing. In: Proceedings of the IEEE International Conference on Computer Vision, pp 5296–5304
Parvasideh P, Rezghi M (2020) A novel dictionary learning method based on total least squares approach with application in high dimensional biological data. ADAC:1–23
Pham D-S, Venkatesh S (2008) Joint learning and dictionary construction for pattern recognition. In: 2008 IEEE conference on computer vision and pattern recognition. IEEE, pp 1–8
Pour A N, Eslami E, Haddadnia J (2015) A new method for automatic extraction of region of interest from infrared images of dorsal hand vein pattern based on floating selection model. Int J Appl Pattern Recogn 2(2):111–127
Rodriguez P (2018) Fast convolutional sparse coding with ℓ0 penalty. In: 2018 IEEE XXV International Conference on Electronics, Electrical Engineering and Computing (INTERCON). IEEE, pp 1–4
Romano Y, Elad M (2015) Patch-disagreement as away to improve k-svd denoising. In: 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, pp 1280–1284
Samaria F S, Harter A C (1994) Parameterisation of a stochastic model for human face identification. In: Proceedings of 1994 IEEE workshop on applications of computer vision. IEEE, pp 138–142
Shazeeda S, Rosdi B A (2018) Finger vein recognition using mutual sparse representation classification. IET Biometr 8(1):49–58
Šorel M, Šroubek F (2016) Fast convolutional sparse coding using matrix inversion lemma. Digital Signal Process 55:44–51
Sun J, Ponce J (2016) Learning dictionary of discriminative part detectors for image categorization and cosegmentation. Int J Comput Vis 120(2):111–133
Vu T H, Monga V (2017) Fast low-rank shared dictionary learning for image classification. IEEE Trans Image Process 26(11):5160–5175
Wang J, Deng C, Liu W, Ji R, Chen X, Gao X (2013) Query-dependent visual dictionary adaptation for image reranking. In: Proceedings of the 21st ACM international conference on Multimedia, pp 769–772
Wang K, Lin L, Zuo W, Gu S, Zhang L (2016) Dictionary pair classifier driven convolutional neural networks for object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 2138–2146
Wang L, Dou J, Qin P, Lin S, Gao Y, Wang R, Zhang J (2021) Multimodal medical image fusion based on nonsubsampled shearlet transform and convolutional sparse representation. Multimed Tools Appl:1–21
Wang Y, Yao Q, Kwok J T, Ni L M (2018) Scalable online convolutional sparse coding. IEEE Trans Image Process 27(10):4850–4859
Wohlberg B (2015) Efficient algorithms for convolutional sparse representations. IEEE Trans Image Process 25(1):301–315
Wright J, Yang A Y, Ganesh A, Sastry S S, Ma Y (2008) Robust face recognition via sparse representation. IEEE Trans Pattern Anal Mach Intell 31(2):210–227
Xu Y, Li Z, Yang J, Zhang D (2017) A survey of dictionary learning algorithms for face recognition. IEEE Access 5:8502–8514
Yang M, Zhang L, Feng X, Zhang D (2014) Sparse representation based fisher discrimination dictionary learning for image classification. Int J Comput Vis 109(3):209–232
Yuksel A, Akarun L, Sankur B (2011) Hand vein biometry based on geometry and appearance methods. IET Comput Vis 5(6):398–406
Zeiler M D, Krishnan D, Taylor G W, Fergus R (2010) Deconvolutional networks. In: 2010 IEEE Computer Society Conference on computer vision and pattern recognition. IEEE, pp 2528–2535
Zeng S, Yang X, Gou J (2017) Multiplication fusion of sparse and collaborative representation for robust face recognition. Multimed Tools Appl 76(20):20889–20907
Zeng S, Zhang B, Gou J, Xu Y (2020) Regularization on augmented data to diversify sparse representation for robust image classification. IEEE Transactions on Cybernetics
Zhang L, Yang M, Feng X (2011) Sparse representation or collaborative representation: Which helps face recognition?. In: 2011 International conference on computer vision. IEEE, pp 471–478
Zhang Q, Li B (2010) Discriminative k-svd for dictionary learning in face recognition. In: 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. IEEE, pp 2691–2698
Zhao L, Zhang Y, Yin B, Sun Y, Hu Y, Piao X, Wu Q (2016) Fisher discrimination-based ℓ2,1-norm sparse representation for face recognition. Vis Comput 32(9):1165–1178
Zhao Z, Shen Q, Feng G, Zhu J (2021) Collaborative coding and dictionary learning for nearest subspace classification. Soft Comput 25(11):7627–7643
Zheng H, Tao D (2015) Discriminative dictionary learning via fisher discrimination k-svd algorithm. Neurocomputing 162:9–15
Zhou Y, Chang H, Barner K, Spellman P, Parvin B (2014) Classification of histology sections via multispectral convolutional sparse coding. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 3081–3088
Zhu Y, Deng X, Newsam S (2019) Fine-grained land use classification at the city scale using ground-level images. IEEE Trans Multimed 21(7):1825–1838
Zisselman E, Sulam J, Elad M (2019) A local block coordinate descent algorithm for the csc model. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 8208–8217
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of Interests
The authors declare that they don’t have received any funds or other support to assist with the preparation of this manuscript. Also, they have no conflicts of interest to declare that are relevant to the content of this article.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Nozaripour, A., Soltanizadeh, H. Discriminative convolution sparse coding for robust image classification. Multimed Tools Appl 81, 40849–40870 (2022). https://doi.org/10.1007/s11042-022-12395-0
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-022-12395-0