Discriminative convolution sparse coding for robust image classification

Nozaripour, Ali; Soltanizadeh, Hadi

doi:10.1007/s11042-022-12395-0

Discriminative convolution sparse coding for robust image classification

Published: 13 May 2022

Volume 81, pages 40849–40870, (2022)
Cite this article

Multimedia Tools and Applications Aims and scope Submit manuscript

206 Accesses
1 Citation
1 Altmetric
Explore all metrics

Abstract

Convolutional Sparse Coding (CSC) is a popular model in the signal and image processing communities, resolving some limitations of the traditional patch-based sparse representations. However, most existing CSC algorithms are suited for image restoration. Also, in some CSC-based classification methods, the CSC model is only used as a feature extractor and so other classifiers are needed for classification. In this paper, we present a novel discriminative model based on CSC for image classification. The proposed method, discriminative local block coordinate descent (D-LoBCoD), is based on extending the LoBCoD algorithm by incorporating the classification error into the objective function that considers the performance of a linear classifier and the representational power of the filters, simultaneously. Thus, in the training phase, in each iteration, after updating the sparse coefficients and convolutional filters, we minimize the classification error by updating the parameters of the classifier according to the class label information of the training samples. Also, in the test phase, the label of the query image is determined by the trained classifier. To demonstrate the performance of the proposed method, we conduct extensive experiments on image data sets in comparison with state-of-the-art classification methods. The experimental results show that our method outperforms other competing methods in most cases. Further, we demonstrate that our proposed method is less dependent on the number of training samples because of capturing more representative information from the corresponding images. Thus our proposed method can work better than other methods on all small databases that have fewer samples.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Image classification via convolutional sparse coding

Article 06 April 2022

Deep Sparse Representation Learning for Multi-class Image Classification

End-to-End Trained Sparse Coding Network with Spatial Pyramid Pooling for Image Classification

Article 22 January 2019

References

An F (2020) Image classification algorithm based on stacked sparse coding deep learning model-optimized kernel function nonnegative sparse representation. Soft Comput 24(22):16967–16981
Article Google Scholar
Annunziata R, Trucco E (2016) Accelerating convolutional sparse coding for curvilinear structures segmentation by refining scird-ts filter banks. IEEE Trans Med Imaging 35(11):2381–2392
Article Google Scholar
Boyd S, Parikh N, Chu E (2011) Distributed optimization and statistical learning via the alternating direction method of multipliers. Now Publishers Inc
Chang H, Han J, Zhong C, Snijders A M, Mao J-H (2017) Unsupervised transfer learning via multi-scale convolutional sparse coding for biomedical applications. IEEE Trans Pattern Anal Mach Intell 40(5):1182–1194
Article Google Scholar
Chen B, Li J, Ma B, Wei G (2016) Convolutional sparse coding classification model for image classification. In: 2016 IEEE international conference on image processing (ICIP). IEEE, pp 1918–1922
Chen S S, Donoho D L, Saunders M A (2001) Atomic decomposition by basis pursuit. SIAM Rev 43(1):129–159
Article MathSciNet Google Scholar
Chen S, Billings S A, Luo W (1989) Orthogonal least squares methods and their application to non-linear system identification. Int J Control 50 (5):1873–1896
Article Google Scholar
Cogliati A, Duan Z, Wohlberg B (2016) Context-dependent piano music transcription with convolutional sparse coding. IEEE/ACM Trans Audio Speech Lang Process 24(12):2218–2230
Article Google Scholar
Degraux K, Kamilov U S, Boufounos P T, Liu D (2017) Online convolutional dictionary learning for multimodal imaging. In: 2017 IEEE International Conference on Image Processing (ICIP). IEEE, pp 1617–1621
Deng C, Chen Z, Liu X, Gao X, Tao D (2018) Triplet-based deep hashing network for cross-modal retrieval. IEEE Trans Image Process 27 (8):3893–3903
Article MathSciNet Google Scholar
Ding Z, Shao M, Fu Y (2016) Deep robust encoder through locality preserving low-rank dictionary. In: European Conference on Computer Vision. Springer, pp 567–582
Foroughi H, Ray N, Zhang H (2017) Object classification with joint projection and low-rank dictionary learning. IEEE Trans Image Process 27(2):806–821
Article MathSciNet Google Scholar
Georghiades A S, Belhumeur P N, Kriegman D J (2001) From few to many: Illumination cone models for face recognition under variable lighting and pose. IEEE Trans Pattern Anal Mach Intell 23(6):643–660
Article Google Scholar
Goel N, Bebis G, Nefian A (2005) Face recognition experiments with random projection. In: Biometric Technology for Human Identification II, vol 5779. International Society for Optics and Photonics, pp 426–437
Gou J, Wang L, Yi Z, Yuan Y, Ou W, Mao Q (2020) Weighted discriminative collaborative competitive representation for robust image classification. Neural Netw 125:104–120
Article Google Scholar
Gu S, Zuo W, Xie Q, Meng D, Feng X, Zhang L (2015) Convolutional sparse coding for image super-resolution. In: Proceedings of the IEEE International Conference on Computer Vision, pp 1823–1831
Guo Y, Lu C, Allebach J P, Bouman C A (2017) Model-based iterative restoration for binary document image compression with dictionary learning. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 5984–5993
He J, Yu L, Liu Z, Yang W (2021) Image super-resolution by learning weighted convolutional sparse coding. SIViP:1–9
Heide F, Heidrich W, Wetzstein G (2015) Fast and flexible convolutional sparse coding. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 5135–5143
Hu J, Tan Y-P (2018) Nonlinear dictionary learning with application to image classification. Pattern Recogn 75:282–291
Article Google Scholar
Jiang Z, Lin Z, Davis L S (2013) Label consistent k-svd: Learning a discriminative dictionary for recognition. IEEE Trans Pattern Anal Mach Intell 35(11):2651–2664
Article Google Scholar
Jin J, Chen CLP (2017) Convolutional sparse coding for face recognition. In: 2017 4th International Conference on Information, Cybernetics and Computational Social Systems (ICCSS). IEEE, pp 137–141
Kavukcuoglu K, Sermanet P, Boureau Y-L, Gregor K, Mathieu M, Cun Y, et al. (2010) Learning convolutional feature hierarchies for visual recognition. Adv Neural Inf Process Syst 23:1090–1098
Google Scholar
Li Z, Zhang Z, Qin J, Zhang Z, Shao L (2019) Discriminative fisher embedding dictionary learning algorithm for object recognition. IEEE Trans Neural Netw Learn Syst 31(3):786–800
Article MathSciNet Google Scholar
Liao H-W, Su L (2018) Monaural source separation using ramanujan subspace dictionaries. IEEE Signal Process Lett 25(8):1156–1160
Article Google Scholar
Liu J, Garcia-Cardona C, Wohlberg B, Yin W (2017) Online convolutional dictionary learning. In: 2017 IEEE International Conference on Image Processing (ICIP). IEEE, pp 1707–1711
Liu Z, Wu X-J, Shu Z (2019) Sparsity augmented discriminative sparse representation for face recognition. Pattern Anal Appl 22(4):1527–1535
Article MathSciNet Google Scholar
Martinez A, Benavente R (1998) The ar face database: Cvc technical report, 24
Nozaripour A, Soltanizadeh H (2021) Robust vein recognition against rotation using kernel sparse representation. Journal of AI and Data Mining
Pan H, Jing Z, Qiao L, Li M (2017) Discriminative structured dictionary learning on grassmann manifolds and its application on image restoration. IEEE Trans Cybern 48(10):2875–2886
Article Google Scholar
Papyan V, Romano Y, Sulam J, Elad M (2017) Convolutional dictionary learning via local processing. In: Proceedings of the IEEE International Conference on Computer Vision, pp 5296–5304
Parvasideh P, Rezghi M (2020) A novel dictionary learning method based on total least squares approach with application in high dimensional biological data. ADAC:1–23
Pham D-S, Venkatesh S (2008) Joint learning and dictionary construction for pattern recognition. In: 2008 IEEE conference on computer vision and pattern recognition. IEEE, pp 1–8
Pour A N, Eslami E, Haddadnia J (2015) A new method for automatic extraction of region of interest from infrared images of dorsal hand vein pattern based on floating selection model. Int J Appl Pattern Recogn 2(2):111–127
Article Google Scholar
Rodriguez P (2018) Fast convolutional sparse coding with ℓ₀ penalty. In: 2018 IEEE XXV International Conference on Electronics, Electrical Engineering and Computing (INTERCON). IEEE, pp 1–4
Romano Y, Elad M (2015) Patch-disagreement as away to improve k-svd denoising. In: 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, pp 1280–1284
Samaria F S, Harter A C (1994) Parameterisation of a stochastic model for human face identification. In: Proceedings of 1994 IEEE workshop on applications of computer vision. IEEE, pp 138–142
Shazeeda S, Rosdi B A (2018) Finger vein recognition using mutual sparse representation classification. IET Biometr 8(1):49–58
Article Google Scholar
Šorel M, Šroubek F (2016) Fast convolutional sparse coding using matrix inversion lemma. Digital Signal Process 55:44–51
Article Google Scholar
Sun J, Ponce J (2016) Learning dictionary of discriminative part detectors for image categorization and cosegmentation. Int J Comput Vis 120(2):111–133
Article MathSciNet Google Scholar
Vu T H, Monga V (2017) Fast low-rank shared dictionary learning for image classification. IEEE Trans Image Process 26(11):5160–5175
Article MathSciNet Google Scholar
Wang J, Deng C, Liu W, Ji R, Chen X, Gao X (2013) Query-dependent visual dictionary adaptation for image reranking. In: Proceedings of the 21st ACM international conference on Multimedia, pp 769–772
Wang K, Lin L, Zuo W, Gu S, Zhang L (2016) Dictionary pair classifier driven convolutional neural networks for object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 2138–2146
Wang L, Dou J, Qin P, Lin S, Gao Y, Wang R, Zhang J (2021) Multimodal medical image fusion based on nonsubsampled shearlet transform and convolutional sparse representation. Multimed Tools Appl:1–21
Wang Y, Yao Q, Kwok J T, Ni L M (2018) Scalable online convolutional sparse coding. IEEE Trans Image Process 27(10):4850–4859
Article MathSciNet Google Scholar
Wohlberg B (2015) Efficient algorithms for convolutional sparse representations. IEEE Trans Image Process 25(1):301–315
Article MathSciNet Google Scholar
Wright J, Yang A Y, Ganesh A, Sastry S S, Ma Y (2008) Robust face recognition via sparse representation. IEEE Trans Pattern Anal Mach Intell 31(2):210–227
Article Google Scholar
Xu Y, Li Z, Yang J, Zhang D (2017) A survey of dictionary learning algorithms for face recognition. IEEE Access 5:8502–8514
Article Google Scholar
Yang M, Zhang L, Feng X, Zhang D (2014) Sparse representation based fisher discrimination dictionary learning for image classification. Int J Comput Vis 109(3):209–232
Article MathSciNet Google Scholar
Yuksel A, Akarun L, Sankur B (2011) Hand vein biometry based on geometry and appearance methods. IET Comput Vis 5(6):398–406
Article MathSciNet Google Scholar
Zeiler M D, Krishnan D, Taylor G W, Fergus R (2010) Deconvolutional networks. In: 2010 IEEE Computer Society Conference on computer vision and pattern recognition. IEEE, pp 2528–2535
Zeng S, Yang X, Gou J (2017) Multiplication fusion of sparse and collaborative representation for robust face recognition. Multimed Tools Appl 76(20):20889–20907
Article Google Scholar
Zeng S, Zhang B, Gou J, Xu Y (2020) Regularization on augmented data to diversify sparse representation for robust image classification. IEEE Transactions on Cybernetics
Zhang L, Yang M, Feng X (2011) Sparse representation or collaborative representation: Which helps face recognition?. In: 2011 International conference on computer vision. IEEE, pp 471–478
Zhang Q, Li B (2010) Discriminative k-svd for dictionary learning in face recognition. In: 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. IEEE, pp 2691–2698
Zhao L, Zhang Y, Yin B, Sun Y, Hu Y, Piao X, Wu Q (2016) Fisher discrimination-based ℓ_2,1-norm sparse representation for face recognition. Vis Comput 32(9):1165–1178
Article Google Scholar
Zhao Z, Shen Q, Feng G, Zhu J (2021) Collaborative coding and dictionary learning for nearest subspace classification. Soft Comput 25(11):7627–7643
Article Google Scholar
Zheng H, Tao D (2015) Discriminative dictionary learning via fisher discrimination k-svd algorithm. Neurocomputing 162:9–15
Article Google Scholar
Zhou Y, Chang H, Barner K, Spellman P, Parvin B (2014) Classification of histology sections via multispectral convolutional sparse coding. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 3081–3088
Zhu Y, Deng X, Newsam S (2019) Fine-grained land use classification at the city scale using ground-level images. IEEE Trans Multimed 21(7):1825–1838
Article Google Scholar
Zisselman E, Sulam J, Elad M (2019) A local block coordinate descent algorithm for the csc model. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 8208–8217

Download references

Author information

Authors and Affiliations

Department of Electrical/Computer Engineering, Semnan University, Semnan, 3513119111, Semnan, Iran
Ali Nozaripour
Faculty of Electrical/Computer Engineering, Semnan University, Semnan, 3513119111, Semnan, Iran
Hadi Soltanizadeh

Authors

Ali Nozaripour
View author publications
You can also search for this author in PubMed Google Scholar
Hadi Soltanizadeh
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Hadi Soltanizadeh.

Ethics declarations

Conflict of Interests

The authors declare that they don’t have received any funds or other support to assist with the preparation of this manuscript. Also, they have no conflicts of interest to declare that are relevant to the content of this article.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Nozaripour, A., Soltanizadeh, H. Discriminative convolution sparse coding for robust image classification. Multimed Tools Appl 81, 40849–40870 (2022). https://doi.org/10.1007/s11042-022-12395-0

Download citation

Received: 22 September 2021
Revised: 05 December 2021
Accepted: 25 January 2022
Published: 13 May 2022
Issue Date: November 2022
DOI: https://doi.org/10.1007/s11042-022-12395-0

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Discriminative convolution sparse coding for robust image classification

Abstract

Access this article

Similar content being viewed by others

Image classification via convolutional sparse coding

Deep Sparse Representation Learning for Multi-class Image Classification

End-to-End Trained Sparse Coding Network with Spatial Pyramid Pooling for Image Classification

References

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of Interests

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Discriminative convolution sparse coding for robust image classification

Abstract

Access this article

Similar content being viewed by others

Image classification via convolutional sparse coding

Deep Sparse Representation Learning for Multi-class Image Classification

End-to-End Trained Sparse Coding Network with Spatial Pyramid Pooling for Image Classification

References

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of Interests

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation