Skip to main content
Log in

Discriminative convolution sparse coding for robust image classification

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Convolutional Sparse Coding (CSC) is a popular model in the signal and image processing communities, resolving some limitations of the traditional patch-based sparse representations. However, most existing CSC algorithms are suited for image restoration. Also, in some CSC-based classification methods, the CSC model is only used as a feature extractor and so other classifiers are needed for classification. In this paper, we present a novel discriminative model based on CSC for image classification. The proposed method, discriminative local block coordinate descent (D-LoBCoD), is based on extending the LoBCoD algorithm by incorporating the classification error into the objective function that considers the performance of a linear classifier and the representational power of the filters, simultaneously. Thus, in the training phase, in each iteration, after updating the sparse coefficients and convolutional filters, we minimize the classification error by updating the parameters of the classifier according to the class label information of the training samples. Also, in the test phase, the label of the query image is determined by the trained classifier. To demonstrate the performance of the proposed method, we conduct extensive experiments on image data sets in comparison with state-of-the-art classification methods. The experimental results show that our method outperforms other competing methods in most cases. Further, we demonstrate that our proposed method is less dependent on the number of training samples because of capturing more representative information from the corresponding images. Thus our proposed method can work better than other methods on all small databases that have fewer samples.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5

Similar content being viewed by others

References

  1. An F (2020) Image classification algorithm based on stacked sparse coding deep learning model-optimized kernel function nonnegative sparse representation. Soft Comput 24(22):16967–16981

    Article  Google Scholar 

  2. Annunziata R, Trucco E (2016) Accelerating convolutional sparse coding for curvilinear structures segmentation by refining scird-ts filter banks. IEEE Trans Med Imaging 35(11):2381–2392

    Article  Google Scholar 

  3. Boyd S, Parikh N, Chu E (2011) Distributed optimization and statistical learning via the alternating direction method of multipliers. Now Publishers Inc

  4. Chang H, Han J, Zhong C, Snijders A M, Mao J-H (2017) Unsupervised transfer learning via multi-scale convolutional sparse coding for biomedical applications. IEEE Trans Pattern Anal Mach Intell 40(5):1182–1194

    Article  Google Scholar 

  5. Chen B, Li J, Ma B, Wei G (2016) Convolutional sparse coding classification model for image classification. In: 2016 IEEE international conference on image processing (ICIP). IEEE, pp 1918–1922

  6. Chen S S, Donoho D L, Saunders M A (2001) Atomic decomposition by basis pursuit. SIAM Rev 43(1):129–159

    Article  MathSciNet  Google Scholar 

  7. Chen S, Billings S A, Luo W (1989) Orthogonal least squares methods and their application to non-linear system identification. Int J Control 50 (5):1873–1896

    Article  Google Scholar 

  8. Cogliati A, Duan Z, Wohlberg B (2016) Context-dependent piano music transcription with convolutional sparse coding. IEEE/ACM Trans Audio Speech Lang Process 24(12):2218–2230

    Article  Google Scholar 

  9. Degraux K, Kamilov U S, Boufounos P T, Liu D (2017) Online convolutional dictionary learning for multimodal imaging. In: 2017 IEEE International Conference on Image Processing (ICIP). IEEE, pp 1617–1621

  10. Deng C, Chen Z, Liu X, Gao X, Tao D (2018) Triplet-based deep hashing network for cross-modal retrieval. IEEE Trans Image Process 27 (8):3893–3903

    Article  MathSciNet  Google Scholar 

  11. Ding Z, Shao M, Fu Y (2016) Deep robust encoder through locality preserving low-rank dictionary. In: European Conference on Computer Vision. Springer, pp 567–582

  12. Foroughi H, Ray N, Zhang H (2017) Object classification with joint projection and low-rank dictionary learning. IEEE Trans Image Process 27(2):806–821

    Article  MathSciNet  Google Scholar 

  13. Georghiades A S, Belhumeur P N, Kriegman D J (2001) From few to many: Illumination cone models for face recognition under variable lighting and pose. IEEE Trans Pattern Anal Mach Intell 23(6):643–660

    Article  Google Scholar 

  14. Goel N, Bebis G, Nefian A (2005) Face recognition experiments with random projection. In: Biometric Technology for Human Identification II, vol 5779. International Society for Optics and Photonics, pp 426–437

  15. Gou J, Wang L, Yi Z, Yuan Y, Ou W, Mao Q (2020) Weighted discriminative collaborative competitive representation for robust image classification. Neural Netw 125:104–120

    Article  Google Scholar 

  16. Gu S, Zuo W, Xie Q, Meng D, Feng X, Zhang L (2015) Convolutional sparse coding for image super-resolution. In: Proceedings of the IEEE International Conference on Computer Vision, pp 1823–1831

  17. Guo Y, Lu C, Allebach J P, Bouman C A (2017) Model-based iterative restoration for binary document image compression with dictionary learning. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 5984–5993

  18. He J, Yu L, Liu Z, Yang W (2021) Image super-resolution by learning weighted convolutional sparse coding. SIViP:1–9

  19. Heide F, Heidrich W, Wetzstein G (2015) Fast and flexible convolutional sparse coding. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 5135–5143

  20. Hu J, Tan Y-P (2018) Nonlinear dictionary learning with application to image classification. Pattern Recogn 75:282–291

    Article  Google Scholar 

  21. Jiang Z, Lin Z, Davis L S (2013) Label consistent k-svd: Learning a discriminative dictionary for recognition. IEEE Trans Pattern Anal Mach Intell 35(11):2651–2664

    Article  Google Scholar 

  22. Jin J, Chen CLP (2017) Convolutional sparse coding for face recognition. In: 2017 4th International Conference on Information, Cybernetics and Computational Social Systems (ICCSS). IEEE, pp 137–141

  23. Kavukcuoglu K, Sermanet P, Boureau Y-L, Gregor K, Mathieu M, Cun Y, et al. (2010) Learning convolutional feature hierarchies for visual recognition. Adv Neural Inf Process Syst 23:1090–1098

    Google Scholar 

  24. Li Z, Zhang Z, Qin J, Zhang Z, Shao L (2019) Discriminative fisher embedding dictionary learning algorithm for object recognition. IEEE Trans Neural Netw Learn Syst 31(3):786–800

    Article  MathSciNet  Google Scholar 

  25. Liao H-W, Su L (2018) Monaural source separation using ramanujan subspace dictionaries. IEEE Signal Process Lett 25(8):1156–1160

    Article  Google Scholar 

  26. Liu J, Garcia-Cardona C, Wohlberg B, Yin W (2017) Online convolutional dictionary learning. In: 2017 IEEE International Conference on Image Processing (ICIP). IEEE, pp 1707–1711

  27. Liu Z, Wu X-J, Shu Z (2019) Sparsity augmented discriminative sparse representation for face recognition. Pattern Anal Appl 22(4):1527–1535

    Article  MathSciNet  Google Scholar 

  28. Martinez A, Benavente R (1998) The ar face database: Cvc technical report, 24

  29. Nozaripour A, Soltanizadeh H (2021) Robust vein recognition against rotation using kernel sparse representation. Journal of AI and Data Mining

  30. Pan H, Jing Z, Qiao L, Li M (2017) Discriminative structured dictionary learning on grassmann manifolds and its application on image restoration. IEEE Trans Cybern 48(10):2875–2886

    Article  Google Scholar 

  31. Papyan V, Romano Y, Sulam J, Elad M (2017) Convolutional dictionary learning via local processing. In: Proceedings of the IEEE International Conference on Computer Vision, pp 5296–5304

  32. Parvasideh P, Rezghi M (2020) A novel dictionary learning method based on total least squares approach with application in high dimensional biological data. ADAC:1–23

  33. Pham D-S, Venkatesh S (2008) Joint learning and dictionary construction for pattern recognition. In: 2008 IEEE conference on computer vision and pattern recognition. IEEE, pp 1–8

  34. Pour A N, Eslami E, Haddadnia J (2015) A new method for automatic extraction of region of interest from infrared images of dorsal hand vein pattern based on floating selection model. Int J Appl Pattern Recogn 2(2):111–127

    Article  Google Scholar 

  35. Rodriguez P (2018) Fast convolutional sparse coding with 0 penalty. In: 2018 IEEE XXV International Conference on Electronics, Electrical Engineering and Computing (INTERCON). IEEE, pp 1–4

  36. Romano Y, Elad M (2015) Patch-disagreement as away to improve k-svd denoising. In: 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, pp 1280–1284

  37. Samaria F S, Harter A C (1994) Parameterisation of a stochastic model for human face identification. In: Proceedings of 1994 IEEE workshop on applications of computer vision. IEEE, pp 138–142

  38. Shazeeda S, Rosdi B A (2018) Finger vein recognition using mutual sparse representation classification. IET Biometr 8(1):49–58

    Article  Google Scholar 

  39. Šorel M, Šroubek F (2016) Fast convolutional sparse coding using matrix inversion lemma. Digital Signal Process 55:44–51

    Article  Google Scholar 

  40. Sun J, Ponce J (2016) Learning dictionary of discriminative part detectors for image categorization and cosegmentation. Int J Comput Vis 120(2):111–133

    Article  MathSciNet  Google Scholar 

  41. Vu T H, Monga V (2017) Fast low-rank shared dictionary learning for image classification. IEEE Trans Image Process 26(11):5160–5175

    Article  MathSciNet  Google Scholar 

  42. Wang J, Deng C, Liu W, Ji R, Chen X, Gao X (2013) Query-dependent visual dictionary adaptation for image reranking. In: Proceedings of the 21st ACM international conference on Multimedia, pp 769–772

  43. Wang K, Lin L, Zuo W, Gu S, Zhang L (2016) Dictionary pair classifier driven convolutional neural networks for object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 2138–2146

  44. Wang L, Dou J, Qin P, Lin S, Gao Y, Wang R, Zhang J (2021) Multimodal medical image fusion based on nonsubsampled shearlet transform and convolutional sparse representation. Multimed Tools Appl:1–21

  45. Wang Y, Yao Q, Kwok J T, Ni L M (2018) Scalable online convolutional sparse coding. IEEE Trans Image Process 27(10):4850–4859

    Article  MathSciNet  Google Scholar 

  46. Wohlberg B (2015) Efficient algorithms for convolutional sparse representations. IEEE Trans Image Process 25(1):301–315

    Article  MathSciNet  Google Scholar 

  47. Wright J, Yang A Y, Ganesh A, Sastry S S, Ma Y (2008) Robust face recognition via sparse representation. IEEE Trans Pattern Anal Mach Intell 31(2):210–227

    Article  Google Scholar 

  48. Xu Y, Li Z, Yang J, Zhang D (2017) A survey of dictionary learning algorithms for face recognition. IEEE Access 5:8502–8514

    Article  Google Scholar 

  49. Yang M, Zhang L, Feng X, Zhang D (2014) Sparse representation based fisher discrimination dictionary learning for image classification. Int J Comput Vis 109(3):209–232

    Article  MathSciNet  Google Scholar 

  50. Yuksel A, Akarun L, Sankur B (2011) Hand vein biometry based on geometry and appearance methods. IET Comput Vis 5(6):398–406

    Article  MathSciNet  Google Scholar 

  51. Zeiler M D, Krishnan D, Taylor G W, Fergus R (2010) Deconvolutional networks. In: 2010 IEEE Computer Society Conference on computer vision and pattern recognition. IEEE, pp 2528–2535

  52. Zeng S, Yang X, Gou J (2017) Multiplication fusion of sparse and collaborative representation for robust face recognition. Multimed Tools Appl 76(20):20889–20907

    Article  Google Scholar 

  53. Zeng S, Zhang B, Gou J, Xu Y (2020) Regularization on augmented data to diversify sparse representation for robust image classification. IEEE Transactions on Cybernetics

  54. Zhang L, Yang M, Feng X (2011) Sparse representation or collaborative representation: Which helps face recognition?. In: 2011 International conference on computer vision. IEEE, pp 471–478

  55. Zhang Q, Li B (2010) Discriminative k-svd for dictionary learning in face recognition. In: 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. IEEE, pp 2691–2698

  56. Zhao L, Zhang Y, Yin B, Sun Y, Hu Y, Piao X, Wu Q (2016) Fisher discrimination-based 2,1-norm sparse representation for face recognition. Vis Comput 32(9):1165–1178

    Article  Google Scholar 

  57. Zhao Z, Shen Q, Feng G, Zhu J (2021) Collaborative coding and dictionary learning for nearest subspace classification. Soft Comput 25(11):7627–7643

    Article  Google Scholar 

  58. Zheng H, Tao D (2015) Discriminative dictionary learning via fisher discrimination k-svd algorithm. Neurocomputing 162:9–15

    Article  Google Scholar 

  59. Zhou Y, Chang H, Barner K, Spellman P, Parvin B (2014) Classification of histology sections via multispectral convolutional sparse coding. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 3081–3088

  60. Zhu Y, Deng X, Newsam S (2019) Fine-grained land use classification at the city scale using ground-level images. IEEE Trans Multimed 21(7):1825–1838

    Article  Google Scholar 

  61. Zisselman E, Sulam J, Elad M (2019) A local block coordinate descent algorithm for the csc model. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 8208–8217

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hadi Soltanizadeh.

Ethics declarations

Conflict of Interests

The authors declare that they don’t have received any funds or other support to assist with the preparation of this manuscript. Also, they have no conflicts of interest to declare that are relevant to the content of this article.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Nozaripour, A., Soltanizadeh, H. Discriminative convolution sparse coding for robust image classification. Multimed Tools Appl 81, 40849–40870 (2022). https://doi.org/10.1007/s11042-022-12395-0

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-022-12395-0

Keywords

Navigation