Skip to main content
Log in

Interactive and discriminative analysis dictionary learning for image classification

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Dictionary learning is widely utilized in pattern recognition, and analysis dictionary learning is a prevalent image classification method. However, its classification performance still has much room for improvement. Constructing the powerful discriminative constraint by exploiting the intrinsic characteristics of sample is an effective approach for enhancing the performance, thus how to design an effective constraint is a problem worth studying. On the other hand, some analysis dictionary learning models incorporate the classification error constraint. However, this constraint always adopts a strict binary matrix as the target matrix, which is harmful to the improvement of classification performance. To solve these issues, we propose an interactive and discriminative analysis dictionary learning for image classification, The ordinal locality preserving technique is utilized to to preserve the topology information of the samples, and Fisher constraint is applied to promote the discrepancy of inter-class coding coefficients and the similarity of intra-class coding coefficients. Furthermore, the target matrix is relaxed by exploiting \(\ell _{21}\) norm, which can provide more freedom to the classifier parameter. Finally, comparative experiments on different types of data sets show the efficacy of the proposed method, For 15 Scene dataset, the proposed IDADL improve the classification accuracies by 1.5%, 0.9% and 0.6% over FDDL, SLCADL and RADPL, respectively. Besides, the performance of the proposed model presents superiority over some classical deep models, For CMU PIE, the proposed IDADL promote the classification accuracies by 4.3% and 1.6% over AlexNet and VGG, respectively.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Algorithm 1
Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16
Fig. 17

Similar content being viewed by others

Data Availability

The datasets analysed during the current study are available via the following link, the AR database (http://www2.ece.ohio-state.edu/~aleix/ARdatabase.html), CMU PIE database (http://www.cs.cmu.edu/afs/cs/project/PIE/MultiPie/Multi-Pie/Home.html), MIT dataset (http://cbcl.mit.edu/software-datasets/heisele/facerecognition-database.html), FRGC (https://www.nist.gov/programs-projects/face-recognition-grand-challenge-frgc), Fifteen Scene Category database (http://users.umiacs.umd.edu/~zhuolin/projectlcksvd.html), and Caltech 101 dataset (https://www.vision.caltech.edu/Image-Datasets/Caltech101/).

References

  1. Gou J, Yuan X, Du L et al (2022) Hierarchical graph augmented deep collaborative dictionary learning for classification. IEEE Trans Intell Transp Syst 23(12):25308–25322

    Article  Google Scholar 

  2. A GZ, B FP, C HS et al (2020) Cost-sensitive joint feature and dictionary learning for face recognition - sciencedirect, Neurocomputing 391:177–188

  3. You CZ, Shu ZQ, Fan HH (2021) Low-rank sparse subspace clustering with a clean dictionary 15

  4. Bruton J, Wang H (2022) Dictionary learning for clustering on hyperspectral images 15(2): 255–261

  5. Guo T, Luo F, Zhang L et al (2020) Learning structurally incoherent background and target dictionaries for hyperspectral target detection. IEEE J Selected Top Appl Earth Obs Remote Sens 13:3521–3533

    Article  Google Scholar 

  6. Li X, Li Q, Wang W et al (2022) An unsupervised multi-shot person re-identification method via mutual normalized sparse representation and stepwise learning. IEEE Trans Intell Transp Syst 23(7):7866–7880

    Article  MathSciNet  Google Scholar 

  7. Zhang Z, Cui P, Zhu W (2020) Deep learning on graphs: a survey. IEEE Trans Knowl Data Eng 34(1):249–270

  8. Wu S, Wu A, Zheng W-S (2021) Online deep transferable dictionary learning. Pattern Recogn 118:108007

    Article  Google Scholar 

  9. Aharon M, Elad M, Bruckstein A (2006) K-svd: an algorithm for designing overcomplete dictionaries for sparse representation. IEEE Trans Signal Process 54(11):4311–4322

  10. Jiang Z, Lin Z, Davis LS (2011) Learning a discriminative dictionary for sparse coding via label consistent k-svd in CVPR, IEEE, 2011:1697–1704

  11. Li Z, Lai Z, Xu Y et al (2015) A locality-constrained and label embedding dictionary learning algorithm for image classification. IEEE Transactions on Neural Networks and Learning Systems 28(2):278–293

  12. Liu B-D, Shen B, Gui L et al (2016) Face recognition using class specific dictionary learning for sparse representation and collaborative representation. Neurocomputing 204:198–210

    Article  Google Scholar 

  13. Du H, Ma L, Li G et al (2020) Low-rank graph preserving discriminative dictionary learning for image recognition. Knowl-Based Syst 187:104823

    Article  Google Scholar 

  14. Shekhar S, Patel VM, Chellappa R (2014) Analysis sparse coding models for image-based classification. In: 2014 IEEE International conference on image processing (ICIP), 5207–5211, IEEE

  15. Tang W, Panahi A, Krim H et al (2019) Analysis dictionary learning based classification: structure for robustness. IEEE Trans Image Process 28(12):6035–6046

  16. Wang J, Guo Y, Guo J et al (2017) Class-aware analysis dictionary learning for pattern classification. IEEE Signal Process Lett 24(12):1822–1826

    Article  Google Scholar 

  17. Gu S, Zhang L, Zuo W et al (2014) Projective dictionary pair learning for pattern classification. Adv Neural Inform Process Syst 27

  18. Zhang P, Du H, Ma L (2022) Joint projection learning and structured analysis-synthesis dictionary pair learning for pattern classification. J Electron Imaging 31(1):013010

    Article  Google Scholar 

  19. Yang M, Chang H, Luo W (2017) Discriminative analysis-synthesis dictionary learning for image classification. Neurocomputing 219:404–411

    Article  Google Scholar 

  20. Zhang Z, Jiang W, Qin J et al (2017) Jointly learning structured analysis discriminative dictionary and analysis multiclass classifier. IEEE Transactions on Neural Networks and Learning Systems 29(8):3798–3814

  21. Zhang Z, Jiang W, Zhang Z et al (2019) Scalable block-diagonal locality-constrained projective dictionary learning. arXiv:1905.10568

  22. Sun Y, Zhang Z, Jiang W et al (2020) Discriminative local sparse representation by robust adaptive dictionary pair learning. IEEE Transactions on Neural Networks and Learning Systems 31(10):4303–4317

    Article  MathSciNet  Google Scholar 

  23. Tang W, Panahi AP, Krim H et al (2019) Analysis dictionary learning based classification: structure for robustness. IEEE Trans Image Process 28(12):6035–6046

  24. Chen Z, Wu X, Kittler J (2021) Relaxed block-diagonal dictionary pair learning with locality constraint for image recognition. IEEE Trans Neural Netw Learn Syst 33(8):3645–3659

    Article  MathSciNet  Google Scholar 

  25. Wang J, Guo Y, Guo J et al (2017) Synthesis linear classifier based analysis dictionary learning for pattern classification. Neurocomputing 238:103–113

    Article  Google Scholar 

  26. Chen Z, Wu X, Tianyang X et al (2022) Discriminative dictionary pair learning with scale-constrained structured representation for image classification. IEEE Trans Neural Netw Learn Syst pp 1–15

  27. Yang M, Zhang L, Feng X et al (2014) Sparse representation based fisher discrimination dictionary learning for image classification. Int J Comput Vision 109(3):209–232

    Article  MathSciNet  Google Scholar 

  28. Vu TH, Monga V (2017) Fast low-rank shared dictionary learning for image classification. IEEE Trans Image Process 26(11):5160–5175

    Article  MathSciNet  Google Scholar 

  29. Nie F, Huang H, Cai X et al (2010) Efficient and robust feature selection via joint \(\ell _{21}\)-norms minimization. Adv Neural Inform Process Syst 23

  30. Sim T, Baker S, Bsat M (2002) The cmu pose, illumination, and expression (pie) database. In: Proceedings of fifth IEEE international conference on automatic face gesture recognition, IEEE, pp 53–58

  31. Martinez A, Benavente R (1998) The ar face database: Cvc technical report 24

  32. Lazebnik S, Schmid C, Ponce J (2006) Beyond bags of features: spatial pyramid matching for recognizing natural scene categories. In: 2006 IEEE Computer society conference on computer vision and pattern recognition (CVPR’06), IEEE, 2:2169–2178

  33. Fei-Fei L, Fergus R, Perona P (2004) Learning generative visual models from few training examples: an incremental bayesian approach tested on 101 object categories. In: 2004 Conference on computer vision and pattern recognition workshop, IEEE, pp 178–178

  34. Phillips PJ, Flynn PJ, Scruggs T et al (2005) Overview of the face recognition grand challenge,” in 2005 IEEE computer society conference on computer vision and pattern recognition (CVPR’05), IEEE, 1:947–954

  35. Weyrauch B, Heisele B, Huang J et al (2004) Component-based face recognition with 3d morphable models. In: 2004 Conference on computer vision and pattern recognition workshop, IEEE, pp 85–85

  36. Yao B, Jiang X, Khosla A et al (2011) Human action recognition by learning bases of action attributes and parts. In: 2011 International conference on computer vision, IEEE, pp 1331–1338

  37. Nilsback M-E, Zisserman A (2008) Automated flower classification over a large number of classes. In: 2008 Sixth Indian conference on computer vision, graphics & image processing, IEEE, pp 722–729

  38. Xu J, An W, Zhang L et al (2019) Sparse, collaborative, or nonnegative representation: which helps pattern classification? Pattern Recogn 88:679–688

    Article  Google Scholar 

  39. Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. arXiv:1409.1556

  40. Krizhevsky A, Sutskever I, Hinton GE (2012) Imagenet classification with deep convolutional neural networks. Adv Neural Inform Process Syst 25

  41. Chui KT, Liu RW, Zhao M et al (2020) Predicting students performance with school and family tutoring using generative adversarial network-based deep support vector machine. IEEE Access 8:86745–86752

    Article  Google Scholar 

Download references

Funding

This work was supported by the Key Research Project of Henan Colleges and Universities (Grant No. 23B520038, No. 22A520053), and the Scientific and Technological Project of Henan Province (Grant No. 222102210337).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Huazhong Li.

Ethics declarations

Conflicts of interests statement

We declare that we do not have any commercial or associative interest that represents a conflict of interest in connection with the work submitted.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Yang, J., Li, H. & Wang, S. Interactive and discriminative analysis dictionary learning for image classification. Multimed Tools Appl 83, 59943–59963 (2024). https://doi.org/10.1007/s11042-023-17891-5

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-023-17891-5

Keywords