Abstract
Dictionary learning is widely utilized in pattern recognition, and analysis dictionary learning is a prevalent image classification method. However, its classification performance still has much room for improvement. Constructing the powerful discriminative constraint by exploiting the intrinsic characteristics of sample is an effective approach for enhancing the performance, thus how to design an effective constraint is a problem worth studying. On the other hand, some analysis dictionary learning models incorporate the classification error constraint. However, this constraint always adopts a strict binary matrix as the target matrix, which is harmful to the improvement of classification performance. To solve these issues, we propose an interactive and discriminative analysis dictionary learning for image classification, The ordinal locality preserving technique is utilized to to preserve the topology information of the samples, and Fisher constraint is applied to promote the discrepancy of inter-class coding coefficients and the similarity of intra-class coding coefficients. Furthermore, the target matrix is relaxed by exploiting \(\ell _{21}\) norm, which can provide more freedom to the classifier parameter. Finally, comparative experiments on different types of data sets show the efficacy of the proposed method, For 15 Scene dataset, the proposed IDADL improve the classification accuracies by 1.5%, 0.9% and 0.6% over FDDL, SLCADL and RADPL, respectively. Besides, the performance of the proposed model presents superiority over some classical deep models, For CMU PIE, the proposed IDADL promote the classification accuracies by 4.3% and 1.6% over AlexNet and VGG, respectively.


















Similar content being viewed by others
Data Availability
The datasets analysed during the current study are available via the following link, the AR database (http://www2.ece.ohio-state.edu/~aleix/ARdatabase.html), CMU PIE database (http://www.cs.cmu.edu/afs/cs/project/PIE/MultiPie/Multi-Pie/Home.html), MIT dataset (http://cbcl.mit.edu/software-datasets/heisele/facerecognition-database.html), FRGC (https://www.nist.gov/programs-projects/face-recognition-grand-challenge-frgc), Fifteen Scene Category database (http://users.umiacs.umd.edu/~zhuolin/projectlcksvd.html), and Caltech 101 dataset (https://www.vision.caltech.edu/Image-Datasets/Caltech101/).
References
Gou J, Yuan X, Du L et al (2022) Hierarchical graph augmented deep collaborative dictionary learning for classification. IEEE Trans Intell Transp Syst 23(12):25308–25322
A GZ, B FP, C HS et al (2020) Cost-sensitive joint feature and dictionary learning for face recognition - sciencedirect, Neurocomputing 391:177–188
You CZ, Shu ZQ, Fan HH (2021) Low-rank sparse subspace clustering with a clean dictionary 15
Bruton J, Wang H (2022) Dictionary learning for clustering on hyperspectral images 15(2): 255–261
Guo T, Luo F, Zhang L et al (2020) Learning structurally incoherent background and target dictionaries for hyperspectral target detection. IEEE J Selected Top Appl Earth Obs Remote Sens 13:3521–3533
Li X, Li Q, Wang W et al (2022) An unsupervised multi-shot person re-identification method via mutual normalized sparse representation and stepwise learning. IEEE Trans Intell Transp Syst 23(7):7866–7880
Zhang Z, Cui P, Zhu W (2020) Deep learning on graphs: a survey. IEEE Trans Knowl Data Eng 34(1):249–270
Wu S, Wu A, Zheng W-S (2021) Online deep transferable dictionary learning. Pattern Recogn 118:108007
Aharon M, Elad M, Bruckstein A (2006) K-svd: an algorithm for designing overcomplete dictionaries for sparse representation. IEEE Trans Signal Process 54(11):4311–4322
Jiang Z, Lin Z, Davis LS (2011) Learning a discriminative dictionary for sparse coding via label consistent k-svd in CVPR, IEEE, 2011:1697–1704
Li Z, Lai Z, Xu Y et al (2015) A locality-constrained and label embedding dictionary learning algorithm for image classification. IEEE Transactions on Neural Networks and Learning Systems 28(2):278–293
Liu B-D, Shen B, Gui L et al (2016) Face recognition using class specific dictionary learning for sparse representation and collaborative representation. Neurocomputing 204:198–210
Du H, Ma L, Li G et al (2020) Low-rank graph preserving discriminative dictionary learning for image recognition. Knowl-Based Syst 187:104823
Shekhar S, Patel VM, Chellappa R (2014) Analysis sparse coding models for image-based classification. In: 2014 IEEE International conference on image processing (ICIP), 5207–5211, IEEE
Tang W, Panahi A, Krim H et al (2019) Analysis dictionary learning based classification: structure for robustness. IEEE Trans Image Process 28(12):6035–6046
Wang J, Guo Y, Guo J et al (2017) Class-aware analysis dictionary learning for pattern classification. IEEE Signal Process Lett 24(12):1822–1826
Gu S, Zhang L, Zuo W et al (2014) Projective dictionary pair learning for pattern classification. Adv Neural Inform Process Syst 27
Zhang P, Du H, Ma L (2022) Joint projection learning and structured analysis-synthesis dictionary pair learning for pattern classification. J Electron Imaging 31(1):013010
Yang M, Chang H, Luo W (2017) Discriminative analysis-synthesis dictionary learning for image classification. Neurocomputing 219:404–411
Zhang Z, Jiang W, Qin J et al (2017) Jointly learning structured analysis discriminative dictionary and analysis multiclass classifier. IEEE Transactions on Neural Networks and Learning Systems 29(8):3798–3814
Zhang Z, Jiang W, Zhang Z et al (2019) Scalable block-diagonal locality-constrained projective dictionary learning. arXiv:1905.10568
Sun Y, Zhang Z, Jiang W et al (2020) Discriminative local sparse representation by robust adaptive dictionary pair learning. IEEE Transactions on Neural Networks and Learning Systems 31(10):4303–4317
Tang W, Panahi AP, Krim H et al (2019) Analysis dictionary learning based classification: structure for robustness. IEEE Trans Image Process 28(12):6035–6046
Chen Z, Wu X, Kittler J (2021) Relaxed block-diagonal dictionary pair learning with locality constraint for image recognition. IEEE Trans Neural Netw Learn Syst 33(8):3645–3659
Wang J, Guo Y, Guo J et al (2017) Synthesis linear classifier based analysis dictionary learning for pattern classification. Neurocomputing 238:103–113
Chen Z, Wu X, Tianyang X et al (2022) Discriminative dictionary pair learning with scale-constrained structured representation for image classification. IEEE Trans Neural Netw Learn Syst pp 1–15
Yang M, Zhang L, Feng X et al (2014) Sparse representation based fisher discrimination dictionary learning for image classification. Int J Comput Vision 109(3):209–232
Vu TH, Monga V (2017) Fast low-rank shared dictionary learning for image classification. IEEE Trans Image Process 26(11):5160–5175
Nie F, Huang H, Cai X et al (2010) Efficient and robust feature selection via joint \(\ell _{21}\)-norms minimization. Adv Neural Inform Process Syst 23
Sim T, Baker S, Bsat M (2002) The cmu pose, illumination, and expression (pie) database. In: Proceedings of fifth IEEE international conference on automatic face gesture recognition, IEEE, pp 53–58
Martinez A, Benavente R (1998) The ar face database: Cvc technical report 24
Lazebnik S, Schmid C, Ponce J (2006) Beyond bags of features: spatial pyramid matching for recognizing natural scene categories. In: 2006 IEEE Computer society conference on computer vision and pattern recognition (CVPR’06), IEEE, 2:2169–2178
Fei-Fei L, Fergus R, Perona P (2004) Learning generative visual models from few training examples: an incremental bayesian approach tested on 101 object categories. In: 2004 Conference on computer vision and pattern recognition workshop, IEEE, pp 178–178
Phillips PJ, Flynn PJ, Scruggs T et al (2005) Overview of the face recognition grand challenge,” in 2005 IEEE computer society conference on computer vision and pattern recognition (CVPR’05), IEEE, 1:947–954
Weyrauch B, Heisele B, Huang J et al (2004) Component-based face recognition with 3d morphable models. In: 2004 Conference on computer vision and pattern recognition workshop, IEEE, pp 85–85
Yao B, Jiang X, Khosla A et al (2011) Human action recognition by learning bases of action attributes and parts. In: 2011 International conference on computer vision, IEEE, pp 1331–1338
Nilsback M-E, Zisserman A (2008) Automated flower classification over a large number of classes. In: 2008 Sixth Indian conference on computer vision, graphics & image processing, IEEE, pp 722–729
Xu J, An W, Zhang L et al (2019) Sparse, collaborative, or nonnegative representation: which helps pattern classification? Pattern Recogn 88:679–688
Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. arXiv:1409.1556
Krizhevsky A, Sutskever I, Hinton GE (2012) Imagenet classification with deep convolutional neural networks. Adv Neural Inform Process Syst 25
Chui KT, Liu RW, Zhao M et al (2020) Predicting students performance with school and family tutoring using generative adversarial network-based deep support vector machine. IEEE Access 8:86745–86752
Funding
This work was supported by the Key Research Project of Henan Colleges and Universities (Grant No. 23B520038, No. 22A520053), and the Scientific and Technological Project of Henan Province (Grant No. 222102210337).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflicts of interests statement
We declare that we do not have any commercial or associative interest that represents a conflict of interest in connection with the work submitted.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Yang, J., Li, H. & Wang, S. Interactive and discriminative analysis dictionary learning for image classification. Multimed Tools Appl 83, 59943–59963 (2024). https://doi.org/10.1007/s11042-023-17891-5
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-023-17891-5