Abstract
Support vector guided dictionary learning, as a discriminative dictionary learning method combining with support vector machine (SVM), embodies the margin maximization principle and achieves good generalization performances in many practical applications. However, this method ignores the key fact that the generalization performance of the SVM classifier depends not only on the margin between two classes of training samples, but also on the radius of the smallest sphere covering them. In the paper, we propose a novel method called generalization-error-bound-based discriminative dictionary learning (GEBDDL). The basic insight of GEBDDL is that the coding vectors, which are used to build the SVM classifier, are not fixed during the learning process. As a result, the radius of the smallest sphere changes with the learned coding vectors. The key feature of GEBDDL is that it explicitly incorporates the radius-margin-bound, which is directly related to the upper bound of the leave-one-out error of SVM, into its objective function to guide learning the dictionary and the coding vectors, and building the SVM classifier. In the paper, we first elaborate our motivation and propose the optimization model and then discuss how to solve it in detail. Further, we explore how to approximate the radius of the smallest sphere in our methodology. This can enhance the computational efficiency by bypassing the quadratic programming problem of computing the radius, while yielding a close performance to GEBDDL. Finally, the comprehensive experiments are conducted on several benchmark datasets, and the results demonstrate the superiority of the proposed methods over the other competing methods.
Similar content being viewed by others
References
Liu, Q., Zhang, C., Guo, Q., Xu, H., Zhou, Y.: Adaptive sparse coding on PCA dictionary for image denoising. Visual Comput (2016)
Yu, L., Yong, X., and Hui, J.: Removing rain from a single image via discriminative sparse coding. In: 2015 IEEE International Conference on Computer Vision (ICCV) (2015)
Li, H., He, X., Tao, D., Tang, Y., Wang, R.: Joint medical image fusion, denoising and enhancement via discriminative low-rank sparse dictionaries learning. Pattern Recognit. 79, 130–146 (2018)
Montazeri, A., Shamsi, M., Dianat, R.: MLK-SVD, the new approach in deep dictionary learning. Visual Comput. 23, 1–9 (2020)
Wen, Z., Hou, B., Jiao, L.: Discriminative dictionary learning with two-level low rank and group sparse decomposition for image classification. IEEE Trans. Cybern. 47(11), 3758–3771 (2017)
Wang, J., Kong, S.: A classification-oriented dictionary learning model: Explicitly learning the particularity and commonality across categories. Pattern Recognit, 47(2), pp. 885 – 898 (2014). [Online]. http://www.sciencedirect.com/science/article/pii/S0031320313003245
Jiang, J., Hu, R., Han, Z., Wang, Z.: Low-resolution and low-quality face super-resolution in monitoring scene via support-driven sparse coding. J. Signal Process. Syst. 75(3), 245–256 (2014)
Fotiadou, K., Tsagkatakis, G., Tsakalides, P.: Spectral super resolution of hyperspectral images via coupled dictionary learning. IEEE Trans. Geosci. Remote Sens. 57(5), 2777–2797 (2019)
Liling, Z., Quansen, S., Zelin, Z.: Single image super-resolution based on deep learning features and dictionary model. IEEE Access, pp. 1–1
Wright, J., Ganesh, A., Zhou, Zihan., Wagner, A., Yi Ma.: Demo: robust face recognition via sparse representation, pp. 1–2 (2008)
Zhang, L., Yang, M., Feng, X.: ’Sparse representation or collaborative representation: Which helps face recognition?, pp. 471–478 (2011)
Xu, Y., Zhong, Z., Yang, J., You, J., Zhang, D.: A new discriminative sparse representation method for robust face recognition via \(\text{ l}_{2}\) regularization. IEEE Trans. Neural Netw. Learn. Syst. 99, 1–10 (2016)
Li, Z., Zhang, Z., Qin, J., Zhang, Z., and Shao, L.: Discriminative fisher embedding dictionary learning algorithm for object recognition. IEEE Trans. Neural Netw. Learn. Syst., pp. 1–15 (2019)
Mairal, J., Bach, F., Ponce, J.: Task-driven dictionary learning. IEEE Trans. Pattern Anal. Mach. Intell. 34(4), 791–804 (2010)
Mairal, J., Bach, F., Ponce, J., Sapiro, G., Zisserman, A.: Discriminative learned dictionaries for local image analysis (2008)
Qiang, Z., Li, B.: Discriminative k-svd for dictionary learning in face recognition. In: 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (2010)
Jiang, Z., Lin, Z., Davis, L.S.: Label consistent k-SVD: learning a discriminative dictionary for recognition. IEEE Trans. Pattern Anal. Mach. Intell. 35(11), 2651–2664 (2013)
Li, Z., Lai, Z., Xu, Y., Yang, J., Zhang, D.: A locality-constrained and label embedding dictionary learning algorithm for image classification. IEEE Trans. Neural Netw. Learn. Syst. 28(2), 278–293 (2017)
Yang, M., Zhang, L., Feng, X., Zhang, D.: Sparse representation based fisher discrimination dictionary learning for image classification. Int. J. Comput. Vis. 109(3), 209–232 (2014)
Xu, Y., Zhang, Z., Lu, G., Yang, J.: Approximately symmetrical face images for image preprocessing in face recognition and sparse representation based classification. Pattern Recognit 54, 68–82 (2016)
Zhang, Z., Xu, Y., Shao, L., Yang, J.: Discriminative block-diagonal representation learning for image recognition. IEEE Trans. Neural Netw. Learn. Syst. 99, 1–15 (2017)
Deng, W., Hu, J., Guo, J.: Face recognition via collaborative representation: its discriminant nature and superposed representation. IEEE Trans. Pattern Anal. Mach. Intell. 99, 1–1 (2017)
Cai, S., Zuo, W., Zhang, L., Feng, X., Wang, P.: Support vector guided dictionary learning. In: European Conference on Computer Vision (2014)
Vapnik, V. N.: The nature of statistical learning theory (1995)
Do, H., Kalousis, A.: Convex formulations of radius-margin based support vector machines. In: 30th International Conference on Machine Learning, ICML 2013, pp. 169–177 (2013)
Do, H., Kalousis, A., Hilario, M.: Feature weighting using margin and radius based error bound optimization in SVMS, pp. 315–329 (2009)
Lin, Y., Lv, F., Zhu, S., Yang, M., Cour, T., Yu, K., Cao, L., Huang, T.: Large-scale image classification: fast feature extraction and SVM training, pp. 1689–1696 (2011)
Foody, G.M., Mathur, A.: Toward intelligent training of supervised image classifications: directing training data acquisition for SVM classification. Remote Sens. Environ. 93(1–2), 107–117 (2004)
Vapnik, V., Chapelle, O.: Bounds on error expectation for support vector machines. Neural Comput. 12(9), 2013–2036 (2000)
Chapelle, O.: Training a support vector machine in the primal. Neural Comput. 19(5), 1155 (2007)
Lee, H., Battle, A., Raina, R., Ng, A.Y.: Efficient sparse coding algorithms. In: Conference on Advances in Neural Information Processing Systems (2007)
Boyd, S., Parikh, N., Chu, E., Peleato, B., Eckstein, J.: Distributed optimization and statistical learning via the alternating direction method of multipliers. Found. Trends Mach. Learn. 3(1), 1–122 (2010)
Zheng, M., Bu, J., Chen, C., Wang, C., Cai, D.: Graph regularized sparse coding for image representation. IEEE Trans. Image Process. 20(5), 1327–1336 (2011)
Burges, C.: A tutorial on support vector machines for pattern recognition. Data Min. Knowl. Disc. 2(2), 121–167 (1998)
Georghiades, A.S., Belhumeur, P., Kriegman, D.: From few to many: illumination cone models for face recognition under variable lighting and pose. IEEE Trans. Pattern Anal. Mach. Intell. 23(6), 643–660 (2001)
Aharon, M., Elad, M., Bruckstein, A.: K-SVD: An algorithm for designing overcomplete dictionaries for sparse representation. IEEE Trans. Signal Process. 54(11), 4311–4322 (2006)
Martínez, A., Benavente, R.: The AR face database. In: CVC Technical Report 24 (1998)
Samaria, F.S., Harter, A.C.: Parameterisation of a stochastic model for human face identification. In: Proceedings of the Second IEEE Workshop on in Applications of Computer Vision, 1994 (1994)
Fei-Fei, L., Fergus, R., Perona, P.: Learning generative visual models from few training examples: an incremental Bayesian approach tested on 101 object categories. Comput. Vis. Image Underst. 106(1), 59–70 (2007)
Rate, C., Retrieval, C.: Columbia object image library (coil-20) (2011)
Acknowledgements
This work was supported in part by the National Natural Science Foundation of China under Grant 61602390; and in part by the Innovation Fund of Postgraduate, Xihua University under Grant ycjj2019085.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendix
Appendix
Given formula (25)
The detailed derivation is as follows, first of all, for \({\mathcal{F}_1}(\mathbf{{S}})\), we have
The partial derivatives of \({\mathcal {F}}_{1}({\mathbf {S}})\) with respect to \({\mathbf {S}}\) can be formulated as
The partial derivatives of \({\mathcal {F}}_{2}({\mathbf {S}})\) with respect to \({\mathbf {S}}\) can be formulated as
For \({{{{\mathcal {F}}}}_3}(\mathbf{{S}})\), there are two cases for discussion. First, when \(y_i^c(\mathbf{{w}}_c^T{\mathbf{{s}}_i} + {b_c})<1\),
When \(y_{i}\left( {\mathbf {w}}_{c}^{T} {\mathbf {s}}_{i}+b_{c}\right) \,\ge \,q 1\)
Hence, the partial derivatives of \({\mathcal {F}}_{3}({\mathbf {S}})\) with respect to \({\mathbf {S}}\) can be formulated as
Rights and permissions
About this article
Cite this article
Zhang, K., Wang, X., Xu, T. et al. Generalization-error-bound-based discriminative dictionary learning. Vis Comput 38, 2853–2869 (2022). https://doi.org/10.1007/s00371-021-02160-z
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00371-021-02160-z