Skip to main content
Log in

Exploiting label consistency in structured sparse representation for classification

  • Original Article
  • Published:
Neural Computing and Applications Aims and scope Submit manuscript

Abstract

Sparse representation with adaptive dictionaries has emerged as a promising tool in computer vision and pattern analysis. While standard sparsity promoted by \(\ell _0\) or \(\ell _1\) regularization has been widely used, recent approaches seek for kinds of structured sparsity to improve the discriminability of sparse codes. For classification, label consistency is one useful concept regarding structured sparsity, which relates class labels to dictionary atoms for generating discriminative sparsity patterns. Motivated by the limitations of existing label-consistent regularization methods, in this paper, we investigate the exploitation of label consistency and propose an effective sparse coding approach. The proposed approach enforces the sparse approximation of a label consistency matrix by sparse code during dictionary learning, which encourages the supports of sparse codes to be consistent for intra-class signals and distinct for inter-class signals. Thus, the learned dictionary can induce discriminative sparsity patterns when used in sparse coding. Moreover, the proposed method is computationally efficient, as the label consistency regularization developed in our method brings very little additional computational cost in solving the related sparse coding problem. The effectiveness of the proposed method is demonstrated with several recognition tasks, and the experimental results show that our method is very competitive with some state-of-the-art approaches.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

Notes

  1. Such a kind of sparsity is often referred to as structured sparsity.

  2. This is often true if the corresponding signal \({\varvec{y}}_1\) and \({\varvec{y}}_2\) are from different subclasses.

  3. For rigorous proof, the sequence \(\{{\varvec{C}}^{(k)}\}_k\) is required to be bounded, which can be guaranteed by simple projection of \({\varvec{C}}^{(k)}\) at each iteration.

References

  1. Aharon M, Elad M, Bruckstein A (2006) K-SVD: an algorithm for designing overcomplete dictionaries for sparse representation. Trans Signal Process 54(11):4311–4322

    Article  Google Scholar 

  2. Bagheri MA, Gao Q, Escalera S, Moeslund TB, Ren H, Etemad E (2017) Locality regularized group sparse coding for action recognition. Comput Vis Image Underst 158:106–114

    Article  Google Scholar 

  3. Bao C, Ji H, Quan Y, Shen Z (2016) Dictionary learning for sparse coding: algorithms and convergence analysis. Trans Pattern Anal Mach Intell 38(7):1356–1369

    Article  Google Scholar 

  4. Boureau YL, Bach F, LeCun Y, Ponce J (2010) Learning mid-level features for recognition. In: Conference on computer vision and pattern recognition, IEEE, pp 2559–2566

  5. Cai S, Zuo W, Zhang L, Feng X, Wang P (2014) Support vector guided dictionary learning. In: ECCV, Springer, pp 624–639

  6. Chi YT, Ali M, Rajwade A, Ho J (2013) Block and group regularized sparse modeling for dictionary learning. In: Conference on computer vision and pattern recognition, IEEE, pp 377–382

  7. Gao Q, Wang Q, Huang Y, Gao X, Hong X, Zhang H (2015) Dimensionality reduction by integrating sparse representation and fisher criterion and its applications. Trans Image Process 24(12):5684–5695

    Article  MathSciNet  Google Scholar 

  8. Gao S, Tsang IW, Chia LT, Zhao P (2010) Local features are not lonely–Laplacian sparse coding for image classification. In: Conference on computer vision and pattern recognition, IEEE, pp 3555–3561

  9. Gao Y, Ma J, Yuille AL (2017) Semi-supervised sparse representation based classification for face recognition with insufficient labeled samples. Trans Image Process 26(5):2545–2560

    Article  MathSciNet  Google Scholar 

  10. Ghanem B, Ahuja N (2010) Maximum margin distance learning for dynamic texture recognition. In: European conference on computer vision, Springer, pp 223–236

  11. Ghanem B, Ahuja N (2010) Maximum margin distance learning for dynamic texture recognition. In: European conference on computer vision, Springer, pp 223–236

  12. Huang K, Aviyente S (2006) Sparse representation for signal classification. In: Advances in neural information processing systems, pp 609–616

  13. Jacob L, Obozinski G, Vert JP (2009) Group lasso with overlap and graph lasso. In: International conference on machine learning, ACM, pp 433–440

  14. Jenatton R, Audibert JY, Bach F (2011) Structured variable selection with sparsity-inducing norms. J Mach Learn Res 12:2777–2824

    MathSciNet  MATH  Google Scholar 

  15. Jiang Z, Lin Z, Davis LS (2011) Learning a discriminative dictionary for sparse coding via label consistent K-SVD. In: Conference on computer vision and pattern recognition, IEEE, pp 1697–1704

  16. Jiang Z, Zhang G, Davis LS (2012) Submodular dictionary learning for sparse coding. In: Proceedings of IEEE conference on computer vision and pattern recognition, IEEE, pp 3418–3425

  17. Jiang Z, Lin Z, Davis L (2013) Label consistent K-SVD: learning a discriminative dictionary for recognition. Trans Pattern Anal Mach Intell 35(11):2651–2664

    Article  Google Scholar 

  18. Kim S, Xing EP (2010) Tree-guided group lasso for multi-task regression with structured sparsity. In: International conference on machine learning, pp 543–550

  19. Kong S, Wang D (2012) A dictionary learning approach for classification: separating the particularity and the commonality. In: ECCV, Springer, pp 186–199

  20. Kviatkovsky I, Gabel M, Rivlin E, Shimshoni I (2017) On the equivalence of the LC-KSVD and the D-KSVD algorithms. Trans Pattern Anal Mach Intell 39(2):411–416

    Article  Google Scholar 

  21. Lazebnik S, Schmid C, Ponce J (2006) Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories. In: Conference on computer vision and pattern recognition, vol 2. IEEE, pp 2169–2178

  22. Lian XC, Li Z, Lu BL, Zhang L (2010) Max-margin dictionary learning for multiclass image categorization. In: European conference on computer vision, Springer, pp 157–170

  23. Lian XC, Li Z, Wang C, Lu BL, Zhang L (2010) Probabilistic models for supervised dictionary learning. In: Conference on computer vision and pattern recognition, IEEE, pp 2305–2312

  24. Liu J, Ye J (2010) Moreau–Yosida regularization for grouped tree structure learning. Adv Neural Inf Process Syst 23:1459–1467

    Google Scholar 

  25. Lu J, Wang G, Deng W, Moulin P (2014) Simultaneous feature and dictionary learning for image set based face recognition. In: European conference on computer vision, Springer, pp 265–280

  26. Mairal J, Bach F, Ponce J, Sapiro G, Zisserman A (2008) Discriminative learned dictionaries for local image analysis. In: Conference on computer vision and pattern recognition, IEEE, pp 1–8

  27. Mairal J, Ponce J, Sapiro G, Zisserman A, Bach FR (2009) Supervised dictionary learning. In: Advances in neural information processing systems, pp 1033–1040

  28. Majumdar A (2015) Discriminative label consistent dictionary learning. In: 2015 IEEE international conference on image processing (ICIP), IEEE, pp 1016–1020

  29. Marsousi M, Li X, Plataniotis KN (2016) Shape-included label-consistent discriminative dictionary learning: an approach to detect and segment multi-class objects in images. In: International conference on image processing, IEEE, pp 729–733

  30. Martinez AM (1998) The AR face database. CVC Technical report 24

  31. Péteri R, Fazekas S, Huiskes MJ (2010) DynTex : a comprehensive database of dynamic textures. Pattern Recogn Lett 31:1627–1632

    Article  Google Scholar 

  32. Pham DS, Venkatesh S (2008) Joint learning and dictionary construction for pattern recognition. In: Conference on computer vision and pattern recognition, IEEE, pp 1–8

  33. Quan Y, Ji H, Shen Z (2014) Data-driven multi-scale non-local wavelet frame construction and image recovery. J Sci Comput 63:307–329

    Article  MathSciNet  Google Scholar 

  34. Quan Y, Xu Y, Sun Y, Huang Y (2016) Supervised dictionary learning with multiple classifier integration. Pattern Recogn 55:247–260

    Article  Google Scholar 

  35. Quan Y, Xu Y, Sun Y, Huang Y, Ji H (2016) Sparse coding for classification via discrimination ensemble. In: Conference on computer vision and pattern recognition, pp 5839–5847

  36. Ramirez I, Sprechmann P, Sapiro G (2010) Classification and clustering via dictionary learning with structured incoherence and shared features. In: Conference on computer vision and pattern recognition, IEEE, pp 3501–3508

  37. Rosenblum K, Zelnik-Manor L, Eldar Y (2010) Dictionary optimization for block-sparse representations. In: Association for the advancement of artificial intelligence fall symposium: manifold learning and its applications, pp 50–58

  38. Sprechmann P, Ramirez I, Sapiro G, Eldar YC (2011) C-HiLasso: a collaborative hierarchical sparse modeling framework. Trans Signal Process 59(9):4183–4198

    Article  MathSciNet  Google Scholar 

  39. Sun Y, Liu Q, Tang J, Tao D (2014) Learning discriminative dictionary for group sparse representation. Trans Image Process 23(9):3816–3828

    Article  MathSciNet  Google Scholar 

  40. Wang J, Yang J, Yu K, Lv F, Huang T, Gong Y (2010) Locality-constrained linear coding for image classification. In: Conference on computer vision and pattern recognition, IEEE, pp 3360–3367

  41. Wang X, Yang M, Shen L (2016) Structured regularized robust coding for face recognition. Neurocomputing 216:18–27

    Article  Google Scholar 

  42. Wright J, Yang AY, Ganesh A, Sastry SS, Ma Y (2009) Robust face recognition via sparse representation. Trans Pattern Anal Mach Intell 31(2):210–227

    Article  Google Scholar 

  43. Xu Y, Quan Y, Ling H, Ji H (2011) Dynamic texture classification using dynamic fractal analysis. In: International conference on computer vision, IEEE, pp 1219–1226

  44. Yang J, Yu K, Gong Y, Huang T (2009) Linear spatial pyramid matching using sparse coding for image classification. In: Conference on computer vision and pattern recognition, IEEE, pp 1794–1801

  45. Yang J, Yu K, Huang T (2010) Supervised translation-invariant sparse coding. In: Conference on computer vision and pattern recognition, IEEE, pp 3517–3524

  46. Yang J, Zhang L, Xu Y, Jy Yang (2012) Beyond sparsity: the role of L1-optimizer in pattern classification. Pattern Recogn 45(3):1104–1118

    Article  Google Scholar 

  47. Yang M, Zhang D, Feng X (2011) Fisher discrimination dictionary learning for sparse representation. In: International conference on computer vision, IEEE, pp 543–550

  48. Yang M, Dai D, Shen L, Van Gool L (2014) Latent dictionary learning for sparse representation based classification. In: Conference on computer vision and pattern recognition, pp 4138–4145

  49. Yang M, Zhang L, Feng X, Zhang D (2014) Sparse representation based fisher discrimination dictionary learning for image classification. Int J Comput Vision 109(3):209–232

    Article  MathSciNet  Google Scholar 

  50. Zhang D, Liu P, Zhang K, Zhang H, Wang Q, Jing X (2015) Class relatedness oriented-discriminative dictionary learning for multiclass image classification. Pattern Recognit 59:168–175

    Article  Google Scholar 

  51. Zhang H, Wu QJ, Chow TW, Zhao M (2012) A two-dimensional neighborhood preserving projection for appearance-based face recognition. Pattern Recogn 45(5):1866–1876

    Article  Google Scholar 

  52. Zhang Q, Li B (2010) Discriminative K-SVD for dictionary learning in face recognition. In: Conference on computer vision and pattern recognition, IEEE, pp 2691–2698

  53. Zhao G, Pietikainen M (2007) Dynamic texture recognition using local binary patterns with an application to facial expressions. Trans Pattern Anal Mach Intell 29(6):915–928

    Article  Google Scholar 

  54. Zhou N, Shen Y, Peng J, Fan J (2012) Learning inter-related visual dictionary for object recognition. In: Conference on computer vision and pattern recognition, IEEE, pp 3490–3497

Download references

Acknowledgements

Yuhui Quan would like to thank the support by National Natural Science Foundation of China (61602184), Science and Technology Planning Project of Guangdong Province (2017A030313376), Science and Technology Program of Guangzhou (201707010147) and Educational Reform Project of South China University of Technology (j2jwY9160960). Yong Xu would like to thank the support by National Natural Science Foundation of China (U16114616167224161602184 and 61528204) and Cultivation Project of Major Basic Research of NSF-Guangdong Province (2016A030308013).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yuhui Quan.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Huang, Y., Quan, Y., Liu, T. et al. Exploiting label consistency in structured sparse representation for classification. Neural Comput & Applic 31, 6509–6520 (2019). https://doi.org/10.1007/s00521-018-3479-0

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00521-018-3479-0

Keywords

Navigation