Skip to main content
Log in

Learning double weights via data augmentation for robust sparse and collaborative representation-based classification

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Image classification is a hot technique applied in many multimedia systems, where both l1 and l2 regularizations have shown potential for robust sparse representation-based image classification. However, previous studies showed that l1 or l2 alone cannot ensure a robust result. The robustness of a classifier depends on the nature of the dataset most of the time. What is worse, data augmentation may make the dataset more complicated, which leads a sparse model become harder to optimize. In this paper, a novel sparse representation that learns double weights through data augmentation is proposed for robust image classification. The first weight combines the two coefficients solved by l1 and l2 regularizations to obtain a more discriminative representation, while the second weight integrates the residuals obtained from the original and virtual samples, to take full advantage of diversity created by data augmentation. The double-weight process builds a robust model that is able to deal with the augmented but variational datasets. Experiments on popular facial and object datasets demonstrate the promising performance of the proposed method. Learning double weights via sample virtualization is helpful to develop multimedia applications.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10

Similar content being viewed by others

Notes

  1. https://github.com/zengsn/research

References

  1. Akhtar N, Shafait F, Mian A (2017) Efficient classification with sparsity augmented collaborative representation. Pattern Recogn 65:136–145

    Google Scholar 

  2. Beck A, Teboulle M (2009) A fast iterative shrinkage-thresholding algorithm for linear inverse problems. SIAM Journal on Imaging Sciences 2(1):183–202

    MathSciNet  MATH  Google Scholar 

  3. Bloice MD, Holzinger A (2019) Patch augmentation: towards efficient decision boundaries for neural networks. arXiv:1911.07922

  4. Bloice MD, Roth PM, Holzinger A (2019) Biomedical image augmentation using augmentor. Bioinformatics 35(21):4522–4524

    Google Scholar 

  5. Bloice MD, Stocker C, Holzinger A (2017) Augmentor: an image augmentation library for machine learning. arXiv:1708.04680

  6. Cai S, Zhang L, Zuo W, Feng X (2016) A probabilistic collaborative representation based approach for pattern classification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2950–2959

  7. Chen L, Man H, Nefian AV (2005) Face recognition based on multi-class mapping of fisher scores. Pattern Recogn 38(6):799–811

    Google Scholar 

  8. Crispell D, Biris O, Crosswhite N, Byrne J, Mundy JL (2017) Dataset augmentation for pose and lighting invariant face recognition. arXiv:1704.04326

  9. Florea MI, Vorobyov SA (2017) A robust fista-like algorithm. In: 2017 IEEE international conference on acoustics, speech and signal processing (ICASSP). IEEE, pp 4521–4525

  10. Gao Y, Ma J, Yuille AL (2017) Semi-supervised sparse representation based classification for face recognition with insufficient labeled samples. IEEE Trans Image Process 26(5):2545–2560

    MathSciNet  MATH  Google Scholar 

  11. Guo G, Li SZ, Chan KL (2001) Support vector machines for face recognition. Image Vision Comput 19(9-10):631–638

    Google Scholar 

  12. Han D, Liu Q, Fan W (2018) A new image classification method using cnn transfer learning and web data augmentation. Expert Syst Appl 95:43–56

    Google Scholar 

  13. He Z, Patel VM (2017) Sparse representation-based open set recognition. IEEE Trans Pattern Anal Mach Intell 39(8):1690–1696

    Google Scholar 

  14. He Z, Yi S, Cheung Y-M, You X, Tang YY (2017) Robust object tracking via key patch sparse representation. IEEE Trans Cybern 47(2):354–364

    Google Scholar 

  15. Hong C, Yu J, Chen X, Image-based 3d human pose recovery with locality sensitive sparse retrieval (2013). In: 2013 IEEE international conference on systems, man, and cybernetics. IEEE, pp 2103–2108

  16. Hong C, Yu J, Wan J, Tao D, Wang M (2015) Multimodal deep autoencoder for human pose recovery. IEEE Trans Image Process 24(12):5659–5670

    MathSciNet  MATH  Google Scholar 

  17. Hong C, Yu J, Zhang J, Jin X, Lee K-H (2018) Multimodal face-pose estimation with multitask manifold deep learning. IEEE Trans Indust Inform 15(7):3952–3961

    Google Scholar 

  18. Inoue H (2018) Data augmentation by pairing samples for images classification. arXiv:1801.02929

  19. Jiang J, Chen C, Yu Y, Jiang X, Ma J (2017) Spatial-aware collaborative representation for hyperspectral remote sensing image classification. IEEE Geosci Remote Sens Lett 14(3):404–408

    Google Scholar 

  20. Jiang Z, Lin Z, Davis LS (2013) Label consistent k-svd: learning a discriminative dictionary for recognition. IEEE Trans Pattern Anal Mach Intell 35(11):2651–2664

    Google Scholar 

  21. Kundu A, Drineas P, Magdon-Ismail M (2017) Recovering pca and sparse pca via hybrid-(l1,l2) sparse sampling of data elements. J Mach Learn Res 18(75):1–34

    MATH  Google Scholar 

  22. Learned-Miller E, Huang GB, RoyChowdhury A, Li H, Hua G (2016) Labeled faces in the wild: a survey. In: Advances in face detection and facial image analysis. Springer, pp 189–248

  23. Leng B, Kai Y u, Jingyan QIN (2017) Data augmentation for unbalanced face recognition training sets. Neurocomputing 235:10–14

    Google Scholar 

  24. Li J, Chang H, Yang J, Luo W, Fu Y (2018) Visual representation and classification by learning group sparse deep stacking network. IEEE Trans Image Process 27(1):464–476

    MathSciNet  MATH  Google Scholar 

  25. Li J, Zhang B, Zhang D (2017) Joint discriminative and collaborative representation for fatty liver disease diagnosis. Expert Syst Appl 89:31–40

    Google Scholar 

  26. Li Z, Zhou W-D, Chang P-C, Liu J, Yan Z, Wang T, Li F-Z (2012) Kernel sparse representation-based classifier. IEEE Trans Signal Process 60 (4):1684–1695

    MathSciNet  MATH  Google Scholar 

  27. Liu J, Cosman PC, Rao BD (2018) Robust linear regression via l0 regularization. IEEE Trans Signal Process 66(3):698–713

    MathSciNet  MATH  Google Scholar 

  28. Liu Z, Pu J, Wu Q, Zhao X (2016) Using the original and symmetrical face training samples to perform collaborative representation for face recognition. Optik-International Journal for Light and Electron Optics 127(4):1900–1904

    Google Scholar 

  29. Lv J-J, Shao X-H, Huang J-S, Zhou X-D, Xi Z (2017) Data augmentation for face recognition. Neurocomputing 230:184–196

    Google Scholar 

  30. Ma H, Gou J, Wang X, Ke J, Zeng S (2017) Sparse coefficient-based k-nearest neighbor classification. IEEE Access 5:16618–16634

    Google Scholar 

  31. Nene SA, Nayar SK, Murase H, et al. (1996) Columbia object image library (coil-20)

  32. Niu L, Zhou R, Tian Y, Qi Z, Zhang P (2017) Nonsmooth penalized clustering via lp regularized sparse regression. IEEE Trans Cybern 47(6):1423–1433

    Google Scholar 

  33. Ozdemir O, Allen T, Choi S, Wimalajeewa T, Varshney P (2017) Copula based classifier fusion under statistical dependence. IEEE Trans Pattern Anal Mach Intell

  34. Perlibakas V (2004) Distance measures for pca-based face recognition. Pattern Recogn Lett 25(6):711–724

    Google Scholar 

  35. Phillips PJ, Moon H, Rizvi SA, Rauss PJ (2000) The feret evaluation methodology for face-recognition algorithms. IEEE Trans Pattern Anal Mach Intell 22 (10):1090–1104

    Google Scholar 

  36. Schroff F, Kalenichenko D, Philbin J (2015) Facenet: a unified embedding for face recognition and clustering. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 815–823

  37. Wright J, Yang AY, Ganesh A, Sastry SS, Ma Y (2009) Robust face recognition via sparse representation. IEEE Trans Pattern Anal Mach Intell 31 (2):210–227

    Google Scholar 

  38. Wu SG, Bao FS, Xu EY, Wang Y-X, Chang Y-F, Xiang Q-L (2007) A leaf recognition algorithm for plant classification using probabilistic neural network. In: 2007 IEEE international symposium on signal processing and information technology, pp 11–16

  39. Xi P, Zhang L, Yi Z, Tan KK (2014) Learning locality-constrained collaborative representation for robust face recognition. Pattern Recogn 47(9):2794–2806

    Google Scholar 

  40. Xu Y, Zhang B, Zhong Z (2015) Multiple representations and sparse representation for image classification. Pattern Recogn Lett 68:9–14

    Google Scholar 

  41. Xu Y, Zhong Z, Yang J, You J, Zhang D (2017) A new discriminative sparse representation method for robust face recognition via l2 regularization. IEEE Trans Neural Netw Learn Sys 28(10):2233–2242

    Google Scholar 

  42. Xu Y, Zhu Q, Chen Y, Pan J-S, et al. (2013) An improvement to the nearest neighbor classifier and face recognition experiments. Int J Innov Comput Inf Control 9(2):543–554

    Google Scholar 

  43. Xu Y, Zhu X, Li Z, Liu G, Lu Y, Liu H (2013) Using the original and ’symmetrical face’ training samples to perform representation based two-step face recognition. Pattern Recogn 46(4):1151–1158

    Google Scholar 

  44. Yang AY, Zhou Z, Balasubramanian AG, Shankar Sastry S, Ma Y (2013) Fast l1-minimization algorithms for robust face recognition. IEEE Trans Image Process 22(8):3234–3246

    Google Scholar 

  45. Yang M, Wang X, Zeng G, Shen L (2017) Joint and collaborative representation with local adaptive convolution feature for face recognition with single sample per person. Pattern Recogn 66:117–128

    Google Scholar 

  46. Yu H, He F, Pan Y (2019) A novel segmentation model for medical images with intensity inhomogeneity based on adaptive perturbation. Multimed Tools Appl 78 (9):11779–11798

    Google Scholar 

  47. Yu H, He F, Pan Y (2020) A scalable region-based level set method using adaptive bilateral filter for noisy image segmentation. Multimed Tools Appl 79:5743–5765

    Google Scholar 

  48. Yu J, Tan M, Zhang H, Tao D, Rui Y (2019) Hierarchical deep click feature prediction for fine-grained image recognition. IEEE Trans Pattern Anal Machine Intell, Early Access (1):1–14

  49. Yu J, Tao D, Wang M, Rui Y (2014) Learning to rank using user clicks and visual features for image retrieval. IEEE Trans Cybern 45(4):767–779

    Google Scholar 

  50. Zeng S, Gou J, Deng L (2017) An antinoise sparse representation method for robust face recognition via joint l1 and l2 regularization. Expert Syst Appl 82:1–9

    Google Scholar 

  51. Zeng S, Gou J, Yang X (2018) Improving sparsity of coefficients for robust sparse and collaborative representation-based image classification. Neural Comput & Applic 30 (10):2965–2978

    Google Scholar 

  52. Zeng S, Yang X, Gou J (2017) Using kernel sparse representation to perform coarse-to-fine recognition of face images. Optik-International Journal for Light and Electron Optics 140:528–535

    Google Scholar 

  53. Zeng S, Zhang B, Du Y (2017) Joint distances by sparse representation and locality-constrained dictionary learning for robust leaf recognition. Comput Electron Agric 142:563–571

    Google Scholar 

  54. Zeng S, Zhang B, Zhang Y, Gou J (2020) Dual sparse learning via data augmentation for robust facial image classification. International Journal of Machine Learning and Cybernetics: 1–18

  55. Zhang J, He F, Chen Y (2020) A new haze removal approach for sky/river alike scenes based on external and internal clues. Multimed Tools Appl 79:2085–2107

    Google Scholar 

  56. Zhang L, Li L, Yang A, Shen Y, Yang M (2017) Towards contactless palmprint recognition: a novel device, a new benchmark, and a collaborative representation based identification approach. Pattern Recogn 69:199–212

    Google Scholar 

  57. Zhang L, Yang M, Feng X (2011) Sparse representation or collaborative representation: which helps face recognition?. In: 2011 IEEE international conference on computer vision (ICCV). IEEE, pp 471–478

  58. Zhang Q, Li B (2010) Discriminative k-svd for dictionary learning in face recognition. In: 2010 IEEE conference on computer vision and pattern recognition (CVPR). IEEE, pp 2691–2698

  59. Zhang S, He F (2019) Drcdn: learning deep residual convolutional dehazing networks. The Visual Computer, Online First (1):1–12

  60. Zhang S, He F, Ren W, Yao J (2020) Joint learning of image detail and transmission map for single image dehazing. Vis Comput 36:305–316

    Google Scholar 

  61. Zou H, Hastie T (2005) Regularization and variable selection via the elastic net. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 67(2):301–320

    MathSciNet  MATH  Google Scholar 

Download references

Acknowledgments

This research was funded by the University of Macau (MYR G2018-00053-FST), National Natural Science Foundation of China (NSFC) (no. 61976107), Indigenous Innovation’s Capability Development Program of Huizhou University (no. hzu201907), and Educational Science Planning Project of Guangdong Province (no.2018GXJK183). We gratefully acknowledge the support of NVIDIA Corporation with the donation of the Titan Xp GPU used for this research.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Bob Zhang.

Ethics declarations

Conflict of interests

The authors declare that they have no conflict of interest.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Ethical approval

This article does not contain any studies with human participants or animals performed by any of the authors.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zeng, S., Zhang, B. & Gou, J. Learning double weights via data augmentation for robust sparse and collaborative representation-based classification. Multimed Tools Appl 79, 20617–20638 (2020). https://doi.org/10.1007/s11042-020-08918-2

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-020-08918-2

Keywords

Navigation