Convolutional Shrinkage Neural Networks Based Model-Agnostic Meta-Learning for Few-Shot Learning

He, Yunpeng; Zang, Chuanzhi; Zeng, Peng; Dong, Qingwei; Liu, Ding; Liu, Yuqi

doi:10.1007/s11063-022-10894-7

Convolutional Shrinkage Neural Networks Based Model-Agnostic Meta-Learning for Few-Shot Learning

Published: 10 June 2022

Volume 55, pages 505–518, (2023)
Cite this article

Neural Processing Letters Aims and scope Submit manuscript

Yunpeng He ORCID: orcid.org/0000-0001-9672-7615^1,2,3,4,
Chuanzhi Zang⁵,
Peng Zeng^1,2,3,
Qingwei Dong^1,2,3,4,
Ding Liu^1,2,3,4 &
…
Yuqi Liu^1,2,3

534 Accesses
6 Citations
1 Altmetric
Explore all metrics

Abstract

Meta Learning (ML) has the ability to quickly learn from a small number of samples, and has become an important research field after reinforcement learning. However, the complexity of sample features severely reduces the performance of few-shot learning, and proper feature selection plays a vital role in the performance of neural networks. To address this problem, this article draws up a new type of convolutional neural network with an attention mechanism, namely, convolutional shrinkage neural networks (CSNNs), using the characteristics of negligible noise to obtain a good optimization parameter model. Moreover, soft thresholding is inserted into the network architectures as nonlinear transformation layers to eliminate nonessential features. In addition, considering that it is difficult to set appropriate values for the thresholds, the developed convolutional shrinkage neural networks integrates some specialized neural networks into trainable modules to automatically set the thresholds. To illustrate the effectiveness of the proposed method, the model-agnostic meta-learning method is considered for testing. The results show that the improved method can significantly improve the accuracy of few-shot images classification and enhance the generalization performance.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 3

Meta-learning with Network Pruning

Are LSTMs good few-shot learners?

Article Open access 07 September 2023

Multi-scale Relation Network for Few-Shot Learning Based on Meta-learning

References

Tang WX, Li B, Barni M et al (2021) An automatic cost learning framework for image steganography using deep reinforcement learning. IEEE Trans Inf Forensics Secur 16:952–967
Article Google Scholar
Choi Y, Lee K, Oh S (2019) Distributional deep reinforcement learning with a mixture of Gaussians. In: Proceedings - IEEE International Conference on Robotics and Automation 2019-May: p 9791–9797
Zoph B, Vasudevan V, Shlens J et al (2018) Learning Transferable Architectures for Scalable Image Recognition. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, p 8697–8710
Baker B, Gupta O, Naik N et al (2017) Designing neural network architectures using reinforcement learning. In: 5th International Conference on Learning Representations, ICLR 2017 - Conference Track Proceedings
Zoph B, Le QV (2017) Neural architecture search with reinforcement learning. In: 5th International Conference on Learning Representations, ICLR 2017 - Conference Track Proceedings
Pham H, Guan M Y, Zoph B et al (2018) Efficient Neural Architecture Search via parameter Sharing. In: 35th International Conference on Machine Learning, ICML 2018, vol 9, p 6522–6531
Pang G, Shen C, Cao L et al (2021) Deep Learning for Anomaly Detection: A Review. ACM Comput Surv 54(2):1–38
Article Google Scholar
Leake D, Crandall D (2020) On Bringing Case-Based Reasoning Methodology to Deep Learning. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 12311 LNAI: 343–348
Lu H, Jin L, Luo X et al (2019) RNN for Solving Perturbed Time-Varying Underdetermined Linear System With Double Bound Limits on Residual Errors and State Variables. IEEE Trans Ind Inform 15(11):5931–5942
Article Google Scholar
Hong Y, Niu L, Zhang J et al (2020) Matchinggan: Matching-Based Few-Shot Image Generation. IEEE Int Conf Multimed Expo (ICME) 2020:1–6
Google Scholar
Lake B, Salakhutdinov R, Tenenbaum J (2015) Human-level concept learning through probabilistic program induction. Science 350(6266):1332–1338
Article MathSciNet MATH Google Scholar
Wang J, Zhai Y (2020) Prototypical Siamese Networks for Few-shot Learning. In: 2020 IEEE 10th International Conference on Electronics Information and Emergency Communication (ICEIEC), p 178–181
Das D, Lee CSG (2020) A Two-Stage Approach to Few-Shot Learning for Image Recognition. IEEE Trans Image Process 29:3336–3350
Article MATH Google Scholar
Ramalho T, Garnelo M (2019) Adaptive Posterior Learning: few-shot learning with a surprise-based memory module. In: 7th International Conference on Learning Representations, ICLR
Finn C, Abbeel P, Levine S (2017) Model-agnostic meta-learning for fast adaptation of deep networks. In: 34th International Conference on Machine Learning, ICML 2017, vol 3, p 1856–1868
Antoniou A, Storkey A, Edwards H (2019) How to train your MAML. In: 7th International Conference on Learning Representations, ICLR
Liu Y, Lee J, Park M et al (2019) Learning to propagate labels: Transductive propagation network for few-shot learning. In: 7th International Conference on Learning Representations, ICLR
Yao H, Wei Y, Huang J et al (2019) Hierarchically structured meta-learning. In: 36th International Conference on Machine Learning, ICML, 2019-June, p 12189–12209
Yao H, Wu X, Tao Z et al (2020) Automated relational meta-learning. arXiv:2001.00745v1
Ravi S, Larochelle H (2017) Optimization as a model for few-shot learning. In: 5th International Conference on Learning Representations, ICLR 2017 - Conference Track Proceedings
Santoro A, Bartunov S, Botvinick M et al (2016) Meta-Learning with Memory-Augmented Neural Networks. In: 33rd International Conference on Machine Learning, ICML 2016, vol 4, p 2740–2751
Finn C, Rajeswaran A, Kakade S et al (2019) Online meta-learning. In: 36th International Conference on Machine Learning, ICML 2019, 2019-June, p 3398–3410
Munkhdalai T, Yu H (2017) Meta networks. In: 34th International Conference on Machine Learning, ICML 2017, vol 5, p 3933–3943
Lee K, Maji S, Ravichandran A et al (2019) Meta-learning with differentiable convex optimization. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, p 10649-10657
Xue T, Yu H (2020) Model-Agnostic Metalearning-Based Text-Driven Visual Navigation Model for Unfamiliar Tasks. IEEE Access 8:166742–166752
Article Google Scholar
Isogawa K, Ida T, Shiodera T et al (2018) Deep shrinkage convolutional neural network for adaptive noise reduction. IEEE Signal Process Lett 25:224–228
Article Google Scholar
Zhao M, Zhong S, Fu X et al (2020) Deep Residual Shrinkage Networks for Fault Diagnosis. IEEE Trans Industr Inf 16:4681–4690
Article Google Scholar
Hu J, Shen L, Albanie S et al (2020) Squeeze-and-Excitation Networks. IEEE Trans Pattern Anal Mach Intell 42:2011–2023
Article Google Scholar
Tan M, Chen B, Pang R et al (2019) Mnasnet: Platform-aware neural architecture search for mobile. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2019-June, p 2815–2823
Vaswani A, Shazeer N, Parmar N et al (2017) Attention is all you need. Advances in Neural Information Processing Systems, 2017-December: 5999-6009
Gu J, Wang Z, Kuen J et al (2018) Recent Advances in Convolutional Neural Networks. Pattern Recogn 77:354–377
Article Google Scholar
Zhao ZQ, Zheng P, Xu S-T et al (2019) Object Detection with Deep Learning: A Review. IEEE Trans Neural Netw Learn Syst 30:3212–3232
Article Google Scholar
Yu SD, Liu LL, Wang ZY et al (2019) Transferring deep neural networks for the differentiation of mammographic breast lesions. Sci China Tech Sci 62:441–447
Article Google Scholar
Nichol A, Schulman J Reptile: a scalable metalearning algorithm. arXiv:1803.02999v1
Munkhdalai T, Yuan X, Mehri S et al (2018) Rapid adaptation with conditionallyshifted neurons. In: 35th International Conference on Machine Learning, ICML, vol 8, p 5898–5909
Das D, LeeC (2020) A Two-Stage Approach to Few-Shot Learning for Image Recognition. IEEE Trans Image Process 29:3336–3350
Article MATH Google Scholar
Vinyals O, Blundell C, Lillicrap T et al (2016) Matching networks for one shot learning. In: 30th conference on neural information processing systems (NIPS), vol 29
Wang R, Zhang X, Liu C (2021) Meta-Prototypical Learning for Domain-Agnostic Few-Shot Recognition. IEEE Transactions on Neural Networks and Learning Systems, 1–7
Lee H, Lee H, Na D et al (2020) Learning to Balance: Bayesian Meta-Learning for Imbalanced and Out-of-distribution Tasks. arXiv:1905.12917, 1–15
Luo X, Zhou M, Shang M et al (2016) A Novel Approach to Extracting Non-Negative Latent Factors From Non-Negative Big Sparse Matrices. IEEE Access 4:2649–2655
Article Google Scholar
Luo X, Zhou M, Li S et al (2021) Algorithms of Unconstrained Non-Negative Latent Factor Analysis for Recommender Systems. IEEE Trans Big Data 7(1):227–240
Article Google Scholar

Download references

Acknowledgements

This work was supported in part by the National Key Research and Development Program of China under Grant 2018YFB1700200, in part by the National Natural Science Foundation of China under Grant U1908212, 61533015 and 92067205, in part by the State Key Laboratory of Robotics of China under Grant Y91Z081, also in part by the Natural Science Foundation of Liaoning Province under Grant 2020-KF-11-02.

Author information

Authors and Affiliations

State Key Laboratory of Robotics, Shenyang Institute of Automation, Chinese Academy of Sciences, 110016, Shenyang, China
Yunpeng He, Peng Zeng, Qingwei Dong, Ding Liu & Yuqi Liu
Key Laboratory of Networked Control Systems, Chinese Academy of Sciences, 110016, Shenyang, China
Yunpeng He, Peng Zeng, Qingwei Dong, Ding Liu & Yuqi Liu
Institutes for Robotics and Intelligent Manufacturing, Chinese Academy of Sciences, 110169, Shenyang, China
Yunpeng He, Peng Zeng, Qingwei Dong, Ding Liu & Yuqi Liu
University of Chinese Academy of Sciences, 100049, Beijing, China
Yunpeng He, Qingwei Dong & Ding Liu
Shenyang University of Technology, 110870, Shenyang, China
Chuanzhi Zang

Authors

Yunpeng He
View author publications
You can also search for this author in PubMed Google Scholar
Chuanzhi Zang
View author publications
You can also search for this author in PubMed Google Scholar
Peng Zeng
View author publications
You can also search for this author in PubMed Google Scholar
Qingwei Dong
View author publications
You can also search for this author in PubMed Google Scholar
Ding Liu
View author publications
You can also search for this author in PubMed Google Scholar
Yuqi Liu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Peng Zeng.

Ethics declarations

Conflicts of interest

The authors declare that they have no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

He, Y., Zang, C., Zeng, P. et al. Convolutional Shrinkage Neural Networks Based Model-Agnostic Meta-Learning for Few-Shot Learning. Neural Process Lett 55, 505–518 (2023). https://doi.org/10.1007/s11063-022-10894-7

Download citation

Accepted: 23 May 2022
Published: 10 June 2022
Issue Date: February 2023
DOI: https://doi.org/10.1007/s11063-022-10894-7

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Convolutional Shrinkage Neural Networks Based Model-Agnostic Meta-Learning for Few-Shot Learning

Abstract

Access this article

Similar content being viewed by others

Meta-learning with Network Pruning

Are LSTMs good few-shot learners?

Multi-scale Relation Network for Few-Shot Learning Based on Meta-learning

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflicts of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation