
Sparse Progressive Neural Networks for Continual Learning

  • Conference paper
  • Published in: Advances in Computational Collective Intelligence (ICCCI 2021)

Abstract

The human brain effectively integrates prior knowledge into new skills, transferring experience across tasks without suffering from catastrophic forgetting. In this study, to continually learn a sequence of visual classification tasks, we employ a neural network model with lateral connections called Progressive Neural Networks (PNNs). We sparsify PNNs with sparse group Least Absolute Shrinkage and Selection Operator (LASSO) regularization, and we also train conventional PNNs with recursive connections. We then investigate the effect of task priors on current-task performance under various task orders. The proposed approach is evaluated on permutedMNIST and on selected subtasks from the CIFAR-100 dataset. Results show that sparse group LASSO regularization effectively sparsifies progressive neural networks and that the order of the task sequence affects performance.

Supported by the Vodafone Future Laboratory, Istanbul Technical University (ITU), under Grant ITUVF20180901P04.
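For readers unfamiliar with the two ingredients named in the abstract, the following minimal PyTorch sketch (not the authors' code) shows a progressive column receiving a lateral connection from a frozen earlier column, trained with a sparse group LASSO penalty added to the task loss. All layer sizes and the hyperparameters lam and alpha are illustrative assumptions, not values from the paper, and the lateral adapter here feeds the output layer only, whereas full PNNs feed lateral activations into every hidden layer.

```python
import math
import torch
import torch.nn as nn

class PNNColumn(nn.Module):
    """One column of a two-layer progressive network (illustrative sizes).

    A new column receives lateral inputs from the frozen hidden activations
    of previous columns, so prior knowledge transfers to the new task
    without overwriting old weights (no catastrophic forgetting).
    """
    def __init__(self, in_dim=784, hidden=256, out_dim=10, n_prev=0):
        super().__init__()
        self.fc1 = nn.Linear(in_dim, hidden)
        self.fc2 = nn.Linear(hidden, out_dim)
        # one lateral adapter per previous column's hidden layer
        self.laterals = nn.ModuleList(
            nn.Linear(hidden, out_dim, bias=False) for _ in range(n_prev))

    def forward(self, x, prev_hidden=()):
        h = torch.relu(self.fc1(x))
        out = self.fc2(h)
        for lat, ph in zip(self.laterals, prev_hidden):
            out = out + lat(ph)  # lateral connection from a frozen column
        return out, h

def sparse_group_lasso(module, lam=1e-4, alpha=0.5):
    """Element-wise L1 (within-group sparsity) plus group-wise L2
    (whole neurons driven to zero); lam and alpha are illustrative."""
    l1 = group = 0.0
    for p in module.parameters():
        if p.dim() < 2:                   # skip biases
            continue
        l1 = l1 + p.abs().sum()
        rows = p.view(p.size(0), -1)      # group = incoming weights of a neuron
        group = group + math.sqrt(rows.size(1)) * rows.norm(dim=1).sum()
    return lam * (alpha * l1 + (1 - alpha) * group)

# Task 1: train the first column as usual, then freeze it.
col1 = PNNColumn()
for p in col1.parameters():
    p.requires_grad_(False)

# Task 2: a new column with a lateral adapter; only this column is trained,
# with the sparse group LASSO penalty added to the task loss.
col2 = PNNColumn(n_prev=1)
x, y = torch.randn(32, 784), torch.randint(0, 10, (32,))
with torch.no_grad():
    _, h1 = col1(x)
logits, _ = col2(x, prev_hidden=(h1,))
loss = nn.functional.cross_entropy(logits, y) + sparse_group_lasso(col2)
loss.backward()
```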



Author information

Correspondence to Esra Ergün.


Copyright information

© 2021 Springer Nature Switzerland AG

About this paper


Cite this paper

Ergün, E., Töreyin, B.U. (2021). Sparse Progressive Neural Networks for Continual Learning. In: Wojtkiewicz, K., Treur, J., Pimenidis, E., Maleszka, M. (eds) Advances in Computational Collective Intelligence. ICCCI 2021. Communications in Computer and Information Science, vol 1463. Springer, Cham. https://doi.org/10.1007/978-3-030-88113-9_58


  • DOI: https://doi.org/10.1007/978-3-030-88113-9_58

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-88112-2

  • Online ISBN: 978-3-030-88113-9

  • eBook Packages: Computer Science, Computer Science (R0)
