Abstract
Catastrophic forgetting of previously learned knowledge while learning new tasks is a widely observed limitation of contemporary neural networks. Although many continual learning methods have been proposed to mitigate this drawback, the main question remains unanswered: what is the root cause of catastrophic forgetting? In this work, we aim to answer this question by posing and validating a set of research hypotheses related to the specificity of representations built internally by neural models. More specifically, we design a set of empirical evaluations that compare the robustness of representations in discriminative and generative models against catastrophic forgetting. We observe that representations learned by discriminative models are more prone to catastrophic forgetting than their generative counterparts, which sheds new light on the advantages of developing generative models for continual learning. Finally, our work opens new research pathways and possibilities for adopting generative models in continual learning beyond mere replay mechanisms.
K. Deja—Work done prior to joining Amazon.
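The comparison described in the abstract can be approximated with a representation-similarity measure such as linear CKA (centered kernel alignment): extract features for a fixed probe set of old-task examples before and after training on a new task, and check how much they drift in a discriminative classifier versus a generative model's encoder. The snippet below is a minimal, illustrative sketch under these assumptions, not the authors' evaluation code; the random feature matrices stand in for activations that would in practice be extracted from the compared models.

```python
# Minimal sketch: measure representation drift of old-task data with linear CKA.
# The feature matrices here are random stand-ins for activations extracted from a
# discriminative classifier and from a generative model's (e.g. VAE) encoder.
import numpy as np


def linear_cka(X: np.ndarray, Y: np.ndarray) -> float:
    """Linear CKA between two representation matrices of shape (n_samples, dim)."""
    X = X - X.mean(axis=0, keepdims=True)  # center each feature dimension
    Y = Y - Y.mean(axis=0, keepdims=True)
    # ||Y^T X||_F^2 / (||X^T X||_F * ||Y^T Y||_F)
    cross = np.linalg.norm(Y.T @ X, ord="fro") ** 2
    norm_x = np.linalg.norm(X.T @ X, ord="fro")
    norm_y = np.linalg.norm(Y.T @ Y, ord="fro")
    return float(cross / (norm_x * norm_y))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    n, d = 512, 128  # probe-set size, feature dimension
    # Representations of task-1 data before training on task 2 ...
    feats_before = rng.normal(size=(n, d))
    # ... and of the same data after training on task 2 (here: a perturbed copy).
    feats_after = feats_before + 0.3 * rng.normal(size=(n, d))
    # Higher CKA means the old-task representations drifted less,
    # i.e. less forgetting at the feature level.
    print(f"CKA(before, after) = {linear_cka(feats_before, feats_after):.3f}")
```

Running the same measurement on both model families, layer by layer, yields drift curves that can be compared directly, which is the spirit of the empirical evaluations outlined above.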
Acknowledgment
This research was funded by the National Science Centre, Poland (grants no. 2020/39/B/ST6/01511 and 2018/31/N/ST6/02374), the Foundation for Polish Science (grant no. POIR.04.04.00-00-14DE/18-00, carried out within the Team-Net program co-financed by the European Union under the European Regional Development Fund) and Warsaw University of Technology (POB Research Centre for Artificial Intelligence and Robotics within the Excellence Initiative Program - Research University). For the purpose of Open Access, the author has applied a CC-BY public copyright license to any Author Accepted Manuscript (AAM) version arising from this submission.