ABSTRACT
Catastrophic forgetting is a major challenge in continual learning: knowledge of old tasks is lost when the model is updated on new ones. Existing solutions typically address this challenge with generative models or exemplar-replay strategies. However, such methods may still generate or select low-quality samples for replay, which directly reduces the effectiveness of the model, especially under class imbalance, noise, or redundancy. How to select a suitable coreset during continual learning therefore becomes significant in such settings. In this work, we propose a novel approach, continual coreset sampling (CCS), to address these challenges. We aim to select the most representative subset at each iteration: when the model is trained on new tasks, the gradient of the selected subset with respect to the model parameters closely approximates the gradients of both the previous and current tasks, making adaptation of the model to new datasets more efficient. Furthermore, instead of storing old data to maintain old knowledge, our approach preserves it in the latent space: we augment the previous classes in the embedding space as pseudo sample vectors drawn from the old encoder's output, strengthened by joint training with the selected new data. This avoids data-privacy invasions in real-world applications when the model is updated. Our experiments over various CV and NLP datasets validate the effectiveness of the proposed approach against current baselines, and show clear improvements in model adaptation and forgetting reduction in a data-free manner.
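The abstract's two core ideas can be sketched in a few lines. The paper's actual CCS algorithm is not reproduced here; the greedy gradient-matching rule, the function names, and the noise-based latent augmentation below are illustrative assumptions, not the authors' method:

```python
import numpy as np

def coreset_by_gradient_matching(grads, k):
    """Greedily pick k samples whose mean gradient best matches the
    mean gradient of the full task (an illustrative selection rule).

    grads: (n, d) array of per-sample gradient vectors.
    Returns the list of selected sample indices.
    """
    n, _ = grads.shape
    target = grads.mean(axis=0)        # gradient of the whole task
    selected = []
    running = np.zeros_like(target)    # sum of gradients picked so far
    for _ in range(k):
        best_i, best_err = None, np.inf
        for i in range(n):
            if i in selected:
                continue
            cand_mean = (running + grads[i]) / (len(selected) + 1)
            err = np.linalg.norm(cand_mean - target)
            if err < best_err:
                best_i, best_err = i, err
        selected.append(best_i)
        running += grads[best_i]
    return selected

def augment_latent(prototype, radius, n_samples, rng):
    """Generate pseudo feature vectors for an old class by perturbing
    its prototype in the embedding space (hypothetical augmentation)."""
    noise = rng.standard_normal((n_samples, prototype.shape[0]))
    return prototype + radius * noise
```

In this sketch the coreset is chosen so that training on it moves the parameters roughly as training on all data would, and old classes are replayed as noisy copies of their embedding-space prototypes rather than as stored raw samples, which is what makes the setting data-free.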
Index Terms
- Latent Coreset Sampling based Data-Free Continual Learning