A Closer Look at Rehearsal-Free Continual Learning

Smith, James Seale; Tian, Junjiao; Halbe, Shaunak; Hsu, Yen-Chang; Kira, Zsolt

Computer Science > Machine Learning

arXiv:2203.17269 (cs)

[Submitted on 31 Mar 2022 (v1), last revised 3 Apr 2023 (this version, v2)]

Title:A Closer Look at Rehearsal-Free Continual Learning

Authors:James Seale Smith, Junjiao Tian, Shaunak Halbe, Yen-Chang Hsu, Zsolt Kira

View PDF

Abstract:Continual learning is a setting where machine learning models learn novel concepts from continuously shifting training data, while simultaneously avoiding degradation of knowledge on previously seen classes which may disappear from the training data for extended periods of time (a phenomenon known as the catastrophic forgetting problem). Current approaches for continual learning of a single expanding task (aka class-incremental continual learning) require extensive rehearsal of previously seen data to avoid this degradation of knowledge. Unfortunately, rehearsal comes at a cost to memory, and it may also violate data-privacy. Instead, we explore combining knowledge distillation and parameter regularization in new ways to achieve strong continual learning performance without rehearsal. Specifically, we take a deep dive into common continual learning techniques: prediction distillation, feature distillation, L2 parameter regularization, and EWC parameter regularization. We first disprove the common assumption that parameter regularization techniques fail for rehearsal-free continual learning of a single, expanding task. Next, we explore how to leverage knowledge from a pre-trained model in rehearsal-free continual learning and find that vanilla L2 parameter regularization outperforms EWC parameter regularization and feature distillation. Finally, we explore the recently popular ImageNet-R benchmark, and show that L2 parameter regularization implemented in self-attention blocks of a ViT transformer outperforms recent popular prompting for continual learning methods.

Comments:	Accepted by the 2023 IEEE/CVF Conference on Computer Vision and Pattern (CVPR) Workshop on Continual Learning in Computer Vision (CLVision 2023)
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2203.17269 [cs.LG]
	(or arXiv:2203.17269v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2203.17269

Submission history

From: James Smith [view email]
[v1] Thu, 31 Mar 2022 17:59:00 UTC (590 KB)
[v2] Mon, 3 Apr 2023 22:49:29 UTC (572 KB)

Computer Science > Machine Learning

Title:A Closer Look at Rehearsal-Free Continual Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:A Closer Look at Rehearsal-Free Continual Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators