SuperFormer: Continual learning superposition method for text classification.

Abstract

One of the biggest challenges in continual learning is the tendency of machine learning models to forget previously learned information over time. To overcome this issue, existing approaches often consume large amounts of additional memory and apply forgetting-mitigation mechanisms that substantially prolong training. We therefore propose SuperFormer, a novel superposition-based method that alleviates model forgetting while requiring negligible additional memory and time. We address continual learning in a scenario where different tasks are learned in sequential order. We compare our method against several prominent continual learning methods (EWC, SI, MAS, GEM, PSP, etc.) on a set of text classification tasks. Our method achieves the best average performance in terms of AUROC and AUPRC (gains of 0.7% and 0.9% on average, respectively) and the lowest training time of all the compared methods. On average, it reduces the total training time by a factor of 5.4-8.5 in comparison to similarly performing methods. In terms of additional memory, our method is on par with the most memory-efficient approaches.

Copyright © 2023 The Author(s). Published by Elsevier Ltd. All rights reserved.
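The abstract does not describe how the superposition is implemented, but PSP-style parameter superposition (one of the listed baselines) illustrates why such methods add almost no memory or training time: a single shared weight matrix is reused for every task, and each task only contributes a fixed random context vector that maps inputs into its own subspace. The sketch below is a minimal, hypothetical PyTorch illustration of that general idea; the class name SuperposedLinear, the binary +/-1 contexts, and all sizes are assumptions made for illustration, not the authors' actual SuperFormer architecture.

```python
import torch
import torch.nn as nn

class SuperposedLinear(nn.Module):
    """Hypothetical sketch: a linear layer whose shared weights are bound with a
    cheap, task-specific binary context vector (parameter-superposition style).
    Contexts are fixed random +/-1 vectors, so the only per-task memory cost is
    one sign vector per task."""

    def __init__(self, in_features, out_features, n_tasks):
        super().__init__()
        self.weight = nn.Parameter(torch.randn(out_features, in_features) * 0.02)
        self.bias = nn.Parameter(torch.zeros(out_features))
        # One random +/-1 context per task; not trained, so no extra optimizer state.
        contexts = torch.randint(0, 2, (n_tasks, in_features)).float() * 2 - 1
        self.register_buffer("contexts", contexts)

    def forward(self, x, task_id):
        # Bind the input with the task's context before the shared linear map,
        # which is intended to reduce interference between tasks.
        x_bound = x * self.contexts[task_id]
        return nn.functional.linear(x_bound, self.weight, self.bias)


# Usage: tasks are trained sequentially, switching only the context index.
layer = SuperposedLinear(in_features=768, out_features=2, n_tasks=5)
features = torch.randn(4, 768)            # e.g. sentence embeddings from a text encoder
logits_task0 = layer(features, task_id=0)
logits_task3 = layer(features, task_id=3)
```

Because the contexts are frozen sign vectors, switching tasks costs only an index lookup, which is consistent with the negligible memory and time overhead the abstract attributes to superposition-based continual learning.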
