Modelling continual learning in humans with Hebbian context gating and exponentially decaying task signals.

Researchers

Journal

Modalities

Models

Hebbian context gating Stochastic Gradient Descent

Abstract

Humans can learn several tasks in succession with minimal mutual interference but perform more poorly when trained on multiple tasks at once. The opposite is true for standard deep neural networks. Here, we propose novel computational constraints for artificial neural networks, inspired by earlier work on gating in the primate prefrontal cortex, that capture the cost of interleaved training and allow the network to learn two tasks in sequence without forgetting. We augment standard stochastic gradient descent with two algorithmic motifs, so-called “sluggish” task units and a Hebbian training step that strengthens connections between task units and hidden units that encode task-relevant information. We found that the “sluggish” units introduce a switch-cost during training, which biases representations under interleaved training towards a joint representation that ignores the contextual cue, while the Hebbian step promotes the formation of a gating scheme from task units to the hidden layer that produces orthogonal representations which are perfectly guarded against interference. Validating the model on previously published human behavioural data revealed that it matches performance of participants who had been trained on blocked or interleaved curricula, and that these performance differences were driven by misestimation of the true category boundary.Copyright: © 2023 Flesch et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Show Full Text

Modelling continual learning in humans with Hebbian context gating and exponentially decaying task signals.

Researchers

Journal

Modalities

Models

Abstract

A hand rubbing classification model based on image sequence enhanced by feature-based confidence metric.

Multimode Gesture Recognition Algorithm Based on Convolutional Long Short-Term Memory Network.

Predicting protein network topology clusters from chemical structure using deep learning.

Literature analysis of artificial intelligence in biomedicine.

Deep fake detection and classification using error-level analysis and deep learning.

Leave a Reply Cancel reply

Researchers

Journal

Modalities

Models

Abstract

Similar Posts

Leave a Reply Cancel reply