Map-based experience replay: a memory-efficient solution to catastrophic forgetting in reinforcement learning.

Researchers

Muhammad Burhan Hafez Stefan Wermter Tilman Immisch Tom Weber

Journal

Modalities

Models

Abstract

Deep reinforcement learning (RL) agents often suffer from catastrophic forgetting, forgetting previously found solutions in parts of the input space when training new data. Replay memories are a common solution to the problem by decorrelating and shuffling old and new training samples. They naively store state transitions as they arrive, without regard for redundancy. We introduce a novel cognitive-inspired replay memory approach based on the Grow-When-Required (GWR) self-organizing network, which resembles a map-based mental model of the world. Our approach organizes stored transitions into a concise environment-model-like network of state nodes and transition edges, merging similar samples to reduce the memory size and increase pair-wise distance among samples, which increases the relevancy of each sample. Overall, our study shows that map-based experience replay allows for significant memory reduction with only small decreases in performance.Copyright © 2023 Hafez, Immisch, Weber and Wermter.

Show Full Text

Map-based experience replay: a memory-efficient solution to catastrophic forgetting in reinforcement learning.

Researchers

Journal

Modalities

Models

Abstract

Selection of pre-trained weights for transfer learning in automated cytomegalovirus retinitis classification.

Predicting Bone Metastasis Using Gene Expression-Based Machine Learning Models.

Evaluation of the area subscore of the Palmoplantar Pustulosis Area and Severity Index using an attention U-net deep learning algorithm.

Molecular Autonomous Pathfinder Using Deep Reinforcement Learning.

Automatic classification of dog barking using deep learning.

How data science and AI-based technologies impact genomics.

Leave a Reply Cancel reply

Researchers

Journal

Modalities

Models

Abstract

Similar Posts

Leave a Reply Cancel reply