Emergent Solutions to High-Dimensional Multitask Reinforcement Learning

Abstract

Algorithms that learn through environmental interaction and delayed rewards, or reinforcement learning (RL), increasingly face the challenge of scaling to dynamic, high-dimensional, and partially observable environments. Significant attention is being paid to frameworks from deep learning, which scale to high-dimensional data by decomposing the task through multilayered neural networks. While effective, the representation is complex and computationally demanding. In this work, we propose a framework based on genetic programming which adaptively complexifies policies through interaction with the task. We make a direct comparison with several deep reinforcement learning frameworks in the challenging Atari video game environment as well as more traditional reinforcement learning frameworks based on a priori engineered features. Results indicate that the proposed approach matches the quality of deep learning while being a minimum of three orders of magnitude simpler with respect to model complexity. This results in real-time operation of the champion RL agent without recourse to specialized hardware support. Moreover, the approach is capable of evolving solutions to multiple game titles simultaneously with no additional computational cost. In this case, agent behaviours for an individual game as well as single agents capable of playing all games emerge from the same evolutionary run.
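The core idea in the abstract is that policy complexity is not fixed up front: variable-length programs are evolved against episodic, delayed-reward feedback, and structure grows only when it pays off. The following is a minimal, self-contained Python sketch of that general idea, not the authors' actual framework. The toy environment, register-machine instruction set, population parameters, and every name (`run_episode`, `make_policy`, `mutate`, `evolve`) are illustrative assumptions introduced here.

```python
import random

# --- Toy episodic task with a delayed reward (hypothetical stand-in ---
# --- for a high-dimensional game; everything here is illustrative). ---
def run_episode(policy, steps=50):
    pos, goal = 0.0, 5.0
    for _ in range(steps):
        obs = goal - pos          # observation: signed distance to goal
        pos += policy(obs)        # action in {-1, 0, +1}
        if abs(goal - pos) < 0.5:
            return 1.0            # delayed success reward
    return -abs(goal - pos) / goal  # graded failure penalty

# --- Variable-length register-machine programs as policies ---
OPS = ("add", "sub", "mul", "obs")  # assumed minimal instruction set
N_REG = 4

def random_instr():
    return (random.choice(OPS), random.randrange(N_REG), random.randrange(N_REG))

def make_policy(program):
    """Interpret a linear program; the sign of register 0 is the action."""
    def policy(obs):
        reg = [0.0] * N_REG
        for op, dst, src in program:
            if op == "add":   reg[dst] += reg[src]
            elif op == "sub": reg[dst] -= reg[src]
            elif op == "mul": reg[dst] *= reg[src]
            elif op == "obs": reg[dst] = obs
        return (reg[0] > 0) - (reg[0] < 0)
    return policy

def mutate(program):
    """Mutation can insert instructions, so programs complexify adaptively."""
    child = list(program)
    if random.random() < 0.5 and len(child) < 32:
        child.insert(random.randrange(len(child) + 1), random_instr())
    elif len(child) > 1:
        child[random.randrange(len(child))] = random_instr()
    return child

def evolve(pop_size=50, generations=40):
    pop = [[random_instr()] for _ in range(pop_size)]  # start minimal
    for _ in range(generations):
        scored = sorted(pop, key=lambda p: run_episode(make_policy(p)),
                        reverse=True)
        elite = scored[: pop_size // 5]
        pop = elite + [mutate(random.choice(elite))
                       for _ in range(pop_size - len(elite))]
    return scored[0]

if __name__ == "__main__":
    best = evolve()
    print("fitness:", run_episode(make_policy(best)), "| length:", len(best))
```

The design choice worth noting is that the population is seeded with single-instruction programs and only mutation can grow them, so any structural complexity in the champion is earned through interaction with the task rather than fixed a priori, which mirrors the complexity claim in the abstract at toy scale.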
