Structure-Preserving Imitation Learning With Delayed Reward: An Evaluation Within the RoboCup Soccer 2D Simulation Environment.

Researchers

Journal

Modalities

Models

Abstract

We describe and evaluate a neural network-based architecture aimed to imitate and improve the performance of a fully autonomous soccer team in RoboCup Soccer 2D Simulation environment. The approach utilizes deep Q-network architecture for action determination and a deep neural network for parameter learning. The proposed solution is shown to be feasible for replacing a selected behavioral module in a well-established RoboCup base team, Gliders2d, in which behavioral modules have been evolved with human experts in the loop. Furthermore, we introduce an additional performance-correlated signal (a delayed reward signal), enabling a search for local maxima during a training phase. The extension is compared against a known benchmark. Finally, we investigate the extent to which preserving the structure of expert-designed behaviors affects the performance of a neural network-based solution.
Copyright © 2020 Nguyen and Prokopenko.

Show Full Text

Structure-Preserving Imitation Learning With Delayed Reward: An Evaluation Within the RoboCup Soccer 2D Simulation Environment.

Researchers

Journal

Modalities

Models

Abstract

3D regression neural network for the quantification of enlarged perivascular spaces in brain MRI.

Single-shot T mapping using overlapping-echo detachment planar imaging and a deep convolutional neural network.

TFC-GCN: Lightweight Temporal Feature Cross-Extraction Graph Convolutional Network for Skeleton-Based Action Recognition.

A Deep Convolutional Gated Recurrent Unit for CT Image Reconstruction.

Identifying knot types of polymer conformations by machine learning.

Enhancing preclinical drug discovery with artificial intelligence.

Leave a Reply Cancel reply

Researchers

Journal

Modalities

Models

Abstract

Similar Posts

Leave a Reply Cancel reply