Scalable-MADDPG-Based Cooperative Target Invasion for a Multi-USV System.

Researchers

Cheng-Cheng Wang, Fei Wang, Peng Shi, Yu-Long Wang

Journal

IEEE Transactions on Neural Networks and Learning Systems

Abstract

This article proposes a scalable deep reinforcement learning (DRL) method for a multiple unmanned surface vehicle (multi-USV) system performing cooperative target invasion. The multi-USV system, which is made up of multiple invaders, must invade target areas within a specified time. A novel scalable reinforcement learning (RL) method called Scalable-MADDPG is proposed for the first time. With this method, the scale of the multi-USV system can be changed at any time without interrupting the training process. To mitigate the policy oscillation that occurs after applying Scalable-MADDPG, a bidirectional long short-term memory (Bi-LSTM) network is constructed. Moreover, an improved ϵ-greedy strategy is proposed to help balance exploration and exploitation in RL. Furthermore, to enhance the robustness of the optimal policy, Ornstein-Uhlenbeck (OU) noise is added to this improved ϵ-greedy strategy during training. Finally, the scalable RL method is used to help the multi-USV system perform cooperative target invasion in complex marine environments. The effectiveness of Scalable-MADDPG is demonstrated through three experiments.
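
The abstract does not spell out how the improved ϵ-greedy strategy or its OU noise is defined, so the following is only a generic sketch of the two ingredients it names, not the authors' method. It assumes a continuous action space clipped to [-1, 1], a linearly annealed ϵ, and commonly used OU parameters; the names `OUNoise`, `epsilon_at`, and `select_action` and all default values are hypothetical and chosen for illustration.

```python
import numpy as np


class OUNoise:
    """Discretized Ornstein-Uhlenbeck process (parameter values are assumptions):
    x <- x + theta * (mu - x) * dt + sigma * sqrt(dt) * N(0, 1)
    """

    def __init__(self, dim, mu=0.0, theta=0.15, sigma=0.2, dt=1e-2, seed=0):
        self.mu = mu * np.ones(dim)
        self.theta, self.sigma, self.dt = theta, sigma, dt
        self.rng = np.random.default_rng(seed)
        self.reset()

    def reset(self):
        # Restart the process at its long-run mean (e.g. at episode start).
        self.x = self.mu.copy()

    def sample(self):
        drift = self.theta * (self.mu - self.x) * self.dt
        diffusion = self.sigma * np.sqrt(self.dt) * self.rng.standard_normal(self.x.shape)
        self.x = self.x + drift + diffusion
        return self.x


def epsilon_at(step, eps_start=1.0, eps_end=0.05, decay_steps=10_000):
    """Linearly anneal epsilon from eps_start to eps_end (an assumed schedule)."""
    frac = min(step / decay_steps, 1.0)
    return eps_start + frac * (eps_end - eps_start)


def select_action(policy_action, step, noise, rng, low=-1.0, high=1.0):
    """With probability epsilon, explore by adding OU noise; otherwise exploit."""
    action = np.asarray(policy_action, dtype=float)
    if rng.random() < epsilon_at(step):
        action = action + noise.sample()
    return np.clip(action, low, high)
```

In a training loop of this kind, `noise.reset()` would typically be called at the start of each episode, and `select_action` would wrap the output of each agent's actor network before the action is sent to the environment.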
