Research on reinforcement learning-based safe decision-making methodology for multiple unmanned aerial vehicles.

Researchers

Jialiang Zuo, Longfei Yue, Rennong Yang, Ying Zhang

Journal

Modalities

Models

Constrained Markov Decision Process, Soft Actor-Critic, Transfer Learning

Abstract

A system of multiple cooperating unmanned aerial vehicles (multi-UAVs) can exploit its collective advantages to accomplish complicated tasks. Recent developments in deep reinforcement learning (DRL) offer good prospects for decision-making in multi-UAV systems. However, the safety and training efficiency of DRL still need to be improved before practical use. This study presents a transfer-safe soft actor-critic (TSSAC) for multi-UAV decision-making. Decision-making by each UAV is modeled as a constrained Markov decision process (CMDP), in which the return is maximized subject to safety constraints. The soft actor-critic-Lagrangian (SAC-Lagrangian) algorithm is combined with a modified Lagrangian multiplier in the CMDP model. Moreover, parameter-based transfer learning is used to enable cooperative and efficient training of the tasks across the UAVs. Simulation experiments indicate that the proposed method can improve safety and training efficiency and allow the UAVs to adapt to a dynamic scenario.

Copyright © 2023 Yue, Yang, Zhang and Zuo.
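The Lagrangian treatment of a CMDP mentioned in the abstract can be illustrated with a minimal sketch. It shows the standard dual-ascent update of a multiplier and the resulting penalized reward; it is not the paper's modified multiplier, whose form the abstract does not give. The function names, the learning rate, and the cost limit are all hypothetical and chosen only for illustration.

```python
# Generic Lagrangian relaxation of a constrained MDP (illustrative only).
# Assumption: the constraint is "average episode cost <= cost_limit".

def update_multiplier(lam: float, avg_cost: float, cost_limit: float,
                      lr: float = 0.05) -> float:
    """One dual-ascent step on the Lagrange multiplier.

    The multiplier grows when the measured cost exceeds the limit and
    shrinks otherwise; it is projected back onto [0, inf).
    """
    return max(0.0, lam + lr * (avg_cost - cost_limit))


def penalized_reward(reward: float, cost: float, lam: float) -> float:
    """Reward seen by the policy once the safety cost is priced in."""
    return reward - lam * cost


if __name__ == "__main__":
    lam = 0.0
    # Hypothetical per-episode average costs, with a limit of 1.0.
    for avg_cost in [3.0, 2.0, 1.5, 0.5]:
        lam = update_multiplier(lam, avg_cost, cost_limit=1.0, lr=0.5)
        print(f"avg_cost={avg_cost:.1f} -> lambda={lam:.2f}")
```

In a SAC-style learner the penalized reward (or an equivalent cost critic weighted by the multiplier) would replace the plain reward in the actor objective, so the policy trades return against safety cost as the multiplier adapts.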
