Other

An Improved Prioritized DDPG Based on Fractional-Order Learning Scheme.

May 8, 2024 Other

Researchers

Bin Xu Meiying Cai Quan-Yong Fan

Journal

IEEE transactions on neural networks and learning systems

Modalities

Models

Deep deterministic policy gradient (DDPG)

Abstract

Although deep deterministic policy gradient (DDPG) algorithm gets widespread attention as a result of its powerful functionality and applicability for large-scale continuous control, it cannot be denied that DDPG has problems such as low sample utilization efficiency and insufficient exploration. Therefore, an improved DDPG is presented to overcome these challenges in this article. Firstly, an optimizer based on fractional gradient is introduced into the algorithm network, which is conductive to increase the speed and accuracy of training convergence. On this basis, high-value experience replay based on weight-changed priority is proposed to improve sample utilization efficiency, and aiming to have a stronger exploration of the environment, an optimized exploration strategy for boundary action space is adopted. Finally, our proposed method is tested through the experiments of gym and pybullet platform. According to the results, our method speeds up the learning process, obtains higher average rewards in comparison with other algorithms.

Show Full Text

An Improved Prioritized DDPG Based on Fractional-Order Learning Scheme.

Researchers

Journal

Modalities

Models

Abstract

Explaining the Neuroevolution of Fighting Creatures Through Virtual fMRI.

Local Self-Expression Subspace Learning Network for Motion Capture Data.

Combating the Infodemic: A Chinese Infodemic Dataset for Misinformation Identification.

Invertible Residual Blocks in Deep Learning Networks.

A timely and accurate approach to nearshore oil spill monitoring using deep learning and GIS.

Daily Human Activity Recognition Using Non-Intrusive Sensors.

Leave a Reply Cancel reply

Researchers

Journal

Modalities

Models

Abstract

Similar Posts

Leave a Reply Cancel reply