Stochastic Integrated Actor-Critic for Deep Reinforcement Learning.

Researchers

Jiaohao Zheng Mehmet Necip Kurt Xiaodong Wang

Journal

IEEE transactions on neural networks and learning systems

Modalities

Models

deep reinforcement learning Stochastic Actor-Critic

Abstract

We propose a deep stochastic actor-critic algorithm with an integrated network architecture and fewer parameters. We address stabilization of the learning procedure via an adaptive objective to the critic’s loss and a smaller learning rate for the shared parameters between the actor and the critic. Moreover, we propose a mixed on-off policy exploration strategy to speed up learning. Experiments illustrate that our algorithm reduces the sample complexity by 50%-93% compared with the state-of-the-art deep reinforcement learning (RL) algorithms twin delayed deep deterministic policy gradient (TD3), soft actor-critic (SAC), proximal policy optimization (PPO), advantage actor-critic (A2C), and interpolated policy gradient (IPG) over continuous control tasks LunarLander, BipedalWalker, BipedalWalkerHardCore, Ant, and Minitaur in the OpenAI Gym.

Show Full Text

Stochastic Integrated Actor-Critic for Deep Reinforcement Learning.

Researchers

Journal

Modalities

Models

Abstract

Artificial Intelligence Applications in Oral Cancer and Oral Dysplasia.

Deep Multilabel Multilingual Document Learning for Cross-Lingual Document Retrieval.

Development and Validation of a Deep Learning Method to Predict Cerebral Palsy From Spontaneous Movements in Infants at High Risk.

Effective Techniques for Multimodal Data Fusion: A Comparative Analysis.

Automatic Screening and Identifying Myopic Maculopathy on Optical Coherence Tomography Images Using Deep Learning.

Rotation-Invariant Attention Network for Hyperspectral Image Classification.

Leave a Reply Cancel reply

Researchers

Journal

Modalities

Models

Abstract

Similar Posts

Leave a Reply Cancel reply