Other

RAP Vol: Robust Adversary Populations With Volume Diversity Measure.

October 3, 2023 Other

Researchers

Jiachen Yang Jipeng Zhang

Journal

IEEE transactions on neural networks and learning systems

Modalities

Models

Deep Reinforcement Learning (DRL)

Abstract

Deep reinforcement learning (DRL) algorithms have made remarkable achievements in various fields, but they are vulnerable to changes in environment dynamics. This vulnerability easily leads to poor generalization, low performance, and catastrophic failures in unseen environments, which severely hinders the application of DRL in real-world scenarios. The robustness via adversary populations (RAP) algorithm addresses this issue by introducing a population of adversaries that perturb the protagonist. However, the low data utilization efficiency and lack of population diversity greatly limit the generalization performance. This article proposes robust adversary populations with volume diversity measure (RAP Vol) to address these drawbacks. In the proposed joint adversarial training framework, we use the training data to update all adversaries rather than only a single adversary, leading to a higher data utilization efficiency and a fast convergence speed. In the proposed population diversity iterative improvement mechanism, the vectors representing adversaries span a high-dimensional region. The volume of this region is utilized to measure and enhance population diversity via its square. The ablation experiments have verified the effectiveness of our proposed method in improving the robustness against variations in environment dynamics. Also, the influence of various factors (such as adversary population size and diversity weight) on the robustness has been investigated.

Show Full Text

RAP Vol: Robust Adversary Populations With Volume Diversity Measure.

Researchers

Journal

Modalities

Models

Abstract

ROOD-MRI: Benchmarking the robustness of deep learning segmentation models to out-of-distribution and corrupted data in MRI.

Time-Frequency Aliased Signal Identification Based on Multimodal Feature Fusion.

The inverse variance-flatness relation in stochastic gradient descent is critical for finding flat minima.

Human-Unrecognizable Differential Private Noised Image Generation Method.

Deep Multi-View Enhancement Hashing for Image Retrieval.

Prediction of stock price movement using an improved NSGA-II-RF algorithm with a three-stage feature engineering process.

Leave a Reply Cancel reply

Researchers

Journal

Modalities

Models

Abstract

Similar Posts

Leave a Reply Cancel reply