Path planning of mobile robot based on improved TD3 algorithm in dynamic environment.

Researchers

Donghui Chen Lanyong Zhang Peng Li Shiquan Zhao Yuchen Wang

Journal

Modalities

Models

Gazebo ROS TD3 Twin Delayed Deep Deterministic Policy Gradient

Abstract

This paper proposes an improved TD3 (Twin Delayed Deep Deterministic Policy Gradient) algorithm to address the flaws of low success rate and slow training speed, when using the original TD3 algorithm in mobile robot path planning in dynamic environment. Firstly, prioritized experience replay and transfer learning are introduced to enhance the learning efficiency, where the probability of beneficial experiences being sampled in the experience pool is increased, and the pre-trained model is applied in an obstacle-free environment as the initial model for training in a dynamic environment. Secondly, dynamic delay update strategy is devised and OU noise is added to improve the success rate of path planning, where the probability of missing high-quality value estimate is reduced through changing the delay update interval dynamically, and the correlated exploration of the mobile robot inertial navigation system in the dynamic environment is temporally improved. The algorithm is tested by simulation where the Turtlebot3 robot model as a training object, the ROS melodic operating system and Gazebo simulation software as an experimental environment. Meanwhile, the result shows that the improved TD3 algorithm has a 16.6 % increase in success rate and a 23.5 % reduction in algorithm training time. A generalization experiment was designed finally, and it indicates that superior generation performance has been acquired in mobile robot path planning with continuous action spaces through the improved TD3 algorithm.© 2024 The Authors. Published by Elsevier Ltd.

Show Full Text

Path planning of mobile robot based on improved TD3 algorithm in dynamic environment.

Researchers

Journal

Modalities

Models

Abstract

From Scalp to Ear-EEG: A Generalizable Transfer Learning Model for Automatic Sleep Scoring in Older People.

Deep learning system for classification of ploidy status using time-lapse videos.

Altruistic Collaborative Learning.

Automatic literature screening using the PAJO deep-learning model for clinical practice guidelines.

Integration of improved YOLOv5 for face mask detector and auto-labeling to generate dataset for fighting against COVID-19.

Criteria for implementing artificial intelligence systems in reproductive medicine.

Leave a Reply Cancel reply

Researchers

Journal

Modalities

Models

Abstract

Similar Posts

Leave a Reply Cancel reply