Integrating Heuristic Methods with Deep Reinforcement Learning for Online 3D Bin-Packing Optimization.

Researchers

Can-Kun Ou Ching-Chang Wong Tai-Ting Tsai

Journal

Modalities

Models

deep reinforcement learning Heuristic algorithms Proximal Policy Optimization

Abstract

This study proposes a method named Hybrid Heuristic Proximal Policy Optimization (HHPPO) to implement online 3D bin-packing tasks. Some heuristic algorithms for bin-packing and the Proximal Policy Optimization (PPO) algorithm of deep reinforcement learning are integrated to implement this method. In the heuristic algorithms for bin-packing, an extreme point priority sorting method is proposed to sort the generated extreme points according to their waste spaces to improve space utilization. In addition, a 3D grid representation of the space status of the container is used, and some partial support constraints are proposed to increase the possibilities for stacking objects and enhance overall space utilization. In the PPO algorithm, some heuristic algorithms are integrated, and the reward function and the action space of the policy network are designed so that the proposed method can effectively complete the online 3D bin-packing task. Some experimental results illustrate that the proposed method has good results in achieving online 3D bin-packing tasks in some simulation environments. In addition, an environment with image vision is constructed to show that the proposed method indeed enables an actual robot manipulator to successfully and effectively complete the bin-packing task in a real environment.

Show Full Text

Integrating Heuristic Methods with Deep Reinforcement Learning for Online 3D Bin-Packing Optimization.

Researchers

Journal

Modalities

Models

Abstract

RU-SLAM: A Robust Deep-Learning Visual Simultaneous Localization and Mapping (SLAM) System for Weakly Textured Underwater Environments.

Recognition of Grasping Patterns Using Deep Learning for Human-Robot Collaboration.

Affinity Space Adaptation for Semantic Segmentation Across Domains.

Motion sensitive network for action recognition in control and decision-making of autonomous systems.

Empowering natural product science with AI: leveraging multimodal data and knowledge graphs.

An automatic parathyroid recognition and segmentation model based on deep learning of near-infrared autofluorescence imaging.

Leave a Reply Cancel reply

Researchers

Journal

Modalities

Models

Abstract

Similar Posts

Leave a Reply Cancel reply