Deep Reinforcement Learning for Solving Vehicle Routing Problems With Backhauls.

Researchers

Conghui Wang Guohua Wu Long Teng Yaoxin Wu Zhiguang Cao

Journal

IEEE transactions on neural networks and learning systems

Modalities

Models

deep reinforcement learning Encoder-Decoder Structured Policy Network Heterogeneous Attention Neural Heuristic self-attention

Abstract

The vehicle routing problem with backhauls (VRPBs) is a challenging problem commonly studied in computer science and operations research. Featured by linehaul (or delivery) and backhaul (or pickup) customers, the VRPB has broad applications in real-world logistics. In this article, we propose a neural heuristic based on deep reinforcement learning (DRL) to solve the traditional and improved VRPB variants, with an encoder-decoder structured policy network trained to sequentially construct the routes for vehicles. Specifically, we first describe the VRPB based on a graph and cast the solution construction as a Markov decision process (MDP). Then, to identify the relationship among the nodes (i.e., linehaul and backhaul customers, and the depot), we design a two-stage attention-based encoder, including a self-attention and a heterogeneous attention for each stage, which could yield more informative representations of the nodes so as to deliver high-quality solutions. The evaluation on the two VRPB variants reveals that, our neural heuristic performs favorably against both the conventional and neural heuristic baselines on randomly generated instances and benchmark instances. Moreover, the trained policy network exhibits a desirable capability of generalization to various problem sizes and distributions.

Show Full Text

Deep Reinforcement Learning for Solving Vehicle Routing Problems With Backhauls.

Researchers

Journal

Modalities

Models

Abstract

DISCO: A deep learning ensemble for uncertainty-aware segmentation of acoustic signals.

Dendritic Growth Optimization: A Novel Nature-Inspired Algorithm for Real-World Optimization Problems.

Real-time artificial intelligence-based histological classification of colorectal polyps with augmented visualization.

CBD: A Deep-Learning-Based Scheme for Encrypted Traffic Classification with a General Pre-Training Method.

A Simulator and First Reinforcement Learning Results for Underwater Mapping.

DNA Encoding-based Nucleotide Pattern and Deep Features for Instance and Class-based Image Retrieval.

Leave a Reply Cancel reply

Researchers

Journal

Modalities

Models

Abstract

Similar Posts

Leave a Reply Cancel reply