
Guidelines for RNN Transfer Learning Based Molecular Generation of Focussed Libraries.

July 13, 2020 Computer Science, Pharmaceutical Sciences

Researchers

Darren Green, Peter Pogány, Silvia Amabilino, Stephen D Pickett

Journal

Journal of Chemical Information and Modeling

Modalities

Models

GRU-RNN, RNN

Abstract

Deep learning approaches have become popular in recent years in the field of de novo molecular design. While a variety of different methods are available, it is still a challenge to assess and compare their performance. A particularly promising approach for automated drug design is to use recurrent neural networks (RNNs) as SMILES generators and train them with the learning procedure called ‘transfer learning’. This involves first training the initial model on a large generic data set of molecules, to learn the general syntax of SMILES, followed by fine-tuning on a smaller set of molecules drawn from, for example, a lead optimization program. In order to create a well-performing transfer learning application which can be automated, it is important to understand how the size of the second data set affects the training process. In addition, extensive post-filtering of the molecules generated after transfer learning using similarity metrics should be avoided, as it can introduce new biases into the selection of drug candidates. Here we present results from the application of a GRU-RNN to transfer learning on data sets of varying sizes and complexity. Analysis of the results has allowed us to provide some general guidelines for transfer learning. In particular, we show that data sets containing at least 190 molecules are needed for effective GRU-RNN based molecular generation using transfer learning. The methods presented here should be applicable generally to other deep learning methodologies.
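
The two-stage procedure described in the abstract (generic pre-training followed by fine-tuning on a small focused set) can be illustrated with a toy sketch. This is not the authors' GRU-RNN: a character-level bigram count model is swapped in so the example runs with the Python standard library alone. The molecule lists, the `weight=5.0` fine-tuning factor, the `<s>`/`</s>` sentinels and the function names `train` and `sample` are all hypothetical choices made for illustration.

```python
import random
from collections import defaultdict

# Multi-character sentinels (assumed here) cannot collide with
# single-character SMILES symbols.
START, END = "<s>", "</s>"

def train(counts, smiles_list, weight=1.0):
    """Add weighted character-bigram counts from each SMILES string."""
    for smi in smiles_list:
        chars = [START] + list(smi) + [END]
        for prev, nxt in zip(chars, chars[1:]):
            counts[prev][nxt] += weight
    return counts

def sample(counts, rng, max_len=40):
    """Sample one string by walking the bigram table from START until END."""
    out, prev = [], START
    for _ in range(max_len):
        choices = counts[prev]
        nxt = rng.choices(list(choices), weights=list(choices.values()))[0]
        if nxt == END:
            break
        out.append(nxt)
        prev = nxt
    return "".join(out)

# Stage 1: "pre-train" on a generic set (toy stand-in for a large database).
generic = ["CCO", "CCN", "CCCC", "c1ccccc1", "CC(=O)O", "CCOC"]
# Stage 2: "fine-tune" on a small focused set (toy stand-in for a lead series).
focused = ["c1ccccc1O", "c1ccccc1N", "c1ccccc1C"]

counts = defaultdict(lambda: defaultdict(float))
train(counts, generic)
# Probability that a generated string starts with an aromatic carbon.
pretrained_p = counts[START]["c"] / sum(counts[START].values())

# Fine-tuning continues from the pre-trained counts instead of starting over.
train(counts, focused, weight=5.0)
finetuned_p = counts[START]["c"] / sum(counts[START].values())

rng = random.Random(0)
samples = [sample(counts, rng) for _ in range(5)]
print(f"P(start='c'): {pretrained_p:.3f} -> {finetuned_p:.3f}")
print(samples)
```

After the second stage the model's statistics shift towards the focused set while the generic counts are retained, which is the essence of transfer learning. A bigram model has no memory of open rings or branches, so many of its samples will not be valid SMILES; a recurrent model such as a GRU is used in practice precisely because it can track that longer-range syntax.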


