|

DL-SMILES#: A Novel Encoding Scheme for Predicting Compound Protein Affinity by Deep Learning.

Researchers

Journal

Modalities

Models

Abstract

Drug repositioning aims to screen drugs and therapeutic goals from approved drugs and abandoned compounds that have been identified as safe. This trend is changing the landscape of drug development and creating a model of drug repositioning for new drug development. In recent decade, machine learning methods have been applied to predict the binding affinity of compound proteins, while deep learning is recently becoming hot and achieving significant performances. Among the models, the way of representing the compounds is usually simple, which is the molecular fingerprints, i.e., a single SMILES string.
In this work, we improve it by proposing a novel representing manner, named SMILES#, to recode the SMILES string. This approach takes into account the properties of compounds and achieves well performance. After that, we propose a deep learning model that combines recurrent neural networks with a convolutional neural network with an attentional mechanism, using unlabeled data and labeled data to jointly encode molecular and predict binding affinity.
Experimental results show that SMILES# with compound properties can effectively improve the accuracy of the model and reduce the RMS error on most data sets.
Then, we used the method to verify the related and unrelated compounds with the same target, and the experimental results show the effectiveness of the method.
Copyright© Bentham Science Publishers; For any queries, please email at [email protected].

Similar Posts

Leave a Reply

Your email address will not be published. Required fields are marked *