DeepAS – Chemical language model for the extension of active analogue series.

Researchers

Journal

Modalities

Models

Abstract

In medicinal chemistry, hit-to-lead and lead optimization efforts produce analogue series (ASs), the analysis of which is of central relevance for the exploration and exploitation of structure-activity relationships (SARs) and generation of candidate compounds. The key question in any chemical optimization effort is which analogue(s) to generate next, for which computational support is typically provided through QSAR analysis and compound potency predictions. In this study, we introduce a new chemical language model for analogue design via deep learning. For this purpose, ASs comprising active compounds are ordered according to increasing potency and the chemical language model predicts preferred R-groups for new analogues on the basis of ordered R-group sequences. Hence, consistent with the principles of deep models for natural language processing, analogues with new R-groups are predicted based upon conditional probabilities taking preceding groups into account. This implicitly accounts for the potency gradient captured by an AS and detectable SAR trends, providing a new concept for analogue design. Herein, we report the AS-based chemical language model, its initial evaluation, and exemplary applications.Copyright © 2022 Elsevier Ltd. All rights reserved.

Similar Posts

Leave a Reply

Your email address will not be published. Required fields are marked *