Drug Discovery | Pharmacology

Finding the most potent compounds using active learning on molecular pairs.

September 3, 2024 Drug Discovery, Pharmacology

Researchers

Daniel Reker Zachary Fralish

Journal

Beilstein journal of organic chemistry

Modalities

Models

Chemprop Random Forest XGBoost

Abstract

Active learning allows algorithms to steer iterative experimentation to accelerate and de-risk molecular optimizations, but actively trained models might still exhibit poor performance during early project stages where the training data is limited and model exploitation might lead to analog identification with limited scaffold diversity. Here, we present ActiveDelta, an adaptive approach that leverages paired molecular representations to predict improvements from the current best training compound to prioritize further data acquisition. We apply the ActiveDelta concept to both graph-based deep (Chemprop) and tree-based (XGBoost) models during exploitative active learning for 99 Ki benchmarking datasets. We show that both ActiveDelta implementations excel at identifying more potent inhibitors compared to the standard exploitative active learning implementations of Chemprop, XGBoost, and Random Forest. The ActiveDelta approach is also able to identify more chemically diverse inhibitors in terms of their Murcko scaffolds. Finally, deep models such as Chemprop trained on data selected through ActiveDelta approaches can more accurately identify inhibitors in test data created through simulated time-splits. Overall, this study highlights the large potential for molecular pairing approaches to further improve popular active learning strategies in low data regimes by enabling faster and more accurate identification of more diverse molecular hits against critical drug targets.Copyright © 2024, Fralish and Reker.

Show Full Text

Finding the most potent compounds using active learning on molecular pairs.

Researchers

Journal

Modalities

Models

Abstract

Crosslinked-hybrid nanoparticle embedded in thermogel for sustained co-delivery to inner ear.

Compound Activity Prediction with Dose-Dependent Transcriptomic Profiles and Deep Learning.

Binding Activity Classification of Anti-SARS-CoV-2 Molecules using Deep Learning across Multiple Assays.

A similarity-based deep learning approach for determining the frequencies of drug side effects.

Data augmentation and multimodal learning for predicting drug response in patient-derived xenografts from gene expressions and histology images.

OCTAD: an open workspace for virtually screening therapeutics targeting precise cancer patient groups using gene expression features.

Leave a Reply Cancel reply

Researchers

Journal

Modalities

Models

Abstract

Similar Posts

Leave a Reply Cancel reply