Interpretable modeling of genotype-phenotype landscapes with state-of-the-art predictive power.

Journal

Proceedings of the National Academy of Sciences of the United States of America

Abstract

Large-scale measurements linking genetic background to biological function have driven a need for models that can incorporate these data for reliable predictions and insight into the underlying biophysical system. Recent modeling efforts, however, prioritize predictive accuracy at the expense of model interpretability. Here, we present LANTERN (landscape interpretable nonparametric model, https://github.com/usnistgov/lantern), a hierarchical Bayesian model that distills genotype-phenotype landscape (GPL) measurements into a low-dimensional feature space that represents the fundamental biological mechanisms of the system while also enabling straightforward, explainable predictions. Across a benchmark of large-scale datasets, LANTERN equals or outperforms all alternative approaches, including deep neural networks. LANTERN furthermore extracts useful insights into the landscape, including its inherent dimensionality, a latent space of additive mutational effects, and metrics of landscape structure. LANTERN facilitates straightforward discovery of fundamental mechanisms in GPLs, while also reliably extrapolating to unexplored regions of genotypic space.
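
The idea of a "latent space of additive mutational effects" can be illustrated with a small sketch. This is not LANTERN's code or API: every name below (`W`, `encode`, `latent_position`, `toy_surface`) is hypothetical, the effect vectors are random rather than learned, and a fixed sigmoid stands in for whatever nonlinear surface a fitted model would learn. It only shows that when each mutation contributes a fixed vector, a multi-mutant genotype's latent position is the sum of its single-mutation positions, while the phenotype read off that position can still be nonlinear.

```python
import numpy as np

rng = np.random.default_rng(0)

n_mutations = 6  # hypothetical number of distinct mutations
n_latent = 2     # hypothetical latent dimensionality

# Hypothetical per-mutation effect vectors (one row per mutation).
# In a fitted model these would be learned from data, not sampled.
W = rng.normal(size=(n_mutations, n_latent))


def encode(mutations, n_mutations):
    """One-hot encode a genotype given the indices of its mutations."""
    x = np.zeros(n_mutations)
    x[np.array(sorted(mutations), dtype=int)] = 1.0
    return x


def latent_position(x, W):
    """Additive step: sum the effect vectors of the mutations present."""
    return x @ W


def toy_surface(z):
    """Placeholder nonlinear map from latent position to phenotype.

    A fixed sigmoid is an assumption made for illustration only.
    """
    return 1.0 / (1.0 + np.exp(-z.sum(axis=-1)))


x_wt = encode(set(), n_mutations)   # wild type: no mutations
x_a = encode({0}, n_mutations)      # single mutant a
x_b = encode({3}, n_mutations)      # single mutant b
x_ab = encode({0, 3}, n_mutations)  # double mutant a+b

z_wt, z_a, z_b, z_ab = (latent_position(x, W) for x in (x_wt, x_a, x_b, x_ab))

# Effects add in the latent space...
additive_in_latent = bool(np.allclose(z_ab, z_a + z_b))

# ...while phenotypes are read off through the nonlinear surface.
phenotypes = {name: float(toy_surface(z))
              for name, z in [("wt", z_wt), ("a", z_a), ("b", z_b), ("ab", z_ab)]}
```

In this toy setup, any non-additivity between mutations in the phenotype comes entirely from the curvature of the surface, which is one way to read the abstract's separation between additive effects and landscape structure.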
