Enhancing efficiency of protein language models with minimal wet-lab data through few-shot learning.

Researchers

Banghao Wu Liang Hong Liang Zhang Mingchen Li Pan Tan Yuanxi Yu Ziyi Zhou

Abstract

Accurately modeling protein fitness landscapes is of great importance for protein engineering. Pre-trained protein language models have achieved state-of-the-art performance in predicting protein fitness without wet-lab experimental data, but their accuracy and interpretability remain limited. Traditional supervised deep learning models, on the other hand, require abundant labeled training examples to improve performance, which poses a practical barrier. In this work, we introduce FSFP, a training strategy that can effectively optimize protein language models for fitness prediction under extreme data scarcity. By combining meta-transfer learning, learning to rank, and parameter-efficient fine-tuning, FSFP can significantly boost the performance of various protein language models using merely tens of labeled single-site mutants from the target protein. In silico benchmarks across 87 deep mutational scanning datasets demonstrate FSFP’s superiority over both unsupervised and supervised baselines. Furthermore, we successfully apply FSFP to engineer the Phi29 DNA polymerase through wet-lab experiments, achieving a 25% increase in the positive rate. These results underscore the potential of our approach in aiding AI-guided protein engineering.

© 2024. The Author(s).
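To make the "learning to rank" ingredient concrete, the sketch below shows one common listwise ranking objective, ListMLE, applied to a handful of mutants. The abstract does not say which ranking loss FSFP uses, so the choice of ListMLE is an assumption made for illustration, and the function name `listmle_loss` and the toy numbers are hypothetical. The idea is that the model is rewarded for ordering mutants correctly by fitness rather than for matching the measured values exactly.

```python
import math


def listmle_loss(scores, fitness):
    """ListMLE loss: negative log-likelihood of the measured fitness ordering
    under a Plackett-Luce model defined by the predicted scores.

    Illustrative assumption only; not taken from the FSFP paper.
    """
    # Order mutant indices by measured fitness, fittest first.
    order = sorted(range(len(fitness)), key=lambda i: fitness[i], reverse=True)
    ranked = [scores[i] for i in order]

    loss = 0.0
    for k in range(len(ranked)):
        tail = ranked[k:]
        # Numerically stable log-sum-exp over the mutants not yet placed.
        m = max(tail)
        log_sum_exp = m + math.log(sum(math.exp(s - m) for s in tail))
        # Log-probability that the k-th fittest mutant is picked next.
        loss += log_sum_exp - ranked[k]
    return loss


# Toy example: three mutants with measured fitness values.
fitness = [0.1, 0.9, 0.5]
agreeing_scores = [-1.0, 2.0, 0.5]   # same ordering as the measurements
reversed_scores = [2.0, -1.0, 0.5]   # best and worst mutants swapped

print(listmle_loss(agreeing_scores, fitness) < listmle_loss(reversed_scores, fitness))
```

Because the loss depends only on the ordering implied by the labels, it can be computed from a few tens of labeled mutants without requiring the model's scores to be calibrated to the assay's scale.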
