An explainable language model for antibody specificity prediction using curated influenza hemagglutinin antibodies.

Researchers

Akshita B Gopal Claire S Graham Danbi Choi Huibin Lv Ivana R Shen Nicholas C Wu Qi Wen Teo Ruipeng Lei Timothy J C Tan Xin Chen Yiquan Wang Yuen-Hei Yeung

Journal

bioRxiv : the preprint server for biology

Modalities

Models

Language model

Abstract

Despite decades of antibody research, it remains challenging to predict the specificity of an antibody solely based on its sequence. Two major obstacles are the lack of appropriate models and inaccessibility of datasets for model training. In this study, we curated a dataset of >5,000 influenza hemagglutinin (HA) antibodies by mining research publications and patents, which revealed many distinct sequence features between antibodies to HA head and stem domains. We then leveraged this dataset to develop a lightweight memory B cell language model (mBLM) for sequence-based antibody specificity prediction. Model explainability analysis showed that mBLM captured key sequence motifs of HA stem antibodies. Additionally, by applying mBLM to HA antibodies with unknown epitopes, we discovered and experimentally validated many HA stem antibodies. Overall, this study not only advances our molecular understanding of antibody response to influenza virus, but also provides an invaluable resource for applying deep learning to antibody research.

Show Full Text

An explainable language model for antibody specificity prediction using curated influenza hemagglutinin antibodies.

Researchers

Journal

Modalities

Models

Abstract

Bone age assessment based on deep neural networks with annotation-free cascaded critical bone region extraction.

Jellyfish Search-Optimized Deep Learning for Compressive Strength Prediction in Images of Ready-Mixed Concrete.

Deep Learning Methods to Predict Mortality in COVID-19 Patients: A Rapid Scoping Review.

Predicting ammonia nitrogen in surface water by a new attention-based deep learning hybrid model.

Comparison between supervised and physics-informed unsupervised deep neural networks for estimating cerebral perfusion using multi-delay arterial spin labeling MRI.

Mesh2SSM: From Surface Meshes to Statistical Shape Models of Anatomy.

Leave a Reply Cancel reply

Researchers

Journal

Modalities

Models

Abstract

Similar Posts

Leave a Reply Cancel reply