Protein-ligand binding residue prediction enhancement through hybrid deep heterogeneous learning of sequence and structure data.

Abstract

Knowledge of protein-ligand binding residues is important for understanding the functions of proteins and their interaction mechanisms. From experimentally solved protein structures, how to accurately identify its potential binding sites of a specific ligand on the protein is still a challenging problem. Compared with structure-alignment-based methods, machine learning algorithms provide an alternative flexible solution which is less dependent on annotated homogeneous protein structures. Several factors are important for an efficient protein-ligand prediction model, e.g. discriminative feature representation and effective learning architecture to deal with both the large-scale and severe imbalanced data.
In this study, we propose a novel deep-learning-based method called DELIA for protein-ligand binding residue prediction. In DELIA, a hybrid deep neural network is designed to integrate 1D sequence-based features with 2D structure-based amino acid distance matrices. In order to overcome the problem of severe data imbalance between the binding and non-binding residues, strategies of oversampling in mini-batch, random under-sampling, and stacking ensemble strategy are designed to enhance the model. Experimental results on five benchmark datasets demonstrate the effectiveness of proposed DELIA pipeline.
The web server of DELIA is available at www.csbio.sjtu.edu.cn/bioinf/delia/.
Supplementary data are available at Bioinformatics online.
© The Author(s) (2020). Published by Oxford University Press. All rights reserved. For Permissions, please email: [email protected].

Show Full Text

Protein-ligand binding residue prediction enhancement through hybrid deep heterogeneous learning of sequence and structure data.

Researchers

Journal

Modalities

Models

Abstract

A Deep Learning Filter that Blocks Phishing Campaigns Using Intelligent English Text Recognition Methods.

Automated Detection and Scoring of Tumor-Infiltrating Lymphocytes in Breast Cancer Histopathology Slides.

Review and Prospect: Deep Learning in Nuclear Magnetic Resonance Spectroscopy.

BioBERT: a pre-trained biomedical language representation model for biomedical text mining.

A Deep Learning Network Approach to ab initio Protein Secondary Structure Prediction.

Engineering and clinical use of artificial intelligence (AI) with machine learning and data science advancements: radiology leading the way for future.

Leave a Reply Cancel reply

Researchers

Journal

Modalities

Models

Abstract

Similar Posts

Leave a Reply Cancel reply