Empirical Study of Protein Feature Representation on Deep Belief Networks Trained with Small Data for Secondary Structure Prediction.

Researchers

Chee-Keong Kwoh Shamima Rashid Suresh Sundaram

Journal

IEEE/ACM transactions on computational biology and bioinformatics

Modalities

Models

C-Alpha C-Beta Side Chain (CABS)-algorithm Deep Belief Networks (DBN)

Abstract

Protein secondary structure (SS) prediction is a classic problem of computational biology and is widely used in structural characterization and to infer homology. While most SS predictors have been trained on thousands of sequences, a previous approach had developed a compact model of training proteins that used a C<\b>-A<\b>lpha, C-B<\b>eta S<\b>ide Chain (CABS)-algorithm derived energy based feature representation. Here, the previous approach is extended to Deep Belief Networks (DBN). Deep learning methods are notorious for requiring large datasets and there is a wide consensus that training deep models from scratch on small datasets, works poorly. By contrast, we demonstrate a simple DBN architecture containing a single hidden layer, trained only on the CB513 dataset. Testing on an independent set of G Switch proteins improved the Q3 score of the previous compact model by almost 3%. The findings are further confirmed by comparison to several deep learning models which are trained on thousands of proteins. Finally, the DBN performance is also compared with Position Specific Scoring Matrix (PSSM)-profile based feature representation. The importance of (i) structural information in protein feature representation and (ii) complementary small dataset learning approaches for detection of structural fold switching are demonstrated.

Show Full Text

Empirical Study of Protein Feature Representation on Deep Belief Networks Trained with Small Data for Secondary Structure Prediction.

Researchers

Journal

Modalities

Models

Abstract

Automatic detection of COVID-19 from chest CT scan and chest X-Rays images using deep learning, transfer learning and stacking.

Deep-learning based 3D birefringence image generation using 2D multi-view holographic images.

A comprehensive survey of deep learning research on medical image analysis with focus on transfer learning.

Improving the Concrete Crack Detection Process via a Hybrid Visual Transformer Algorithm.

Methodological Considerations in Longitudinal Analyses of Microbiome Data: A Comprehensive Review.

Application of Big Data Technology to Promote Agricultural Structure Adjustment and High-Quality Development of Modern Agriculture.

Leave a Reply Cancel reply

Researchers

Journal

Modalities

Models

Abstract

Similar Posts

Leave a Reply Cancel reply