A self-supervised deep learning method for data-efficient training in genomics.

Researchers

Alice C McHardy Bernd Bischl Hüseyin Anil Gündüz Martin Binder Mina Rezaei Philipp C Münch René Mreches Xiao-Yin To

Journal

Modalities

Models

Abstract

Deep learning in bioinformatics is often limited to problems where extensive amounts of labeled data are available for supervised classification. By exploiting unlabeled data, self-supervised learning techniques can improve the performance of machine learning models in the presence of limited labeled data. Although many self-supervised learning methods have been suggested before, they have failed to exploit the unique characteristics of genomic data. Therefore, we introduce Self-GenomeNet, a self-supervised learning technique that is custom-tailored for genomic data. Self-GenomeNet leverages reverse-complement sequences and effectively learns short- and long-term dependencies by predicting targets of different lengths. Self-GenomeNet performs better than other self-supervised methods in data-scarce genomic tasks and outperforms standard supervised training with ~10 times fewer labeled training data. Furthermore, the learned representations generalize well to new datasets and tasks. These findings suggest that Self-GenomeNet is well suited for large-scale, unlabeled genomic datasets and could substantially improve the performance of genomic models.© 2023. Springer Nature Limited.

Show Full Text

A self-supervised deep learning method for data-efficient training in genomics.

Researchers

Journal

Modalities

Models

Abstract

Protein structure prediction in the deep learning era.

Graph-DTI: A New Model for Drug-Target Interaction Prediction Based on Heterogenous Network Graph Embedding.

Machine learning on multiple epigenetic features reveals H3K27Ac as a driver of gene expression prediction across patients with glioblastoma.

graphLambda: Fusion Graph Neural Networks for Binding Affinity Prediction.

MLACNN: an attention mechanism-based CNN architecture for predicting genome-wide DNA methylation.

Autosurv: interpretable deep learning framework for cancer survival analysis incorporating clinical and multi-omics data.

Leave a Reply Cancel reply

Researchers

Journal

Modalities

Models

Abstract

Similar Posts

Leave a Reply Cancel reply