GraphPro: An interpretable graph neural network-based model for identifying promoters in multiple species.

Researchers

Journal

Modalities

Models

convolutional neural networks fully connected neural network graph neural networks

Abstract

Promoters are DNA sequences that bind with RNA polymerase to initiate transcription, regulating this process through interactions with transcription factors. Accurate identification of promoters is crucial for understanding gene expression regulation mechanisms and developing therapeutic approaches for various diseases. However, experimental techniques for promoter identification are often expensive, time-consuming, and inefficient, necessitating the development of accurate and efficient computational models for this task. Enhancing the model’s ability to recognize promoters across multiple species and improving its interpretability pose significant challenges. In this study, we introduce a novel interpretable model based on graph neural networks, named GraphPro, for multi-species promoter identification. Initially, we encode the sequences using k-tuple nucleotide frequency pattern, dinucleotide physicochemical properties, and dna2vec. Subsequently, we construct two feature extraction modules based on convolutional neural networks and graph neural networks. These modules aim to extract specific motifs from the promoters, learn their dependencies, and capture the underlying structural features of the promoters, providing a more comprehensive representation. Finally, a fully connected neural network predicts whether the input sequence is a promoter. We conducted extensive experiments on promoter datasets from eight species, including Human, Mouse, and Escherichia coli. The experimental results show that the average Sn, Sp, Acc and MCC values of GraphPro are 0.9123, 0.9482, 0.8840 and 0.7984, respectively. Compared with previous promoter identification methods, GraphPro not only achieves better recognition accuracy on multiple species, but also outperforms all previous methods in cross-species prediction ability. Furthermore, by visualizing GraphPro’s decision process and analyzing the sequences matching the transcription factor binding motifs captured by the model, we validate its significant advantages in biological interpretability. The source code for GraphPro is available at https://github.com/liuliwei1980/GraphPro.Copyright © 2024 Elsevier Ltd. All rights reserved.

Show Full Text

GraphPro: An interpretable graph neural network-based model for identifying promoters in multiple species.

Researchers

Journal

Modalities

Models

Abstract

An Open-Source Deep Learning-Based GUI Toolbox For Automated Auditory Brainstem Response Analyses (ABRA).

UPCLASS: a deep learning-based classifier for UniProtKB entry publications.

Protein oligomer modeling guided by predicted inter-chain contacts in CASP14.

High-throughput classification of S. cerevisiae tetrads using deep learning.

scCapsNet-mask: an updated version of scCapsNet with extended applicability in functional analysis related to scRNA-seq data.

CAPE: a deep learning framework with Chaos-Attention net for Promoter Evolution.

Leave a Reply Cancel reply

Researchers

Journal

Modalities

Models

Abstract

Similar Posts

Leave a Reply Cancel reply