|

Multiple Sequence Alignment based on deep Q network with negative feedback policy.

Researchers

Journal

Modalities

Models

Abstract

Multiple Sequence Alignment (MSA) is an essential procedure in the sequence analysis of biological macromolecules, which can obtain the potential information between multiple sequences, such as functional and structural information. At present, the main challenge of MSA is an NP-complete problem; the algorithm’s complexity increases exponentially with the increase of the number of sequences. Some methods are constantly approaching the results towards the optimal ratio and easy to fall into the local optimization, so the accuracy of these methods is still greatly improved.Here, we propose a new method based on deep reinforcement learning (DRL) for MSA. Specifically, inspired by biofeedback, we leverage the Negative Feedback Policy (NFP) to enhance the performance and accelerate the convergence of the model. Furthermore, we developed a new profile algorithm to compute the sequence from aligned sequences for the next profile-sequence alignment to facilitate the experiment.Compared to six state-of-the-art methods, three different genetic algorithms, Q-learning, ClustalW, and MAFFT, our method exceeds these methods in terms of Sum-of-Pairs (SP) score and Column Score(CS) scores on most datasets in which the increased range of SP score is from 2 to 1056.Extensive experiments based on several datasets validate the effectiveness of our method for achieving a better alignment, and the results have higher accuracy and stability. The source code can be found at https://github.com/MrZhang176/DNPMSA.Copyright © 2022 Elsevier Ltd. All rights reserved.

Similar Posts

Leave a Reply

Your email address will not be published. Required fields are marked *