When does additional information improve accuracy of RNA secondary structure prediction?

Logan Rose
Luis Sanchez Giraldo
Duc Nguyen
Matthew Wheeler
David Murrugarra

Read the full article

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

The secondary structure of an RNA sequence plays an important role in determining its function, and accurate prediction of the structure is still a major goal in computational biology. Improvements in the prediction accuracy of the secondary structure can be achieved via auxiliary information. In this paper, we study features based on suboptimal formations competing with the minimum-free energy formation and investigate their role in determining the improvement of accuracy via auxiliary information, which we call directability. Here, we introduce a similarity measure among competing substructures called profiles. Then, we present an n -dimensional representation of the profiles which allows the use of topological data analysis (i.e., persistence landscapes) to obtain different metrics that represent topological features. Then, we built random forest classifiers using these novel features. We show how the similarity feature is more important for classifiers trained on sequences with similar structures while the topological features are more important for classifiers trained on sequences with dissimilar structures. We performed extensive testing on two sets of RNA sequences where we studied the sensitivity of the classification accuracy and their feature importance.

Version published to 10.1101/2025.03.18.643972v1 on bioRxiv
Mar 19, 2025

RNAprecis: Prediction of full-detail RNA conformation from the experimentally best-observed sparse parameters

This article has 10 authors:
1. Henrik Wiechers
2. Christopher J. Williams
3. Benjamin Eltzner
4. Franziska Hoppe
5. Michael G. Prisant
6. Vincent B. Chen
7. Ezra Miller
8. Kanti V. Mardia
9. Jane S. Richardson
10. Stephan F. Huckemann
This article has no evaluationsLatest version Mar 5, 2025
Missense mutation knowledge can decrease prediction inaccuracies on protein secondary structure

This article has 3 authors:
1. Ivan Perez
2. Ulrike Stege
3. Hosna Jabbari
This article has no evaluationsLatest version Mar 10, 2025
High-accuracy protein complex structure modeling based on sequence-derived structure complementarity

This article has 8 authors:
1. Minghua Hou
2. Yuhao Xia
3. Pengcheng Wang
4. Zexin Lv
5. Dongliang Hou
6. Xiaogen Zhou
7. Jianyang Zeng
8. Guijun Zhang
This article has no evaluationsLatest version Mar 29, 2025

Listed in

Abstract

Article activity feed

Related articles

RNAprecis: Prediction of full-detail RNA conformation from the experimentally best-observed sparse parameters

Missense mutation knowledge can decrease prediction inaccuracies on protein secondary structure

High-accuracy protein complex structure modeling based on sequence-derived structure complementarity