Deep Learning for RNA Secondary Structure Determination: Gauging Generalizability and Broadening the Scope of Traditional Methods

Marcell Szikszai
Ting-Yuan Wang
Ryan Krueger
David H. Mathews
Max Ward
Sharon Aviran

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

The diverse regulatory functions, protein production capacity, and stability of natural and synthetic RNAs are closely tied to their ability to fold into intricate structures. Determining RNA structure is thus fundamental to RNA biology and bioengineering. Among existing approaches to structure determination, computational secondary structure prediction offers a rapid and low-cost strategy and is thus widely used, especially when seeking to identify functional RNA elements in large transcriptomes or screen massive libraries of novel designs. While traditional approaches rely on detailed measurements of folding energetics and/or probabilistic modeling of structural data, recent years have witnessed a surge in deep learning methods, inspired by their tremendous success in protein structure prediction. However, the limited diversity and volume of known RNA structures can impede their ability to accurately predict structures markedly different from the ones they have seen. This is known as the generalization gap and currently poses a major barrier to progress in the field. In this Perspective article, we gauge method generalizability using a new benchmark dataset of structured RNAs we curated from the Protein Data Bank. We also discuss the emergence of deep learning methods for predicting structure probing data and use a new dataset to underscore generalization challenges unique to this domain along with directions for future improvement. Expanding beyond improving predictive accuracy, we review how advances in deep learning have recently enabled scalable and accessible optimization of traditional structure prediction methods and their seamless integration with modern neural networks.

Version published to 10.1101/2025.11.04.686644 on bioRxiv
Nov 7, 2025

Deep Learning Approaches for Accurate RNA 3D Structure Prediction from Primary Sequences

This article has 1 author:
1. Nnaemeka Kingsley Ugwumba
This article has no evaluationsLatest version Jan 29, 2026
Artificial Intelligence–Driven Structural Mining Enables Functional Inference in the Human Dark Proteome

This article has 7 authors:
1. Valentina Carbonari
2. Annamaria Defilippo
3. Ugo Lomoio
4. Caterina Francesca Perri
5. Barbara Puccio
6. Pierangelo Veltri
7. Pietro Hiram Guzzi
This article has no evaluationsLatest version Dec 23, 2025
The Evolution of the AlphaFold Architecture

This article has 1 author:
1. Y.C.B.J. Dissanayaka
This article has no evaluationsLatest version Jan 9, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Deep Learning Approaches for Accurate RNA 3D Structure Prediction from Primary Sequences

Artificial Intelligence–Driven Structural Mining Enables Functional Inference in the Human Dark Proteome

The Evolution of the AlphaFold Architecture