HydraRNA: a hybrid-architecture-based full-length RNA language model
Abstract
RNA, an essential component of the central dogma of molecular biology, plays versatile roles in all cellular processes. RNA large language models (LLMs) are emerging as powerful tools in RNA research for deciphering its intricate network of function and regulation. However, previous RNA LLMs were based on the transformer architecture and pre-trained on short segments of non-coding RNAs, which limits their general usability. Here we present the first full-length RNA foundation model, HydraRNA, which is built on a hybrid architecture combining a bidirectional state space model with a multi-head attention mechanism, and is pre-trained on a large corpus of both protein-coding and non-coding RNAs. Despite being pre-trained with the fewest parameters and the least GPU resources, HydraRNA learns better RNA representations and outperforms existing foundation models on a variety of downstream tasks, including RNA classification and the prediction of RNA secondary structure, RBP binding sites, mRNA stability and translation efficiency. Furthermore, HydraRNA can accurately predict the effects of mutations and estimate the relative contributions of different mRNA regions to RNA stability and translation. We anticipate that HydraRNA will enable dissection of the diverse properties of RNA, accelerating research on RNA regulation and facilitating the optimal design of RNA therapeutics.
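To make the hybrid design concrete, the toy NumPy sketch below pairs a bidirectional linear state-space scan (forward and reversed recurrences summed, so each position sees both upstream and downstream sequence context) with a single-head self-attention sublayer. All function names, the scalar recurrence parameters, and the residual layout are illustrative assumptions for exposition; they are not the actual HydraRNA implementation, which the abstract does not specify.

```python
import numpy as np

def ssm_scan(x, a=0.9, b=0.1):
    # Toy linear state-space recurrence: h_t = a * h_{t-1} + b * x_t.
    # 'a' and 'b' are illustrative scalars, not learned HydraRNA parameters.
    h = np.zeros_like(x)
    state = np.zeros(x.shape[1])
    for t in range(x.shape[0]):
        state = a * state + b * x[t]
        h[t] = state
    return h

def bidirectional_ssm(x):
    # Sum a forward scan with a scan over the reversed sequence, so every
    # position aggregates context from both directions.
    fwd = ssm_scan(x)
    bwd = ssm_scan(x[::-1])[::-1]
    return fwd + bwd

def attention(x):
    # Single-head scaled dot-product self-attention with identity
    # query/key/value projections (a deliberate simplification).
    scores = x @ x.T / np.sqrt(x.shape[1])
    w = np.exp(scores - scores.max(axis=1, keepdims=True))
    w /= w.sum(axis=1, keepdims=True)
    return w @ x

def hybrid_block(x):
    # One hybrid layer: SSM sublayer then attention sublayer,
    # each wrapped in a residual connection.
    x = x + bidirectional_ssm(x)
    x = x + attention(x)
    return x

tokens = np.random.default_rng(0).normal(size=(8, 4))  # 8 positions, dim 4
out = hybrid_block(tokens)
print(out.shape)  # (8, 4)
```

The SSM sublayer gives linear-time mixing over arbitrarily long sequences, while the attention sublayer adds content-based global interactions; stacking such blocks is one plausible reading of the hybrid architecture the abstract describes.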