Decoding the RNA binding systems by UltraGen
Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
RNA plays multifaceted roles in catalytic reactions and gene regulation. The sequence-encoded binding language across diverse RNA-target interactomes is high-dimensional and complex. Here, we introduce UltraGen, an RNA language model designed to capture RNA binding properties. Utilizing fine-grained self-learning, UltraGen identifies RNA aptamers for a wide range of target sizes, including small molecules, proteins, cells, and tissues. Additionally, UltraGen discerns tissue specificity for millions of RNA species across 22 human organs based on their 3’-UTR sequences, predicts the tropism of human-pathogenic RNA viruses, and characterizes SARS-CoV-2 replicase RNA binding at single-base resolution.