Secondary-Structure-Informed RNA Inverse Design via Relational Graph Neural Networks
Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
RNA inverse design is an essential part of many RNA therapeutic strategies. To date, there have been great advances in computationally driven RNA design. The current machine learning approaches can predict the sequence of an RNA given its 3D structure with acceptable accuracy and at tremendous speed. The design and engineering of RNA regulators such as riboswitches, however, is often more difficult, partly due to their inherent conformational switching abilities. Although recent state-of-the-art models do incorporate information about the multiple structures that a sequence can fold into, there is great room for improvement in modeling structural switching. In this work, a relational geometric graph neural network is proposed that explicitly incorporates alternative structures to predict an RNA sequence. Converting the RNA structure into a geometric graph, the proposed model uses edge types to distinguish between the primary structure, secondary structure, and spatial positioning of the nucleotides in representing structures. The results show higher native sequence recovery rates over those of gRNAde across different test sets (eg. 72% vs. 66%) and a benchmark from the literature (60% vs. 57%). Secondary-structure edge types had a more significant impact on the sequence recovery than the spatial edge types as defined in this work. Overall, these results suggest the need for more complex and case-specific characterization of RNA for successful inverse design.