On inputs to deep learning for RNA 3D structure prediction
Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Today, there are several effective deep learning models for predicting the 3D structure of proteins. Building on their success, models have been developed for predicting the 3D structure of non-coding RNAs. Unfortunately, these models are much less accurate than their protein counterparts. In this paper, we highlight differences between protein and RNA structure, and demonstrate methods for deep learning targeted at addressing those differences, with the aim of prompting discussion on these topics. We present an RNA-specific pipeline for generating structural Multiple Sequence Alignments (MSAs). Derived from the structural alignments, we introduce engineered evolutionary features that strongly inform RNA structure. Further, from the crystal structure, we derive structural features describing RNA base pairing. These evolutionary and structural features can be used in loss functions at different stages of training. Finally, we discuss different cropping strategies informed by RNA structure.