Widespread DNA off-targeting confounds studies of RNA chromatin occupancy
Discuss this preprint
Start a discussion What are Sciety discussions?Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
The importance of long noncoding RNA (lncRNA) functions is recognized across biological systems, but their modes of action remain poorly understood. One mechanism proposed to be particularly common is gene expression regulation via recruitment to specific genomic regions. Several high-throughput sequencing methods were developed for studying the genome-wide chromatin occupancy of lncRNAs, including ChIRP-seq, CHART-seq, and RAP-seq. These methods utilize biotin-labeled probes targeting the RNA of interest to isolate and recover the chromatin associated with it. Many of the datasets obtained with these methods contain thousands of binding sites, which appears to be in contradiction with the low abundance of the interrogated lncRNAs. We studied the chromatin interactome of NESPR lncRNA in cells with varying levels of endogenous expression and then performed a meta-analysis using dozens of RNA-chromatin interaction datasets in human and mouse cells. We demonstrate that thousands of regions reported to bind lncRNAs most likely arise from the spurious recovery of DNA elements, where the ends of the recovered DNA fragments exhibit partial complementarity with the probes used for the pulldown. In addition, crucial controls were rarely used in studies profiling RNA-chromatin interactions. Therefore, most chromatin regions reported as bound by trans -acting RNAs in recent studies in mammalian cells appear to be technical artifacts. We provide suggestions for assessing the quality of RNA-chromatin datasets and their improvement.