Massively parallel characterization and deep learning of enhancers in plant genomes
Discuss this preprint
Start a discussion What are Sciety discussions?Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Enhancers coordinate gene expression in response to developmental and environmental cues. Because plant enhancers lack the readily detectable molecular hallmarks of animal enhancers, their systematic functional characterization has yet to be accomplished. Here, we characterize the species- and condition-specific enhancer activity of over 350,000 sequences derived from accessible chromatin regions of the Arabidopsis , tomato, maize, and sorghum genomes. Enabled by the massive scale of the data, we developed plantGREP, a deep learning model that predicts enhancer strength and identifies the underlying functional sequence motifs. We apply plantGREP to evolve strong constitutive as well as species- and condition-specific enhancers, and to locate regions with enhancer activity upstream of developmental genes in crop genomes. These results should facilitate the targeted editing of enhancers in crop genomes and the design of cell-type-specific plant enhancers.