Decoding universal principles of codon-mediated regulation of gene expression

Read the full article See related articles

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Codon usage determines gene expression levels, yet its universal principles remain elusive. Here, we developed a regression-based model to derive “codon weights” from transcriptomic data, enabling improved prediction of mRNA and protein abundance across diverse taxa, including plants, mammals, insects, and microbes. Ribosome profiling (Ribo-seq) data analysis revealed that these codon weights correlate with Ribo-seq-weighted cumulative codon frequencies specifically in ribosome-unoccupied regions, rather than at stalling sites, across all seven model species. Experimental validation using species-specific optimization confirmed that our method effectively modulates gene expression in Escherichia coli and terrestrial plants. These findings demonstrate that species-specific environments for gene expression are encoded in codon weights, which can be deduced through a universal, species-independent framework, providing a new foundation for synthetic biology.

Article activity feed