Unraveling Rare Codon Bias in Actinomycetota: Lineage-Specific and 5’ Terminal Enrichment Across 1936 Genomes

Read the full article See related articles

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Background Actinomycetota are a diverse phylum of major ecological, medical, and industrial importance, best known for producing antibiotics and other secondary metabolites. While some regulatory mechanisms of secondary metabolism are understood, many remain unresolved. Codon usage bias, the preferential use of synonymous codons, represents one potential layer of regulation, as it is known to influence translation efficiency and timing. The availability of thousands of high-quality genomes now enables codon usage to be examined across this phylum at unprecedented scale. Results We analyzed codon usage across 1936 high- and medium-quality genomes from 11 genera. The most common codon across the dataset was GCC, particularly enriched in Streptomyces albidoflavus . In contrast, TTA was consistently rare and showed variable distribution across genera. AGA was identified as another rare codon with especially strong enrichment at 5′ termini. Both TTA and AGA were enriched in functional categories such as replication, transcription, and secondary metabolism, and were significantly overrepresented in biosynthetic gene clusters, particularly within biosynthetic and regulatory genes. Conclusions These results show that rare codon usage in Actinomycetota reflects both evolutionary history and nonrandom positional enrichment, particularly at 5′ termini, where it may fine-tune translation timing. This positional bias likely represents a conserved mechanism for coordinating gene expression. Beyond providing biological insight, our findings highlight the practical value of codon analysis for synthetic biology, metabolic engineering, and efforts to optimize the expression of biosynthetic gene clusters.

Article activity feed