Sequence-Based Generative AI-Guided Design of Versatile Tryptophan Synthases

Read the full article See related articles

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Enzymes offer unparalleled selectivity and sustainability for chemical synthesis, yet their widespread industrial application is often hindered by the slow and uncertain process of discovering and optimizing suitable biocatalysts. While directed evolution remains the gold standard for enzyme optimization, its success hinges on the availability of a starting enzyme with measurable activity, a persistent bottleneck for many desired functions. Designing libraries likely to contain such functional starting points remains a major challenge. In this work, we use the GenSLM protein language model (PLM) along with a series of filters to generate novel sequences of the β -subunit of tryptophan synthase (TrpB) that express in Escherichia coli , are stable, and are catalytically active in the absence of a TrpA partner. Many generated TrpBs also demonstrated significant substrate promiscuity, accepting non-canonical substrates typically inaccessible to natural TrpBs. Remarkably, several outperformed both natural and laboratory-optimized TrpBs on native and non-canonical substrates. Comparative analysis of the most active and promiscuous generated TrpB and its closest natural homolog confirmed that this enhanced functional versatility does not stem from the natural enzyme, highlighting the creative potential of generative models. Our results demonstrate that the model can generate enzymes which not only preserve natural structure and function but also acquire non-natural properties, establishing PLMs as powerful tools for biocatalyst discovery and engineering, with the potential in some cases to bypass further optimization.

Article activity feed