Harnessing protein language model for structure-based discovery of highly efficient and robust PET hydrolases

Read the full article See related articles

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Plastic waste, particularly polyethylene terephthalate (PET), presents significant environmental challenges, driving extensive research into enzymatic biodegradation. However, existing PET hydrolases (PETases) are limited by narrow sequence diversity and suboptimal performance. This study introduces VenusMine, a protein discovery pipeline that integrates protein language models (PLMs) with a representation tree to identify PETases based on structural similarity using sequence information. Using the crystal structure of Is PETase as a template, VenusMine identifies and clusters target proteins. Candidates are further screened using PLM-based assessments of solubility and thermostability, leading to the selection of 34 proteins for biochemical validation. Results reveal that 14 candidates exhibit PET degradation activity across 30–60 °C. Notably, a PET hydrolase from Kibdelosporangium banguiense ( Kb PETase) demonstrates a melting temperature (T m ) 32 °C higher than Is PETase and exhibits the highest PET degradation activity within 30 – 65 °C among wild-type PETases. Kb PETase also surpasses FastPETase and LCC in catalytic efficiency. X-ray crystallography and molecular dynamics simulations show that Kb PETase possesses a conserved catalytic domain and enhanced intramolecular interactions, underpinning its improved functionality and thermostability. This work demonstrates a novel deep learning approach for discovering natural PETases with enhanced properties.

Article activity feed