Ovo, an Open-Source Ecosystem for De Novo Protein Design
Abstract
The protein design field is rapidly advancing, with new models and pipelines frequently emerging for designing de novo proteins with tailored properties and functions not found in nature. However, the current tool landscape is fragmented: tools are hard to install and deploy, and integrating them into end-to-end, scalable pipelines requires significant computational expertise. A particular challenge is managing the many sequences, structures, and metrics needed for downstream testing and for retrospective analysis of input parameters. To address this need, we introduce Ovo, an open-source de novo protein design ecosystem that consolidates models, workflows, data management, and interactive visualization into a scalable, infrastructure-agnostic platform. Ovo features Nextflow-based workflow orchestration, a storage layer, and both command-line and graphical interfaces that democratize scaffold design, binder design and diversification, and validation workflows. Ovo's novel ProteinQC module computes comprehensive sequence and structure descriptors, contextualizing designs against reference sets. Ovo plugins let the community add new workflows and user interfaces, accelerating adoption of emerging methods and facilitating community-driven benchmarking. Ovo lowers engineering barriers and demystifies the design process, allowing both experts and non-technical users to design proteins at scale. With community-driven development, Ovo can accelerate de novo protein design and advance discovery in therapeutics and biotechnology.