BioLab: End-to-End Autonomous Life Sciences Research with Multi-Agents System Integrating Biological Foundation Models
Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Scientific discovery in the life sciences remains hindered by fragmented workflows, narrow-scope computational models, and inefficient links between in silico prediction and wet-lab validation. We present BioLab, a multi-agent system that integrates domain-specialized foundation models to automate end-to-end biological research. BioLab comprises eight collaborating agents, including a Planner, Reasoner, and Critic, orchestrated through a Memory Agent that enables iterative refinement via retrieval-augmented generation and a suite of 219 computational xBio-Tools spanning five biological scales (DNA, RNA, protein, cell, and chemical). These tools are built on the xTrimo Universe, a collection of 104 models derived from six foundation models (xTrimoChem, Protein, RNA, DNA, Cell, and Text), the majority of which achieve state-of-the-art (91–100% SOTA ratios) on domain benchmarks. Across standard reasoning tasks (PubMedQA, MMLU-Pro/Biology, GPQA-diamond), BioLab consistently outperformed leading large language models, including GPT-4o, Gemini-2.5, and DeepSeek-R1. Beyond benchmarks, BioLab autonomously executed a fully computational pipeline for de novo macrophage-targeting antibody design, progressing from target mining to multi-objective antibody optimization, where molecular dynamics simulations revealed structural mechanisms underlying enhanced affinity of optimized variants. Closing the computational-experimental loop, BioLab designed optimized antibodies (Pem-MOO-1, Pem-MOO-2) that achieved IC50 values of 0.01–0.016 nM, markedly surpassing the parental Pembrolizumab (0.027 nM) for PD-1. Functional assays confirmed enhanced pathway blockade and improved multi-parameter performance profiles. Together, these results establish BioLab as a generalizable framework for AI-native scientific discovery, demonstrating how multi-agent systems coupled with foundation models can autonomously generate, execute, and experimentally validate novel biological hypotheses.