ORION: An agentic reasoning construct for the analysis of complex human immune profiling

Monica Dayao
Kenny Kim
Bernard Khor
Aaron Jaech
Bas van Opheusden
Aaron Bodansky
Joseph L. DeRisi

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

The capacity to generate high-dimensional biological datasets has outpaced the ability to interpret them. Technologies such as phage immunoprecipitation and sequencing (PhIP-seq) enable proteome-scale profiling of antibody repertoires, but interpreting thousands of enriched peptides into mechanistic hypotheses remains a labor-intensive bottleneck requiring expert synthesis of statistics, literature, and domain knowledge. Here we describe ORION (Omics Reasoning & Interpretation Orchestrator), a multi-agent framework that uses reasoning-capable large language models to perform end-to-end analysis of complex immune profiling data. ORION integrates statistical analysis, machine learning, and automated literature review into a single structured workflow, producing results that are reproducible and fully traceable. Applied to a published PhIP-seq dataset from autoimmune polyendocrine syndrome type 1 (APS-1), ORION recovered the canonical autoantibody signature in approximately two hours, closely recapitulating an analysis that originally required one to two months of manual effort. To test hypothesis-generation capacity on previously unseen data, we applied ORION to a novel PhIP-seq dataset from individuals with Down syndrome, for which no proteome-wide autoantibody reference exists. ORION distinguished disease from control samples with high accuracy, prioritized candidate autoantibody targets, and organized them into biologically coherent groups spanning immune, gut, and neuronal programs, generating testable hypotheses for experimental follow-up. These results demonstrate that agentic AI systems can compress the analysis of complex immune profiling data from weeks to hours, allowing scientists to redirect their time toward the fundamental biology.

Version published to 10.64898/2026.04.13.718286 on bioRxiv
Apr 16, 2026

Pipette: Encoding scientific literature into an executable Skill Graph for multi-agent bioinformatics

This article has 2 authors:
1. Chirag Gupta
2. Ananya Sharma
This article has no evaluationsLatest version Apr 12, 2026
MechAInistic: An LLM-guided Multi-Agent System for Reasoning over Genome-Scale Constraint-Based Metabolic Models

This article has 7 authors:
1. Josh Loecker
2. Narayna Puraja
3. William Bryant
4. Bhanwar Lal Puniya
5. Prakash Packrisamy
6. Ahmed Abdeen Hamed
7. Tomáš Helikar
This article has no evaluationsLatest version May 13, 2026
GeneBench: Assessing AI Agents for Multi-Stage Inference Problems in Genomics and Quantitative Biology

This article has 2 authors:
1. Jeremy Li
2. Andrew Ho
This article has no evaluationsLatest version Apr 23, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Pipette: Encoding scientific literature into an executable Skill Graph for multi-agent bioinformatics

MechAInistic: An LLM-guided Multi-Agent System for Reasoning over Genome-Scale Constraint-Based Metabolic Models

GeneBench: Assessing AI Agents for Multi-Stage Inference Problems in Genomics and Quantitative Biology