OncoGPT: A Modular AI Assistant Orchestrating LLMs in Molecular Oncology
Abstract
General-purpose large language models (LLMs) show promise for biomedical reasoning but remain ill-suited to regulated clinical workflows: they hallucinate, rely on opaque sources, and are difficult to audit—limitations incompatible with validated molecular reporting pipelines. A common response is to train or host domain-specific LLMs, yet this requires substantial data, infrastructure, and time. We present OncoGPT, a modular, provider-agnostic orchestration layer that enables the safe and auditable use of off-the-shelf LLMs in molecular oncology with minimal integration cost. A pluggable ModelSelector routes each query to on-premise or API models based on declarative capability and cost profiles, avoiding vendor lock-in and enabling model swaps by configuration rather than code. A hierarchical ContextBuilder assembles task-specific information so that outputs prioritize content from the injected context (e.g., report sections and linked references), with optional fallback to general biomedical knowledge when needed. Evaluated on 19 representative clinical prompts derived from real-world oncology reports, automatic model selection with context achieved expert acceptance across all prompts while reducing inference cost by an order of magnitude; by contrast, a fixed high-end model produced higher cost and lower expert-rated quality. These results demonstrate that a context-first, plug-and-play orchestration approach can operationalize general LLMs for traceable, cost-efficient support in precision oncology workflows—without training new domain-specific models.
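The abstract names two components, a ModelSelector driven by declarative capability and cost profiles and a hierarchical ContextBuilder, without showing their interfaces. The sketch below is only an illustration of how such routing and context assembly could look; the class names ModelSelector and ContextBuilder come from the abstract, but every field name (capabilities, cost_per_1k_tokens, report_sections, and so on) and the cheapest-eligible-model scoring rule are assumptions for illustration, not the authors' implementation.

```python
"""Minimal sketch, assuming hypothetical profile fields and selection logic.
Not the OncoGPT implementation; only the component names are from the paper."""
from dataclasses import dataclass


@dataclass
class ModelProfile:
    # Declarative description of an on-premise or API-hosted model.
    name: str
    provider: str                      # e.g. "on-prem", "openai", "anthropic"
    capabilities: set[str]             # e.g. {"summarization", "variant-interpretation"}
    cost_per_1k_tokens: float          # relative cost used for ranking


@dataclass
class Query:
    text: str
    required_capabilities: set[str]
    context: str = ""                  # filled in by the ContextBuilder


class ModelSelector:
    """Routes each query to the cheapest profile that covers its capabilities.

    Profiles are plain data, so adding or swapping models is a configuration
    change rather than a code change (the provider-agnostic behavior the
    abstract describes)."""

    def __init__(self, profiles: list[ModelProfile]):
        self.profiles = profiles

    def select(self, query: Query) -> ModelProfile:
        eligible = [
            p for p in self.profiles
            if query.required_capabilities <= p.capabilities
        ]
        if not eligible:
            raise ValueError("no configured model covers the requested capabilities")
        return min(eligible, key=lambda p: p.cost_per_1k_tokens)


class ContextBuilder:
    """Assembles task-specific context hierarchically: report sections first,
    then linked references, so prompts prioritize injected content over the
    model's general biomedical knowledge."""

    def __init__(self, report_sections: dict[str, str], references: list[str]):
        self.report_sections = report_sections
        self.references = references

    def build(self, section_keys: list[str], max_chars: int = 8000) -> str:
        parts = [self.report_sections[k] for k in section_keys if k in self.report_sections]
        parts += self.references
        kept, used = [], 0
        for part in parts:
            if used + len(part) > max_chars:
                break                  # truncate lowest-priority material first
            kept.append(part)
            used += len(part)
        return "\n\n".join(kept)


# Usage: a cheap on-premise model handles routine summarization; a costlier
# API model would only be selected for capabilities the local one lacks.
if __name__ == "__main__":
    selector = ModelSelector([
        ModelProfile("local-8b", "on-prem", {"summarization"}, 0.0),
        ModelProfile("frontier-api", "api", {"summarization", "variant-interpretation"}, 0.03),
    ])
    q = Query("Summarize the tumor mutational burden section.", {"summarization"})
    print(selector.select(q).name)     # -> local-8b
```

Under these assumptions, routing decisions reduce to editing the profile list, which is consistent with the abstract's claim that models can be swapped by configuration rather than code.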