DrugPilot: LLM-based Parameterized Reasoning Agent for Drug Discovery

Wenbin Hu
Kun Li
Zhennan Wu
Shoupeng Wang
Jia Wu
Bo Du
Xiangyu Wang
Shirui Pan

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Large language models (LLMs) integrated with autonomous agents hold significant potential for advancing scientific discovery through automated reasoning and task execution. However, applying LLM agents to drug discovery is still constrained by challenges such as large-scale multimodal data processing, limited task automation, and poor support for domain-specific tools. To overcome these limitations, we introduce DrugPilot, a LLM-based agent system with a parameterized reasoning architecture designed for end-to-end scientific workflows in drug discovery. DrugPilot enables multi-stage research processes by integrating structured tool use with a novel parameterized memory pool. The memory pool converts heterogeneous data from both public sources and user-defined inputs into standardized representations. This design supports efficient multi-turn conversation, reduces information loss during data exchange, and enhances complex scientific decision-making. To support training and benchmarking, we construct a drug instruction dataset covering eight core drug discovery tasks. Under the Berkeley function-calling benchmark, DrugPilot significantly outperforms state-of-the-art agents such as ReAct and LoT, achieving task completion rates of 98.0%, 93.5%, and 64.0% for simple, multi-tool, and multi-turn categories, respectively. These results highlight DrugPilot's strong potential as a generalizable agent framework for automated, interactive, and data-driven reasoning across computational science applications.

Version published to 10.21203/rs.3.rs-7489358/v1 on Research Square
Sep 29, 2025

Large Language Model Agent for Modular TaskExecution in Drug Discovery

This article has 6 authors:
1. Janghoon Ock
2. Radheesh Sharma Meda
3. Srivathsan Badrinarayanan
4. Neha S. Aluru
5. Achuth Chandrasekhar
6. Amir Barati Farimani
This article has no evaluationsLatest version Sep 17, 2025
PRIME: A Multi-Agent Environment for Orchestrating Dynamic Computational Workflows in Protein Engineerings

This article has 9 authors:
1. Yuyang Zhou
2. Jin Su
3. Jiawei Zhang
4. Wangyang Hu
5. Tianli Tao
6. Guanqi Li
7. Xibin Zhou
8. Li Fan
9. Fajie Yuan
This article has no evaluationsLatest version Sep 23, 2025
Large Language Model Agents for Biomedicine: A Comprehensive Review of Methods, Evaluations, Challenges, and Future Directions

This article has 2 authors:
1. Xiaoran Xu
2. Ravi Sankar
This article has no evaluationsLatest version Oct 14, 2025

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Large Language Model Agent for Modular TaskExecution in Drug Discovery

PRIME: A Multi-Agent Environment for Orchestrating Dynamic Computational Workflows in Protein Engineerings

Large Language Model Agents for Biomedicine: A Comprehensive Review of Methods, Evaluations, Challenges, and Future Directions