From Prompt to Pipeline: Large Language Models for Scientific Workflow Development in Bioinformatics

Khairul Alam
Banani Roy

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Scientific Workflow Systems (SWSs) such as Galaxy and Nextflow are essential for scalable, reproducible, and automated bioinformatics analyses. However, developing and understanding scientific workflows remains challenging for many domain scientists due to the complexity of tool/module selection, infrastructure requirements, and limited programming expertise. This study explores whether state-of-the-art Large Language Models (LLMs) such as GPT-4o, Gemini 2.5 Flash, and DeepSeek-V3 can assist in generating accurate, complete, and usable bioinformatics workflows. We evaluate a set of representative workflows covering tasks such as RNA-seq, SNP analysis, and DNA methylation across both Galaxy (graphical) and Nextflow (script-based) platforms. To simulate realistic usage, we adopt a tiered prompting strategy: each workflow is first generated using an instruction-only prompt; if the output is incomplete or incorrect, we escalate to a role-based prompt, and finally to chain-of-thought prompting if needed. The generated workflows are evaluated against community-curated baselines from the Galaxy Training Network (GTN) and nf-core, using criteria including correctness, completeness, tool appropriateness, and executability. Results show that LLMs exhibit strong potential in workflow development. Gemini 2.5 Flash produced the most accurate and user-friendly workflows in Galaxy, while DeepSeek-V3 excelled in Nextflow pipeline generation. GPT-4o performed nicely with structured prompts. Prompting strategy significantly influenced output quality, with rolebased and chain-of-thought prompts enhancing correctness and completeness. Overall, LLMs can reduce the cognitive and technical barriers to workflow development, making SWSs more accessible to novice and expert users. This work highlights the practical utility of LLMs and provides actionable insights for integrating them into real-world bioinformatics workflow design.

Version published to 10.21203/rs.3.rs-7642675/v1 on Research Square
Oct 10, 2025

NSeqVerify: An Easy-to-Use Desktop Suite for Integrated NGS Data Analysis, from Raw Reads to Taxonomic Assignment

This article has 1 author:
1. Roberto Reinosa Fernández
This article has no evaluationsLatest version Nov 3, 2025
2Pipe: It Starts with a Question. Matching You with the Correct Pipeline for MAG Reconstruction

This article has 2 authors:
1. Jeferyd Yepes García
2. Laurent Falquet
This article has no evaluationsLatest version Oct 15, 2025
Large Language Models for Accessible Reporting of Bioinformatics Analyses in Interdisciplinary Contexts

This article has 14 authors:
1. Lijia Yu
2. Daniel Kim
3. Yue Cao
4. Matthew Wei Shun Shu
5. Maya Shen
6. Xiaoqi Liang
7. Jasmine Gu
8. Rojashree Jayakumar
9. Wenze Ding
10. Fei Yang
11. Xumou Zhang
12. Jinman Kim
13. Pengyi Yang
14. Jean Yee Hwa Yang
This article has no evaluationsLatest version Nov 11, 2025

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

NSeqVerify: An Easy-to-Use Desktop Suite for Integrated NGS Data Analysis, from Raw Reads to Taxonomic Assignment

2Pipe: It Starts with a Question. Matching You with the Correct Pipeline for MAG Reconstruction

Large Language Models for Accessible Reporting of Bioinformatics Analyses in Interdisciplinary Contexts