Reproducible and shareable bioinformatics pipelines from natural-language prompts

Hyeon-Min Kim
Hwayeon Jeong
Abyot Melkamu Mekonnen
Yeongjun Kim
Youngchul Oh
Heetak Lee
Cheulhee Jung
Jeongbin Park

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Large language models (LLMs) are increasingly used to generate bioinformatics pipelines and to carry out analyses from natural-language prompts. However, the resulting analyses are often difficult to reproduce across sessions, owing to the non-deterministic nature of LLM-driven conversations and heterogeneity of local execution environments, and cannot run on remote high-performance computing (HPC) servers or be shared and reused. We present Autopipe, a platform that guides any Model Context Protocol (MCP) - compatible LLM to produce, execute, and publish source-preserved, re-executable containerized pipelines. Autopipe enables users to execute bioinformatics pipelines on any on-premises remote servers - supported by comprehensive setup documentation aimed at researchers without prior server-administration experience - and to visualize results through an extensible web-based viewer. The Autopipe platform comprises four components: a desktop application with an embedded MCP server for pipeline management and remote execution, an online registry for pipeline and plugin discovery, a web-based result viewer, and a CLI tool for customizing viewer plugins. Autopipe turns conversational analysis into re-executable and shareable workflows. Autopipe is freely available at https://autopipe.org/ .

Version published to 10.64898/2026.05.28.719125 on bioRxiv
Jun 1, 2026

Slivka and Slivka-bio: a lightweight framework for presenting executables as web services and its application in bioinformatics

This article has 6 authors:
1. Mateusz Warowny
2. Thomas Down
3. Stuart A. MacGowan
4. Kiran Mukhyala
5. Geoffrey J. Barton
6. James B. Procter
This article has no evaluationsLatest version May 27, 2026
PromptBio-Bench: Benchmarking LLM-based Bioinformatics Agents for End-to-End Data Analysis

This article has 10 authors:
1. Wenbin Guo
2. Minzhe Zhang
3. Bowei Han
4. Youjia Ma
5. Yang Leng
6. Shishir Hebbar
7. Xiaoyuan Zhou
8. Wenhao Gu
9. Xiao Yang
10. Shashi Dhar
This article has no evaluationsLatest version May 8, 2026
REBEL, Reproducible Environment Builder for Explicit Library resolution

This article has 7 authors:
1. Eliseo Martelli
2. Maria Luisa Ratto
3. Beatrice Nuvolari
4. Maddalena Arigoni
5. Jianli Tao
6. Francesco Maria Antonio Micocci
7. Luca Alessandri
This article has no evaluationsLatest version Apr 7, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Slivka and Slivka-bio: a lightweight framework for presenting executables as web services and its application in bioinformatics

PromptBio-Bench: Benchmarking LLM-based Bioinformatics Agents for End-to-End Data Analysis

REBEL, Reproducible Environment Builder for Explicit Library resolution