Genome-Bench: A Scientific Reasoning Benchmark from Real-World Expert Discussions

Abstract

In this short report, we present an automated pipeline tailored to the genomics domain and introduce Genome-Bench, a new benchmark constructed from over a decade of scientific forum discussions on genome engineering. Our pipeline transforms raw forum interactions into a reinforcement learning-friendly multiple-choice question format, yielding 3000+ high-quality question-answer pairs that span foundational biology, experimental troubleshooting, tool usage, and beyond. To our knowledge, this is the first end-to-end pipeline for teaching LLMs to reason from scientific discussions, and it shows promising potential for generalization to scientific domains beyond biology. The dataset is available at https://huggingface.co/datasets/Mingyin0312/Genome-Bench.
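One reason a multiple-choice format suits reinforcement learning is that a model's answer can be scored automatically by exact match against the keyed option. The sketch below illustrates that idea only; it is not the authors' pipeline. The `MCQItem` fields, the `Answer: X` response convention, and the sample question are all assumptions made for illustration and are not taken from the Genome-Bench dataset.

```python
import re
from dataclasses import dataclass
from typing import Dict, Optional


@dataclass
class MCQItem:
    """Hypothetical record layout; the real dataset schema may differ."""
    question: str
    options: Dict[str, str]  # option letter -> option text
    answer: str              # letter of the keyed option


def extract_choice(response: str, item: MCQItem) -> Optional[str]:
    """Return the last 'Answer: X' letter in the response, if it is a valid option."""
    matches = re.findall(r"Answer:\s*([A-Z])\b", response)
    if not matches:
        return None
    choice = matches[-1]
    return choice if choice in item.options else None


def reward(item: MCQItem, response: str) -> float:
    """Binary reward: 1.0 for the keyed option, 0.0 for anything else."""
    return 1.0 if extract_choice(response, item) == item.answer else 0.0


# Made-up example item, written for this sketch (not from Genome-Bench).
item = MCQItem(
    question="Which nuclease is most commonly paired with a guide RNA in CRISPR editing?",
    options={"A": "Cas9", "B": "Taq polymerase", "C": "T4 DNA ligase", "D": "RNase A"},
    answer="A",
)
```

A response that ends with `Answer: A` would earn a reward of 1.0 under this scheme, while a wrong letter or a response with no parsable answer would earn 0.0.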
