LLM-Assisted Replication as Scientific Infrastructure
Abstract
Large language models (LLMs) are rapidly accelerating scientific production, from literature synthesis to automated analysis. Yet this expansion risks creating a verification gap, in which the volume of scientific claims outpaces the community's capacity to check their reproducibility. We argue that the same LLM capabilities driving scientific output can be redirected toward scalable verification. As a concrete example, we demonstrate how an autonomous LLM-based agent can reproduce the core statistical results of a classic sociology paper while identifying underspecified methodological details. Automated replication does not adjudicate scientific truth; rather, it localizes discrepancies and documentation gaps, lowering the cost of computational reproducibility checks. We therefore propose embedding LLM-assisted replication across the research lifecycle: pre-submission quality checks, journal-integrated verification, post-publication audits, and forensic reconstruction of legacy studies. To prevent misuse and preserve trust, we call for transparent standards and community governance. If institutionalized responsibly, AI can serve not only to generate science, but to scale its self-correction.
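To illustrate the "localize discrepancies, don't adjudicate truth" idea, the sketch below is a minimal discrepancy-localization harness of our own devising, not the authors' system: it compares statistics as reported in a publication against values recomputed by a replication agent and flags per-statistic mismatches beyond a tolerance. All names and numbers (`reported`, `reproduced`, `TOLERANCE`) are hypothetical placeholders for the agent's actual inputs and outputs.

```python
# Minimal sketch (illustrative, not the paper's implementation): after an
# LLM agent has re-executed a paper's analysis, compare the reported
# statistics with the reproduced values and localize discrepancies rather
# than issue a single pass/fail verdict. All values below are hypothetical.

TOLERANCE = 0.01  # relative tolerance for "matches as reported"

# Statistics as printed in the publication (hypothetical values).
reported = {"coef_education": 0.42, "coef_income": 0.18, "n_obs": 1204}

# Statistics recomputed by the replication agent (hypothetical values).
reproduced = {"coef_education": 0.42, "coef_income": 0.21, "n_obs": 1204}

def localize_discrepancies(reported, reproduced, tol=TOLERANCE):
    """Return a per-statistic report, flagging values outside tolerance."""
    report = []
    for name, ref in reported.items():
        rep = reproduced.get(name)
        if rep is None:
            report.append((name, "missing in reproduction"))
        elif abs(rep - ref) > tol * max(abs(ref), 1e-12):
            report.append((name, f"mismatch: reported {ref}, reproduced {rep}"))
        else:
            report.append((name, "matches within tolerance"))
    return report

for name, status in localize_discrepancies(reported, reproduced):
    print(f"{name}: {status}")
```

In a full pipeline, `reproduced` would come from the agent's re-execution of the original analysis code or its reconstruction from the methods section; the key design choice is that the output is an itemized diff a human can inspect, which is what lowers the cost of the reproducibility checks proposed above.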