Automating Abstract Screening in Research Synthesis using Large Language Models
Abstract
Screening abstracts is a crucial yet labor-intensive step in research synthesis projects such as systematic reviews and meta-analyses. Large Language Models (LLMs) offer an opportunity to streamline and automate this process. However, there is currently little experience or practical insight into how such automated workflows can be implemented in research practice. In this article, we illustrate an LLM-based abstract screening workflow using a set of human-rated abstracts from a recent meta-analysis. We describe how we developed and evaluated different prompting strategies and structured output formats, compared the performance of multiple LLMs, quantified model uncertainty, and automated the entire workflow within the R environment. We also provide R scripts and implementation guidance to support psychological researchers in adopting LLM-based workflows for research synthesis. Our comparisons show how different types of LLMs vary in accuracy relative to human raters, and how prompting strategies and hyperparameter settings affect model performance and uncertainty. We demonstrate that LLM-assisted screening can substantially reduce the time and cost of review preparation while maintaining accuracy comparable to human raters. At the same time, we emphasize that this work represents an initial step, and that ongoing refinement and validation are essential as LLM technologies and their applications continue to evolve rapidly.
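To illustrate the kind of workflow the abstract describes, the following is a minimal sketch of an LLM-based screening call in R. It is not the authors' implementation: the function name `screen_abstract`, the prompt wording, and the JSON output fields (`include`, `confidence`) are hypothetical, and it assumes access to an OpenAI-compatible chat completions API via the httr2 package.

```r
# Minimal sketch (assumptions: httr2 installed, OPENAI_API_KEY set,
# OpenAI-compatible endpoint; function and output schema are hypothetical).
library(httr2)

screen_abstract <- function(abstract_text,
                            model = "gpt-4o-mini",
                            api_key = Sys.getenv("OPENAI_API_KEY")) {
  # Prompt asking for a structured (JSON) screening decision.
  prompt <- paste(
    "You are screening abstracts for a meta-analysis.",
    "Reply only with a JSON object: {\"include\": true or false, \"confidence\": 0-1}.",
    "Abstract:", abstract_text
  )

  resp <- request("https://api.openai.com/v1/chat/completions") |>
    req_auth_bearer_token(api_key) |>
    req_body_json(list(
      model = model,
      temperature = 0,  # lower temperature to reduce run-to-run variability
      messages = list(list(role = "user", content = prompt))
    )) |>
    req_perform()

  # Return the raw model reply; parsing and validation would follow in practice.
  resp_body_json(resp)$choices[[1]]$message$content
}

# Example usage: screen_abstract("This randomized controlled trial examined ...")
```

In a full workflow, such a function would be applied over all retrieved abstracts (e.g., with `lapply` or `purrr::map`), the structured replies parsed, and the resulting decisions compared against human ratings to estimate accuracy and uncertainty, as the article describes.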