The efficiency and accuracy of Artificial Intelligence in conducting systematic reviews: A single case analysis
Abstract
Objectives
Artificial Intelligence (AI) tools present an opportunity to expedite the typically lengthy process of systematic reviews and meta-analyses, but more evidence is required on their performance in practice. This paper examined the use of ASReview and ChatGPT for screening, data extraction, and quality ratings against a traditional systematic review, exploring their efficiency, the accuracy of the information they produce, and the consistency of their judgements compared with human reviewers.
Methods
Three screening simulations were conducted using ASReview with different amounts of training data (1, 3, or 5 irrelevant/relevant records). A standardised set of prompts was developed and provided to ChatGPT for data extraction, and its outputs were coded for accuracy against the primary studies. For quality ratings, Cochrane's guidance for the Risk of Bias 2.0 (RoB 2) and Risk of Bias in Non-Randomised Studies of Interventions (ROBINS-I) tools was provided to ChatGPT, together with their respective templates.
Results
In all simulations, ASReview prioritised the relevant studies within the first 800 records (approximately 17% of the dataset). When extracting data, ChatGPT sometimes omitted information, though further detail was obtained with additional prompting. Few instances of inaccurate information were observed. Consistency of quality ratings was low to moderate, depending on the domain of bias.
Conclusions
AI tools can reduce the time required for screening by effectively prioritising relevant articles and may also support data extraction by quickly locating relevant information in a manuscript. They should, however, be approached with caution for more complex tasks (e.g., quality ratings). In any case, the use of AI requires careful testing and validation of outputs.
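As a rough illustration of the screening simulations described in the Methods, the sketch below re-runs ASReview's simulation mode with 1, 3, and 5 prior relevant and irrelevant records as training data. This is a minimal sketch, not the authors' actual pipeline: it assumes ASReview LAB v1.x is installed and that the labelled dataset is saved as records.csv (a hypothetical filename); exact flag names may differ between ASReview versions.

```python
import subprocess

# Sketch of the three screening simulations: ASReview's simulation mode is
# run with 1, 3, or 5 prior relevant and irrelevant records as training data.
# Assumes ASReview LAB v1.x (`pip install asreview`) and a labelled dataset
# saved as records.csv (hypothetical filename).
for n_prior in (1, 3, 5):
    subprocess.run(
        [
            "asreview", "simulate", "records.csv",
            "--state_file", f"simulation_{n_prior}.asreview",   # output log
            "--n_prior_included", str(n_prior),   # relevant training records
            "--n_prior_excluded", str(n_prior),   # irrelevant training records
        ],
        check=True,
    )
```

Each run writes a state file recording the order in which records were screened, from which a recall-by-screening-effort curve (e.g., whether all relevant studies fell within the first ~17% of records) could be reconstructed.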