OpenEvidence Clinical Question-Answering Platform: Systematic Review of Early Evaluations

Abstract

Background
OpenEvidence answers clinical questions using retrieval-augmented generation over curated sources with explicit citations. The company reports that over 40% of U.S. physicians (~400,000) consult it daily. Although several studies have evaluated the platform, the evidence base remains heterogeneous.

Objective
To systematically review studies evaluating OpenEvidence for clinical question answering and decision support.

Methods
A systematic search of MEDLINE/PubMed, Scopus, Web of Science, and Google Scholar was conducted from database inception to January 2026. Peer-reviewed studies evaluating OpenEvidence in clinical or clinically simulated contexts were included. Study selection, data extraction, and risk-of-bias assessment were performed independently by two reviewers in accordance with PRISMA guidelines.

Results
Eleven studies published between 2024 and 2026 were included in the analysis. OpenEvidence was evaluated as the primary platform in eight studies and as a comparator in three. OpenEvidence demonstrated the ability to generate evidence-supported responses and avoided fabricated citations. Performance was strongest in structured, guideline-based contexts. However, accuracy varied in complex clinical scenarios, and the platform often reinforced rather than altered clinical decisions. Limitations included dependence on the available retrieval context, interpretive errors despite accurate citations, variability across clinical domains, and the continuously updated nature of the system, which limits the generalizability and cross-study comparability of point-in-time evaluations.

Conclusions
The rapid adoption of OpenEvidence among clinicians has outpaced the sparse research available. With only 11 studies, most limited in scope or confined to niche clinical domains, the evidence remains thin relative to the platform's purported use in daily medical practice. Adoption of such platforms demands prospective real-world studies with clinical endpoints and ongoing benchmarking.
