Large Language Model for Automated Scientific Hypothesis and Evidence Analysis


Abstract

The rapid growth of the scientific literature makes it increasingly difficult to identify core scientific hypotheses, experimental designs, and the relationships between them, slowing knowledge discovery. Manually scanning articles for hypothesis-related content is inefficient, and although Large Language Models (LLMs) show promise for processing literature, they face well-known challenges, particularly in specialized scientific domains, with precision (i.e., hallucination) and structured reasoning. To address this, we introduce the Prompt-Enhanced LLM for Scientific Hypothesis Analysis (PEL-SHA) framework, which uses carefully designed multi-stage prompt engineering to enable LLMs to automatically identify, classify, and reason about scientific hypotheses, supporting evidence, and methods from paper abstracts. The framework is a sequential pipeline of three prompts: Hypothesis Identification, Evidence and Method Classification, and Potential Research Direction Reasoning. To rigorously evaluate PEL-SHA, we introduce SciHypo-500, a new benchmark dataset of 500 expert-annotated scientific abstracts. Extensive experiments against the best-performing LLMs show that PEL-SHA is consistently superior on all evaluation tasks.
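The sequential pipeline described above can be sketched as chained prompt stages, where each stage's output is injected into the next stage's prompt. This is a minimal illustrative sketch: the prompt wording, the stage names as dictionary keys, and the `llm` callable interface are assumptions for exposition, not the authors' actual implementation.

```python
# Hypothetical sketch of a three-stage prompt pipeline in the style of
# PEL-SHA. The prompt texts below are placeholders, not the paper's prompts.

STAGE_PROMPTS = {
    "hypothesis_identification": (
        "Identify the core scientific hypothesis stated in this abstract:\n"
        "{abstract}"
    ),
    "evidence_method_classification": (
        "Classify the supporting evidence and methods for the hypothesis "
        "below.\nHypothesis: {hypothesis}\nAbstract: {abstract}"
    ),
    "research_direction_reasoning": (
        "Suggest potential research directions given the hypothesis and its "
        "evidence.\nHypothesis: {hypothesis}\nEvidence: {evidence}"
    ),
}


def analyze_abstract(abstract: str, llm) -> dict:
    """Run the three stages sequentially; `llm` is any callable that maps
    a prompt string to a completion string (model choice is abstracted)."""
    hypothesis = llm(
        STAGE_PROMPTS["hypothesis_identification"].format(abstract=abstract)
    )
    evidence = llm(
        STAGE_PROMPTS["evidence_method_classification"].format(
            hypothesis=hypothesis, abstract=abstract
        )
    )
    directions = llm(
        STAGE_PROMPTS["research_direction_reasoning"].format(
            hypothesis=hypothesis, evidence=evidence
        )
    )
    return {
        "hypothesis": hypothesis,
        "evidence": evidence,
        "directions": directions,
    }
```

Structuring the stages as separate prompts (rather than one monolithic prompt) lets each step's output be inspected and constrained individually, which is one common way such pipelines try to reduce hallucination.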