Epistemic Field Theory: Predicting Hallucination in Large Language Models via Multi-Model Consensus

Abstract

Large language models exhibit hallucination—generating confident but incorrect outputs—at rates that undermine their reliability in high-stakes applications. We introduce Epistemic Field Theory (EFT), a formal framework that predicts hallucination probability from multi-model consensus. EFT defines a consensus field σ ∈ [0, 1] over query space and derives the hallucination predictor P(H) = (1 − σ) · η, where η is a model-specific noise coefficient. We establish theoretical conditions under which consensus bounds error probability and prove that independent model errors yield superlinear consensus-reliability scaling. Empirical validation across 13,728 responses from four models in three domains confirms the core prediction: consensus and hallucination correlate at r = −0.38 (p < 0.001), with hallucination rates dropping from 51.9% (σ < 0.2) to 5.9% (σ = 1.0). The framework provides a principled, model-agnostic mechanism for uncertainty-aware decision gating in automated pipelines.
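The predictor in the abstract, P(H) = (1 − σ) · η, can be sketched with a simple exact-match proxy for the consensus field: σ is taken as the fraction of models agreeing on the modal answer. This is an illustrative assumption; the paper's actual construction of σ over query space may differ, and `eta` here is just a supplied per-model noise coefficient.

```python
from collections import Counter

def consensus(answers):
    """Consensus proxy sigma in [0, 1]: fraction of models that
    agree on the modal (most common) answer. A simplified,
    exact-match stand-in for the EFT consensus field."""
    counts = Counter(answers)
    return counts.most_common(1)[0][1] / len(answers)

def hallucination_probability(answers, eta):
    """EFT hallucination predictor P(H) = (1 - sigma) * eta,
    where eta is a model-specific noise coefficient."""
    sigma = consensus(answers)
    return (1.0 - sigma) * eta

# Four models answer the same query; three agree.
answers = ["Paris", "Paris", "Paris", "Lyon"]
print(consensus(answers))                       # 0.75
print(hallucination_probability(answers, 0.8))  # 0.2
```

Full agreement (σ = 1.0) drives the predicted hallucination probability to zero regardless of η, matching the abstract's observation that hallucination rates fall sharply as consensus approaches 1.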
