Semantic Thermodynamics of Transformer Architectures: A Framework for Understanding Hallucination Constraints
Abstract
We develop \emph{Semantic Thermodynamics}, an information-theoretic framework for analyzing hallucinations in transformer systems under finite resources. The central object is mutual information between latent facts and model outputs, together with Fano-style lower bounds on semantic error. We clarify the stochastic assumptions required for non-degenerate information measures, distinguish true data-generating uncertainty from model-implied uncertainty, and replace unsupported hard capacity formulas with explicit capacity surrogates tied to precision, context budget, and effective representational rank. Under standard identification assumptions, we derive a baseline bound \begin{equation*} H_R \geq \max\left\{0,\,1-\frac{I(F;Y)+1}{\log M}\right\}, \end{equation*} where $H_R$ is hallucination rate, $F$ is the latent semantic fact, $Y$ is model output, and $M$ is semantic cardinality. We also provide a distribution-dependent variant and a bottleneck-aware extension for retrieval-augmented generation. This paper contributes a mathematically consistent formulation, a tighter assumptions section, and concrete empirical protocols for estimation and falsification.
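The baseline bound in the abstract can be sketched numerically. The snippet below is a minimal illustration, not part of the paper's protocols; the function name is hypothetical, and it assumes mutual information measured in bits (so $\log M$ is taken base 2; any consistent base works).

```python
import math

def hallucination_lower_bound(mutual_info_bits: float, m: int) -> float:
    """Fano-style lower bound on hallucination rate H_R from the abstract:
        H_R >= max(0, 1 - (I(F;Y) + 1) / log M),
    with I(F;Y) and log M in the same units (bits here).

    mutual_info_bits: estimated mutual information I(F;Y) in bits.
    m: semantic cardinality M (number of distinguishable latent facts), M >= 2.
    """
    if m < 2:
        raise ValueError("bound is vacuous unless M >= 2")
    return max(0.0, 1.0 - (mutual_info_bits + 1.0) / math.log2(m))

# Example: M = 1024 candidate facts, 6 bits of mutual information
# gives a lower bound of 1 - (6 + 1)/10 = 0.3.
print(hallucination_lower_bound(6.0, 1024))
```

Note that when $I(F;Y) + 1 \geq \log M$ the bound degenerates to zero, which matches the $\max\{0,\cdot\}$ clipping in the displayed equation: the bound only constrains regimes where channel information falls well short of semantic cardinality.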