Bi-Predictability: A Real-Time Signal for Monitoring LLM Interaction Integrity
Abstract
Large language models (LLMs) are increasingly deployed in multi-turn workflows where reliability depends on maintaining interaction integrity over time. Current evaluation methods are poorly matched to this setting: judge-based systems are post hoc and costly, while token-level measures such as perplexity capture output uncertainty but not whether the interaction remains structurally coupled. Here we show that interaction integrity can be monitored continuously using bi-predictability (𝑃), an information-theoretic measure computed from token-frequency statistics across the context–response–next-prompt loop. We operationalize 𝑃 through the Information Digital Twin (IDT), a lightweight architecture that estimates coupling from the observable token stream alone, without embeddings, auxiliary evaluators, or access to model internals. Across 4,500 turns between one student model and three frontier teacher models, the IDT detected all tested perturbations, including contradictions, topic shifts, and non-sequiturs, with 100% sensitivity, matching costlier methods at a fraction of the overhead. Structural coupling and semantic quality proved empirically separable: 𝑃 aligned with structural consistency in 85% of conditions but with semantic scores in only 44%, revealing a regime of silent uncoupling in which responses remain strong while interaction integrity degrades. These results establish 𝑃 as a practical, low-cost, real-time drift-monitoring signal and suggest that structural and semantic evaluation should serve as complementary layers in reliable LLM deployment.
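The abstract does not give the exact formula for 𝑃, only that it is an information-theoretic quantity computed from token-frequency statistics of adjacent turns. As a purely illustrative sketch of how such a coupling signal could be computed from the observable token stream alone (the function name, the choice of unigram statistics, and the use of a Jensen–Shannon-style similarity are all assumptions, not the paper's method):

```python
from collections import Counter
import math

def unigram_dist(tokens):
    """Normalised token-frequency distribution over a turn."""
    counts = Counter(tokens)
    total = sum(counts.values())
    return {t: c / total for t, c in counts.items()}

def bi_predictability(prev_turn, next_turn):
    """Illustrative coupling proxy (NOT the paper's exact P):
    1 minus the base-2 Jensen-Shannon divergence between the
    unigram distributions of two adjacent turns, so 1.0 means
    identical token statistics and 0.0 means disjoint vocabularies."""
    p = unigram_dist(prev_turn)
    q = unigram_dist(next_turn)
    js = 0.0
    for t in set(p) | set(q):
        pt, qt = p.get(t, 0.0), q.get(t, 0.0)
        mt = 0.5 * (pt + qt)  # mixture distribution
        if pt > 0:
            js += 0.5 * pt * math.log2(pt / mt)
        if qt > 0:
            js += 0.5 * qt * math.log2(qt / mt)
    return 1.0 - js

# Coupled turns share vocabulary; a non-sequitur does not.
coupled = bi_predictability("the model drifted".split(),
                            "why did the model drift".split())
broken = bi_predictability("the model drifted".split(),
                           "bananas are yellow fruit".split())
assert coupled > broken
```

A monitor built this way needs only the token stream of each turn, matching the abstract's claim that no embeddings, auxiliary evaluators, or model internals are required; a drop in the signal across the response–next-prompt boundary would flag the "silent uncoupling" regime even when individual responses still score well semantically.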