All We Also Need Is ABSTAIN: Eliminating Hallucinations via a Single Token


Abstract

Large language models (LLMs) suffer from hallucinations: when uncertain, they confidently generate false information. Here we demonstrate that hallucinations stem primarily from the constraint that models must always select a token from a fixed vocabulary, with no mechanism for expressing uncertainty. We propose and test a simple solution: adding a single ABSTAIN token to the vocabulary and training models to predict it via corruption augmentation, a scalable data augmentation technique in which corrupted inputs are mapped to the abstain token. In a simple feedforward network tasked with single-token prediction, this approach eliminated hallucinations on unseen data (reducing the hallucination rate from 95% to 0%) while maintaining perfect accuracy on known examples. The same principle also scaled to a real question-answering (QA) model: a distilled BERT fine-tuned on SQuAD abstained on 95% of nonsense questions at the optimal corruption level without a catastrophic loss of accuracy.
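
To make the corruption-augmentation idea concrete, the following is a minimal sketch, not the authors' implementation: the vocabulary, the random-replacement corruption scheme, and the helper names (corrupt, augment, corruption_ratio) are illustrative assumptions. It shows the core mechanic described in the abstract: append one ABSTAIN token to the vocabulary and pair corrupted copies of training inputs with that token as the target.

```python
import random

# Illustrative sketch of corruption augmentation with an ABSTAIN token.
# Vocabulary, corruption scheme, and function names are assumptions,
# not the paper's exact setup.

VOCAB = ["the", "cat", "dog", "sat", "on", "mat"]
ABSTAIN = "<ABSTAIN>"            # single extra token appended to the vocabulary
VOCAB_PLUS = VOCAB + [ABSTAIN]

def corrupt(tokens, vocab=VOCAB):
    """Replace every token with a random vocabulary item, destroying the input's meaning."""
    return [random.choice(vocab) for _ in tokens]

def augment(dataset, corruption_ratio=1.0):
    """For each (input, target) pair, optionally add a corrupted copy whose target is ABSTAIN."""
    augmented = list(dataset)
    for inputs, _target in dataset:
        if random.random() < corruption_ratio:
            augmented.append((corrupt(inputs), ABSTAIN))
    return augmented

if __name__ == "__main__":
    clean = [(["the", "cat", "sat", "on", "the"], "mat"),
             (["the", "dog", "sat", "on", "the"], "mat")]
    for x, y in augment(clean, corruption_ratio=1.0):
        print(x, "->", y)
```

A model trained on the augmented set still sees the original examples with their true targets, so accuracy on known inputs is preserved, while the corrupted examples teach it to route unfamiliar inputs to ABSTAIN instead of guessing.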
