Visual Hallucination Reduction: An Input-Level Approach for Multimodal Language Models

Abstract

Purpose: Visual hallucinations in Large Language Models (LLMs), in which outputs conflict with the visual input, undermine trust and reliability, especially in applications demanding high transparency, factual correctness, and security. While most prior research focuses on post-hoc corrections or model-specific fine-tuning, the potential of input-stage interventions remains underexplored. This study investigates whether preprocessing alone can mitigate hallucinations without modifying the model architecture.

Methods: We propose an ensemble-based adaptive preprocessing framework that selects the most suitable image filtering strategy for each question type: noise-reduced (NR), edge-enhanced (EE), or the original image (org). The framework requires no retraining and is model-agnostic. We evaluate the method on the HaloQuest benchmark, which features visually challenging multimodal reasoning tasks, and assess hallucination levels with Natural Language Inference (NLI) scores generated via SelfCheckGPT.

Results: Our approach achieves a 44.3% reduction in hallucination rates compared to baseline methods. Notably, this improvement is accomplished without altering the underlying LLM or vision encoder, demonstrating that adaptive preprocessing alone can improve response fidelity.

Conclusion: These findings show that intelligent input conditioning can significantly enhance the factual grounding of LLM outputs. Adaptive preprocessing emerges as a lightweight, architecture-agnostic solution for hallucination mitigation, supporting the development of more secure, interpretable, and trustworthy AI systems.
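
To make the Methods concrete, the following is a minimal sketch of question-conditioned image preprocessing in Python with OpenCV. The keyword heuristics, filter parameters, file names, and the helper functions noise_reduced, edge_enhanced, and select_variant are illustrative assumptions; the paper's actual ensemble selection rules are not specified in the abstract.

    # Illustrative sketch: route each image to NR, EE, or org based on the question.
    # Routing rules and filter settings below are assumptions, not the paper's configuration.
    import cv2
    import numpy as np

    def noise_reduced(image: np.ndarray) -> np.ndarray:
        # "NR" variant: non-local means denoising with typical default-like parameters.
        return cv2.fastNlMeansDenoisingColored(image, None, 10, 10, 7, 21)

    def edge_enhanced(image: np.ndarray) -> np.ndarray:
        # "EE" variant: simple sharpening kernel to emphasize edges and fine detail.
        kernel = np.array([[0, -1, 0],
                           [-1, 5, -1],
                           [0, -1, 0]], dtype=np.float32)
        return cv2.filter2D(image, -1, kernel)

    def select_variant(question: str, image: np.ndarray) -> np.ndarray:
        # Hypothetical heuristics: fine-detail questions favor edge enhancement,
        # counting/scene questions favor denoising, everything else stays original ("org").
        q = question.lower()
        if any(k in q for k in ("text", "read", "sign", "small", "edge")):
            return edge_enhanced(image)
        if any(k in q for k in ("how many", "count", "scene", "background")):
            return noise_reduced(image)
        return image

    if __name__ == "__main__":
        img = cv2.imread("example.jpg")  # any test image (hypothetical file name)
        conditioned = select_variant("How many people are in the scene?", img)
        cv2.imwrite("conditioned.jpg", conditioned)

Because the selection operates purely on the input image and question, any multimodal model can consume the conditioned image without retraining, which is the architecture-agnostic property the abstract emphasizes.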
