Enhancing Infrared-Visible Image Fusion via Text-Guided Adaptive Feature Integration

Abstract

Image fusion techniques aim to integrate complementary information from multiple modalities, such as infrared and visible images, to generate enhanced images that preserve both texture details and salient targets. Traditional methods often overemphasize low-level visual features and neglect high-level semantic information, which limits their performance in downstream applications. This paper proposes a text-guided adaptive fusion network that incorporates language-based textual descriptions during feature extraction to capture semantic information effectively. An Adaptive Attention Fusion module dynamically integrates critical features from both modalities, while a simplified ResFormer module enhances the network's ability to perceive local details and global structures. Extensive experiments demonstrate that our method outperforms state-of-the-art approaches in both subjective visual quality and objective metrics, achieving significant improvements in high-level vision tasks such as semantic segmentation and object detection (e.g., an 8% increase in mIoU for semantic segmentation on the MSRS dataset). Our findings underscore the potential of text-guided fusion networks in advancing image fusion technology. The code and datasets are available at https://github.com/VCMHE/TGAF.
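To make the fusion idea concrete, the following minimal PyTorch sketch shows one plausible form of text-conditioned adaptive attention fusion: a text embedding modulates the infrared and visible feature maps, and a small convolutional head predicts per-pixel blending weights. All module names, dimensions, and the FiLM-style modulation here are illustrative assumptions, not the authors' released TGAF implementation (see the linked repository for that).

# Minimal sketch of text-guided adaptive fusion; names and shapes are assumptions.
import torch
import torch.nn as nn


class AdaptiveAttentionFusion(nn.Module):
    """Blend infrared and visible feature maps using text-conditioned weights."""

    def __init__(self, feat_dim: int, text_dim: int):
        super().__init__()
        # Project the text embedding into the image-feature channel space.
        self.text_proj = nn.Linear(text_dim, feat_dim)
        # Predict two per-pixel fusion weights from the concatenated features.
        self.weight_net = nn.Sequential(
            nn.Conv2d(feat_dim * 2, feat_dim, kernel_size=3, padding=1),
            nn.ReLU(inplace=True),
            nn.Conv2d(feat_dim, 2, kernel_size=1),
        )

    def forward(self, feat_ir, feat_vis, text_emb):
        # Modulate both modalities with the projected text embedding.
        t = self.text_proj(text_emb)[:, :, None, None]  # (B, C, 1, 1)
        feat_ir = feat_ir * t.sigmoid()
        feat_vis = feat_vis * t.sigmoid()
        # Softmax over the two modality channels gives adaptive fusion weights.
        w = self.weight_net(torch.cat([feat_ir, feat_vis], dim=1)).softmax(dim=1)
        return w[:, :1] * feat_ir + w[:, 1:] * feat_vis  # fused feature map


# Example: fuse 64-channel features guided by a 512-d text embedding (e.g., CLIP-sized).
fuse = AdaptiveAttentionFusion(feat_dim=64, text_dim=512)
fused = fuse(
    torch.randn(1, 64, 128, 128),  # infrared features
    torch.randn(1, 64, 128, 128),  # visible features
    torch.randn(1, 512),           # text embedding
)
print(fused.shape)  # torch.Size([1, 64, 128, 128])

The key design point illustrated here is that the fusion weights are spatially varying and conditioned on the text, so semantically salient regions can draw more from one modality than the other rather than being averaged uniformly.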
