Cross-modal Feature Fusion via Mutual Assistance: A Novel Network for Enhanced Object Detection

Abstract

This paper presents the Mutual Feature Fusion Network (MFFNet), a novel cross-modal object detection framework that leverages infrared and visible-light images to enhance detection accuracy. MFFNet employs a dual-stream backbone network based on YOLOv5 to extract features from the two modalities independently. The proposed Interassisted Fusion Block (IFB) is integrated into the network's intermediate layers, where it enables the two modalities to assist each other and fuses their complementary features. To address uneven sample difficulty, we introduce the Generalized Efficient Intersection over Union (EIOU) loss function, which adaptively adjusts weights to prioritize high-quality anchor boxes. Extensive experiments on two public datasets, M3FD and LLVIP, demonstrate that MFFNet achieves state-of-the-art performance in both detection accuracy and efficiency. By effectively exploiting the complementary strengths of the infrared and visible-light modalities, MFFNet significantly improves detection accuracy, especially for small objects and in challenging lighting conditions. The code for this study is available on GitHub: GitHub Repository.
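The abstract does not give the exact form of the Generalized EIOU loss. As a rough illustration only, the sketch below implements the standard EIoU terms (IoU, center-distance, and width/height penalties relative to the smallest enclosing box) together with a focal-style IoU weighting that up-weights higher-quality anchor boxes, which is one plausible reading of "adaptively adjusts weights to prioritize high-quality anchor boxes." The function name `eiou_loss` and the exponent `gamma` are illustrative assumptions, not definitions from the paper.

```python
import torch


def eiou_loss(pred, target, gamma=0.5, eps=1e-7):
    """Sketch of an EIoU-style loss with IoU-based focal weighting.

    `pred` and `target` are (N, 4) tensors of boxes in (x1, y1, x2, y2) format.
    The exact Generalized EIOU used by MFFNet may differ from this reading.
    """
    # Intersection area
    ix1 = torch.max(pred[:, 0], target[:, 0])
    iy1 = torch.max(pred[:, 1], target[:, 1])
    ix2 = torch.min(pred[:, 2], target[:, 2])
    iy2 = torch.min(pred[:, 3], target[:, 3])
    inter = (ix2 - ix1).clamp(min=0) * (iy2 - iy1).clamp(min=0)

    # Union and IoU
    area_p = (pred[:, 2] - pred[:, 0]) * (pred[:, 3] - pred[:, 1])
    area_t = (target[:, 2] - target[:, 0]) * (target[:, 3] - target[:, 1])
    iou = inter / (area_p + area_t - inter + eps)

    # Smallest enclosing box and its squared diagonal
    cx1 = torch.min(pred[:, 0], target[:, 0])
    cy1 = torch.min(pred[:, 1], target[:, 1])
    cx2 = torch.max(pred[:, 2], target[:, 2])
    cy2 = torch.max(pred[:, 3], target[:, 3])
    cw = (cx2 - cx1) + eps
    ch = (cy2 - cy1) + eps
    c2 = cw ** 2 + ch ** 2

    # Normalized squared distance between box centers
    px = (pred[:, 0] + pred[:, 2]) / 2
    py = (pred[:, 1] + pred[:, 3]) / 2
    tx = (target[:, 0] + target[:, 2]) / 2
    ty = (target[:, 1] + target[:, 3]) / 2
    dist = ((px - tx) ** 2 + (py - ty) ** 2) / c2

    # Width and height consistency terms
    pw, ph = pred[:, 2] - pred[:, 0], pred[:, 3] - pred[:, 1]
    tw, th = target[:, 2] - target[:, 0], target[:, 3] - target[:, 1]
    w_term = (pw - tw) ** 2 / (cw ** 2)
    h_term = (ph - th) ** 2 / (ch ** 2)

    eiou = 1.0 - iou + dist + w_term + h_term
    # Focal-style reweighting: higher-IoU (higher-quality) boxes get larger weight
    return (iou.detach() ** gamma * eiou).mean()
```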
