Maritime Ship Target Detection Based on Visible and Infrared Modal Image Fusion

Runbang Liu
Zhiyu Zhu
Huilin Ge
Jing Wang
Yongdong Shu
Qingshan Ji

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

The deep learning based maritime ship target detection is a key technology in fields such as ship navigation, water surface security, and military early warning. In view of the inherent limitations of maritime vessel object detection in single modality, a novel YOLO for maritime vessel object detection according to the visible and infrared modality images fusion (VIMF-YOLO) is built. The VIMF-YOLO is improved from YOLO v8 and which can effectively extract and aggregate the features of different modal ship target images. Additionally, it employs dual-modal fusion module (DMFM) to adaptively weight and fuse the different modalities features of vessel images in visible and infrared, thereby fully leveraging the complementary superiority of these modalities. To better acquire channel and positional information of different modal features, efficient multi-scale attention (EMA) is introduced into DMFM and VIMF-YOLO networks to improve the representation ability of different modal features. In addition, a paired image dataset for visible and infrared maritime ship images is built, and a large number detection test experiments for VIMF-YOLO is conducted on this basis. The experimental results prove that, matched with current SOTA ship target detection algorithms, the dual-modal fusion detection algorithm VIMF-YOLO exhibits superior detection accuracy.

Version published to 10.21203/rs.3.rs-9091613/v1 on Research Square
Mar 20, 2026

A feature enhancement and attention fusion network for small object detection in UAV imagery

This article has 4 authors:
1. Xilong Xu
2. Peng Li
3. Hongwei Ding
4. Jinhua Yang
This article has no evaluationsLatest version Mar 23, 2026
Vision Transformer-based Change Detection in optical and SAR Remote Sensing Images

This article has 1 author:
1. Emna Brahim
This article has no evaluationsLatest version Apr 15, 2026
Phase Congruency-Guided Cross-Scale Contextual Fusion Network for Salient Object Detection in Optical Remote Sensing Images

This article has 6 authors:
1. Junfang Jiang
2. Wanjin Wang
3. Xiaohui Lin
4. Pingping Miao
5. Lina Gao
6. Mingzhu Xu
This article has no evaluationsLatest version Mar 27, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

A feature enhancement and attention fusion network for small object detection in UAV imagery

Vision Transformer-based Change Detection in optical and SAR Remote Sensing Images

Phase Congruency-Guided Cross-Scale Contextual Fusion Network for Salient Object Detection in Optical Remote Sensing Images