Akhat-DETR: End-to-End Object Detection Model on Hazy Scenarios in Autonomous Driving

Zhao Liu
Zhiwei Liu

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

The innovative DEtection TRansformer(DETR) approach introduces the transformer encoder and decoder architecture into object detection, obviating the need for hand-designed components. Though modern detectors have attained competitive results on public dataset such as COCO dataset, their capabilities are perverted on images captured in inclement weather. In this paper, we propose Akhat-DETR, an end-to-end transformer-based detector designed for hazy scenes. First, we design a light-weight convolutional dehazing network which can be integrated seamlessly into detectors. Moreover, we design a novel one-size-fits-all feature fusion module named FFTA. In the end, a general supervised learning design paradigm is given: as long as the final annotations are available, intermediate annotations are dispensable, thus the end-to-end model can perform training and inference in its entirety. Akhat-DETR achieves 61.0% AP on RTTS dataset with a 3090 GPU, triumphing over state-of-the-art detectors. Codes of proposed modules, splitted dataset in COCO format and pre-trained models are available at https://github.com/ChizkiyahuOhayon/Akhat-DETR.

Version published to 10.21203/rs.3.rs-8581207/v1 on Research Square
Mar 17, 2026

A Deep Learning Based Aggregative Framework for Object Detection in Road Environments

This article has 1 author:
1. thayyaba khatoon mohammed
This article has no evaluationsLatest version Mar 6, 2026
A Deep Learning-Based Visual SLAM Approach for Indoor Dynamic Scenes

This article has 4 authors:
1. jiarui qin
2. yugang wang
3. xueli cong
4. liyao zhou
This article has no evaluationsLatest version Mar 30, 2026
ESO-YOLO: Enhanced Small Object Detection Algorithm from Multiple Perspectives

This article has 4 authors:
1. Dong Wu
2. Wenhao Guan
3. Bingjie Zhang
4. Hao Chen
This article has no evaluationsLatest version Apr 13, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

A Deep Learning Based Aggregative Framework for Object Detection in Road Environments

A Deep Learning-Based Visual SLAM Approach for Indoor Dynamic Scenes

ESO-YOLO: Enhanced Small Object Detection Algorithm from Multiple Perspectives