Dual-Modal Gated Fusion-Driven BEV 3D Object Detection: Enhancing Sustainable Intelligent Transportation in Nighttime Autonomous Driving

Abstract

Autonomous driving technology is a core enabler of new energy vehicle industrial upgrading and a critical pillar for achieving the Sustainable Development Goals (SDGs), particularly sustainable urban mobility, low-carbon transportation, and efficient intelligent transportation systems (ITS). However, unreliable perception under nighttime low-light conditions severely restricts autonomous driving deployment and thus hinders sustainable transportation. The problem is rooted in visual feature degradation and cross-modal imbalance, both of which impair 3D object detection, the core perception task of autonomous driving. To address this and advance sustainable autonomous driving, this paper proposes a Bird's-Eye View (BEV)-based multi-modal 3D object detection approach tailored to nighttime scenarios, integrating low-light adaptive components while preserving the original BEV pipeline. Without modifying the core inference path, the method enhances low-light robustness and stabilizes cross-modal fusion, ensuring reliable perception for sustainable autonomous driving operation. Extensive experiments on the nuScenes nighttime subset quantify performance with standard metrics (NDS, mAP, mATE). The results show that the method outperforms BEVFusion with negligible parameter and inference overhead, achieving a 1.13% NDS improvement. This validates its effectiveness and provides a sustainable technical tool for autonomous driving perception, promoting new energy vehicle adoption, improving urban ITS efficiency, reducing perception-related accidents and carbon emissions, and contributing directly to transportation and socio-economic sustainability.
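To make the gated fusion idea named in the title concrete, the sketch below shows one plausible form of a dual-modal gate over BEV feature maps. This is a hypothetical illustration, not the paper's actual module: the abstract does not specify the architecture, so the module name (GatedBEVFusion), channel widths, and gate design are all assumptions. The key behavior illustrated is that a learned per-location gate can down-weight degraded camera features in low-light scenes while leaving the surrounding BEV pipeline unchanged.

```python
import torch
import torch.nn as nn


class GatedBEVFusion(nn.Module):
    """Minimal sketch of dual-modal gated fusion in BEV space.

    Hypothetical module: camera and LiDAR BEV feature maps on the same
    grid are fused through a learned per-pixel gate, letting the network
    suppress unreliable camera features under nighttime low light.
    """

    def __init__(self, cam_channels: int, lidar_channels: int, out_channels: int):
        super().__init__()
        # Project both modalities to a common channel width.
        self.cam_proj = nn.Conv2d(cam_channels, out_channels, kernel_size=1)
        self.lidar_proj = nn.Conv2d(lidar_channels, out_channels, kernel_size=1)
        # Gate network: predicts a per-location weight for the camera branch.
        self.gate = nn.Sequential(
            nn.Conv2d(2 * out_channels, out_channels, kernel_size=3, padding=1),
            nn.ReLU(inplace=True),
            nn.Conv2d(out_channels, 1, kernel_size=1),
            nn.Sigmoid(),
        )

    def forward(self, cam_bev: torch.Tensor, lidar_bev: torch.Tensor) -> torch.Tensor:
        # Both inputs: (B, C, H, W) BEV feature maps on the same spatial grid.
        cam = self.cam_proj(cam_bev)
        lidar = self.lidar_proj(lidar_bev)
        g = self.gate(torch.cat([cam, lidar], dim=1))  # (B, 1, H, W), values in [0, 1]
        # Convex combination: g -> 0 suppresses degraded camera features.
        return g * cam + (1.0 - g) * lidar


# Usage with assumed channel widths (80 camera / 256 LiDAR, as in
# BEVFusion-style pipelines) on a 180x180 BEV grid.
fusion = GatedBEVFusion(cam_channels=80, lidar_channels=256, out_channels=128)
fused = fusion(torch.randn(2, 80, 180, 180), torch.randn(2, 256, 180, 180))
print(fused.shape)  # torch.Size([2, 128, 180, 180])
```

Because the gate is a small convolutional head applied once per fusion step, a design of this kind is consistent with the abstract's claim of negligible parameter and inference overhead relative to the base BEVFusion pipeline.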
