DRMNet: More efficient bilateral networks for real-time semantic segmentation of road scenes


Abstract

Semantic segmentation is crucial in autonomous driving because it provides accurate identification and segmentation of objects and regions. However, segmentation accuracy and real-time performance conflict on embedded devices. We propose an efficient, lightweight semantic segmentation network (DRMNet) to address this problem. We use a lightweight bilateral structure that encodes semantic and spatial paths and cross-fuses their features during encoding, and we add unique skip connections to coordinate upsampling in the semantic path. At the end of the semantic branch, we design a new self-calibrated aggregate pyramid pooling module (SAPPM) to capture more comprehensive multi-scale semantic information while balancing feature extraction and inference speed in the semantic branch. Furthermore, we design a new feature fusion module that guides the fusion of detail and semantic features through attention perception, alleviating the tendency of semantic information to quickly overwhelm spatial detail. Experimental results on the CityScapes and CamVid datasets demonstrate the effectiveness of DRMNet. On a 2080Ti GPU, our model achieves 78.6% mIoU on CityScapes and 78.9% mIoU on CamVid, at 88.3 FPS and 149 FPS, respectively. These results show that the model strikes a better balance between accuracy and real-time performance, making it suitable for embedded devices in autonomous driving applications.
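To make the attention-guided fusion idea concrete, the following is a minimal PyTorch sketch of fusing a high-resolution detail (spatial) branch with a low-resolution semantic branch, where channel attention derived from the semantic features gates the detail features before summation. The module name, channel widths, and exact operations are illustrative assumptions, not the authors' implementation of DRMNet's fusion module.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class AttentionGuidedFusion(nn.Module):
    """Hypothetical sketch: fuse detail and semantic features via
    channel attention computed from the semantic branch."""

    def __init__(self, detail_ch: int, semantic_ch: int, out_ch: int):
        super().__init__()
        # Project both branches to a common channel width.
        self.detail_proj = nn.Sequential(
            nn.Conv2d(detail_ch, out_ch, 1, bias=False),
            nn.BatchNorm2d(out_ch),
        )
        self.semantic_proj = nn.Sequential(
            nn.Conv2d(semantic_ch, out_ch, 1, bias=False),
            nn.BatchNorm2d(out_ch),
        )
        # Channel attention from globally pooled semantic features.
        self.attn = nn.Sequential(
            nn.AdaptiveAvgPool2d(1),
            nn.Conv2d(out_ch, out_ch, 1),
            nn.Sigmoid(),
        )

    def forward(self, detail: torch.Tensor, semantic: torch.Tensor) -> torch.Tensor:
        # Upsample semantic features to the detail branch's resolution.
        semantic = F.interpolate(
            semantic, size=detail.shape[-2:], mode="bilinear", align_corners=False
        )
        d = self.detail_proj(detail)
        s = self.semantic_proj(semantic)
        w = self.attn(s)  # (N, C, 1, 1) attention weights
        # Attention gates the detail features so the upsampled semantics
        # do not simply overwrite fine spatial information.
        return d * w + s * (1.0 - w)


if __name__ == "__main__":
    fuse = AttentionGuidedFusion(detail_ch=64, semantic_ch=128, out_ch=128)
    detail = torch.randn(1, 64, 96, 96)     # 1/8-resolution spatial path
    semantic = torch.randn(1, 128, 24, 24)  # 1/32-resolution semantic path
    print(fuse(detail, semantic).shape)     # torch.Size([1, 128, 96, 96])
```

The gating weights sum to one across the two branches, so the fusion interpolates between detail and semantic responses per channel rather than letting one branch dominate; this is one common way to realize "attention-perception" fusion in bilateral networks.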
