Multiscale scene parsing network

Abstract

To address a core challenge of existing lightweight scene parsing networks, balancing the precision of multiscale feature representation against computational efficiency, this paper proposes MSPNet, a lightweight multiscale scene parsing network. The network adopts StarNet as its backbone to exploit its efficient low-to-high-dimensional feature transformation, and embeds the proposed Efficient Pixel Localization Attention (EPLA) module into the PSPNet architecture. Rather than simply stacking modules, EPLA integrates two synergistic submodules: ELA (Efficient Localization Attention) and PagFM (Pyramid Attention-Guided Feature Module). ELA uses a dynamic weight-allocation mechanism to localize features precisely at the pixel level while reducing attention computation overhead by 38%; PagFM builds a hierarchical pyramid fusion architecture that adaptively guides cross-scale feature integration and strengthens the representation of small targets. MSPNet additionally incorporates depthwise separable convolutions and channel reparameterization to further reduce model size. On the Pascal VOC2012 validation set, MSPNet achieves a mean Intersection over Union (mIoU) of 87.19%, a 1.79% improvement over PSPNet. With computational cost (9.7 GFLOPs with the StarNet-s4 backbone) and parameter count (7.4 M) comparable to the MobileNet series, MSPNet outperforms contemporary lightweight state-of-the-art models in both accuracy and efficiency, providing an effective solution for real-time semantic segmentation on resource-constrained mobile devices. The code for MSPNet is available at https://github.com/Eric-863/MSPnet.
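The abstract names two well-known efficiency ingredients: depthwise separable convolutions and strip-style localization attention in the spirit of ELA. The PyTorch sketch below illustrates both; it is a minimal illustration under assumed hyperparameters (3x3 depthwise kernel, 1D kernel size 7, GroupNorm), not the authors' EPLA or MSPNet implementation from the linked repository.

```python
import torch
import torch.nn as nn


class DepthwiseSeparableConv(nn.Module):
    """Depthwise separable convolution: a per-channel 3x3 (depthwise)
    convolution followed by a 1x1 (pointwise) convolution, the standard
    factorization used by the MobileNet family to cut FLOPs and parameters."""

    def __init__(self, in_ch: int, out_ch: int, stride: int = 1):
        super().__init__()
        self.depthwise = nn.Conv2d(in_ch, in_ch, 3, stride=stride,
                                   padding=1, groups=in_ch, bias=False)
        self.pointwise = nn.Conv2d(in_ch, out_ch, 1, bias=False)
        self.bn = nn.BatchNorm2d(out_ch)
        self.act = nn.ReLU(inplace=True)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.act(self.bn(self.pointwise(self.depthwise(x))))


class StripLocalizationAttention(nn.Module):
    """Strip-pooling attention in the spirit of ELA: average-pool along each
    spatial axis, refine each 1D profile with a depthwise 1D convolution, and
    rescale the feature map with the resulting per-axis weights. The kernel
    size and normalization here are illustrative assumptions."""

    def __init__(self, channels: int, kernel_size: int = 7, groups: int = 16):
        super().__init__()
        assert channels % groups == 0, "GroupNorm needs channels % groups == 0"
        self.conv = nn.Conv1d(channels, channels, kernel_size,
                              padding=kernel_size // 2, groups=channels,
                              bias=False)
        self.norm = nn.GroupNorm(groups, channels)
        self.sigmoid = nn.Sigmoid()

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        b, c, h, w = x.shape
        # Pool over W to get a height profile, over H to get a width profile.
        attn_h = self.sigmoid(self.norm(self.conv(x.mean(dim=3))))  # (B, C, H)
        attn_w = self.sigmoid(self.norm(self.conv(x.mean(dim=2))))  # (B, C, W)
        return x * attn_h.view(b, c, h, 1) * attn_w.view(b, c, 1, w)


if __name__ == "__main__":
    x = torch.randn(1, 64, 32, 32)
    y = StripLocalizationAttention(128)(DepthwiseSeparableConv(64, 128)(x))
    print(y.shape)  # torch.Size([1, 128, 32, 32])
```

Sharing one 1D convolution across both spatial axes keeps the parameter count low, which is the general motivation behind attention of this kind; the actual EPLA design, including the 38% overhead reduction cited above, should be checked against the linked repository.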
