Advancing HGV Detection with Limited Data: A Semantic Segmentation Framework Using SLiMe
Abstract
Heavy Goods Vehicle (HGV) segmentation is an essential enabler for intelligent transportation systems, accurate vehicle assessment, and traffic law enforcement. However, classical deep learning methods rely on dense pixel-level annotations, which are expensive and time-consuming to collect, particularly for specific HGV categories and rare traffic scenes. In this paper, we utilise a one-shot segmentation method, SLiMe (Segment Like Me), which employs the cross-attention and self-attention maps of a pre-trained Stable Diffusion model to transfer knowledge from a single annotated reference image to unseen target scenes. Compared with closed-set object detectors such as YOLOv11 or generic promptable segmenters such as SAM and CLIPSeg, SLiMe performs class-specific segmentation from a single annotated example, eliminating retraining and prompt-engineering effort. Our approach demonstrates strong segmentation quality on HGVs across varied urban traffic scenes, outperforming baseline models in mean IoU and pixel accuracy, particularly under challenging lighting conditions or occlusion. These results support deploying SLiMe in transportation scenarios that require fine-grained semantic understanding with limited annotation, with a focus on real-time inference performance, highlighting its relevance for smart cities and autonomous-vehicle perception systems.
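To make the underlying mechanism concrete, the minimal PyTorch sketch below shows how a cross-attention map relates text-token embeddings to spatial UNet features, the signal SLiMe exploits: one token's attention column, reshaped to the feature grid, acts as a coarse mask for that token's concept. This is an illustrative sketch, not the authors' implementation; the projection layers, tensor shapes, and token index are assumptions chosen to mirror typical Stable Diffusion dimensions.

```python
import torch
import torch.nn.functional as F

def cross_attention_map(spatial_feats, token_embeds, dim_head=64):
    """spatial_feats: (B, H*W, C) UNet features; token_embeds: (B, T, C_txt).
    Returns (B, H*W, T): a per-pixel distribution over text tokens."""
    b, hw, c = spatial_feats.shape
    # Hypothetical linear projections standing in for the UNet's trained Q/K weights.
    to_q = torch.nn.Linear(c, dim_head, bias=False)
    to_k = torch.nn.Linear(token_embeds.shape[-1], dim_head, bias=False)
    q = to_q(spatial_feats)                       # (B, H*W, d)
    k = to_k(token_embeds)                        # (B, T, d)
    attn = (q @ k.transpose(1, 2)) / dim_head ** 0.5
    return attn.softmax(dim=-1)

# Toy usage with shapes typical of a Stable Diffusion UNet block (assumed here).
feats = torch.randn(1, 32 * 32, 320)              # a 32x32 feature map, 320 channels
tokens = torch.randn(1, 77, 768)                  # CLIP text embeddings, 77 tokens
amap = cross_attention_map(feats, tokens)         # (1, 1024, 77)
mask = amap[0, :, 5].reshape(32, 32)              # attention for a hypothetical token #5
mask = F.interpolate(mask[None, None], size=(512, 512), mode="bilinear")[0, 0]
binary = (mask > mask.mean()).float()             # crude threshold -> coarse binary mask
```

In SLiMe, roughly speaking, the text embeddings are optimised so that maps of this kind reproduce the single annotated reference mask; at inference the same embeddings yield masks for unseen scenes without any retraining.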