Instance Segmentation of LiDAR Point Clouds with Local Perception and Channel Similarity

Xinmiao Du
Xihong Wu

Abstract

LiDAR is a crucial sensor in intelligent driving vehicles, and its point cloud data supports perception of the surrounding environment and high-accuracy localization. Compared with semantic segmentation, point cloud instance segmentation is a more complicated task, but for autonomous driving systems precise instance segmentation offers a more detailed understanding of the scene. Point cloud data is sparse and irregular, and its density varies with distance from the sensor. In this paper, a LiDAR channel-aware point segmentation network (LCPSNet) is proposed to address these problems. Given the distance-dependent sparsity and drastic scale variation of LiDAR data, we adopt a top-down feature pyramid network (FPN): high-level features are progressively upsampled and summed element-wise with the corresponding shallow layers. Beyond this standard fusion, the fused features at the 1/16, 1/8, and 1/4 scales are resampled to a common BEV/polar grid. These aligned features are then fed to the Local Perception Module (LPM), which performs cross-scale, position-dependent weighting and modulation at the same spatial locations. The LPM uses global spatial information as guidance while preserving attention to group (scale) differences: the local features of each group are weighted position by position and re-fused on the same grid, which enhances both intra-object and long-range context while suppressing cross-instance interference. The Inter-Channel Correlation Module (ICCM) uses a ball query to define local feature regions and jointly models the spatio-temporal and channel information of the LiDAR point cloud within the same module; an inter-channel similarity matrix is computed to remove redundant features and highlight valid ones. Experiments on the SemanticKITTI and Waymo datasets verify that the two modules effectively improve local feature modeling and global semantic consistency modeling, respectively. On the SemanticKITTI dataset, LCPSNet achieves a PQ of 70.9 and an mIoU of 77.1, and its instance segmentation performance exceeds that of existing mainstream methods, reaching state-of-the-art (SOTA) results.
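
The abstract does not say how the ICCM is implemented, so the following NumPy sketch only illustrates the two ideas it names: a ball query that selects a local neighborhood, and an inter-channel similarity matrix used to down-weight redundant channels. It is not the authors' code. The function names, the choice of cosine similarity, and the reweighting rule (one minus the mean absolute similarity to the other channels) are all assumptions made for illustration.

```python
import numpy as np


def ball_query(points, center, radius):
    """Return indices of the points lying within `radius` of `center`.

    points: (N, 3) array of xyz coordinates; center: (3,) array.
    """
    dist = np.linalg.norm(points - center, axis=1)
    return np.nonzero(dist <= radius)[0]


def channel_similarity(features):
    """Cosine similarity between feature channels.

    features: (N, C) array, one row per point. Returns a (C, C) matrix.
    """
    norms = np.linalg.norm(features, axis=0, keepdims=True)
    unit = features / np.maximum(norms, 1e-12)  # guard against zero channels
    return unit.T @ unit


def reweight_channels(features):
    """Scale each channel by (1 - mean |similarity| to the other channels).

    This rule is a hypothetical stand-in for "remove redundant features":
    channels that duplicate others get a smaller weight.
    """
    sim = np.abs(channel_similarity(features))
    num_channels = sim.shape[0]
    redundancy = (sim.sum(axis=1) - np.diag(sim)) / max(num_channels - 1, 1)
    weights = 1.0 - redundancy
    return features * weights, weights


# Toy data: four points, three feature channels (made up for this example).
points = np.array([[0.0, 0.0, 0.0],
                   [0.5, 0.0, 0.0],
                   [0.0, 0.5, 0.0],
                   [3.0, 0.0, 0.0]])
features = np.array([[1.0, 2.0, 1.0],
                     [2.0, 4.0, -2.0],
                     [3.0, 6.0, 1.0],
                     [4.0, 8.0, 5.0]])

neighbors = ball_query(points, center=np.zeros(3), radius=1.0)
local_features = features[neighbors]          # features inside the ball
reweighted, weights = reweight_channels(local_features)
```

In this toy neighborhood the second channel is an exact multiple of the first, so both receive a reduced weight, while the third channel is orthogonal to them and keeps its full weight. A learned module would typically replace the fixed rule with trainable attention, which the abstract does not detail.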

Version published to 10.20944/preprints202508.0572.v1
Aug 7, 2025

Related articles

Deep Segmentation of 3+1D Radar Point Cloud for Real-Time Roadside Traffic User Detection

This article has 3 authors:
1. Savankumar Bhanderi
2. Shiva Agrawal
3. Gordon Elger
This article has no evaluations. Latest version: Aug 5, 2025

A Trade-Off Analysis of Point Cloud Density for Real-Time 3D Semantic Segmentation

This article has 2 authors:
1. Márk Endre Barta
2. Kristóf Kapitány
This article has no evaluations. Latest version: Jul 31, 2025

A High-precision and High-robust Lidar-inertial SLAM Method Suitable for Robot Operation Scenarios

This article has 6 authors:
1. Baoliang Wang
2. Quanyu Wu
3. Yu Liu
4. Huayong Zhang
5. Xiaojie Liu
6. Lingjiao Pan
This article has no evaluations. Latest version: Jul 23, 2025
