Large-Scale Airborne LiDAR Point Cloud BuildingExtraction Based on Improved Voxelized DeepLearning Network

Bai Xue
Yanru Song
Pi Ai
Hongzhou Li
Shuhan Liu
Li Guo

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

To address the critical challenges of semantic ambiguity, uneven density distribution, and inadequate adaptability to complexstructures in large-scale urban LiDAR point cloud building extraction, this paper proposes a novel approach integratinggeometric topology perception with cross-dimensional attention mechanisms. Based on the Sparse Voxel ConvolutionalNeural Network (SPVCNN) framework, we innovatively design the following key technologies: First, we propose an enhancedLasermix++ multi-scale hybrid augmentation algorithm. It employs cross-scene point cloud block replacement with probabilitydriven sampling, coupled with ground normal-constrained rotation matrices and nonuniform scaling strategies. Secondly, thecollaborative mechanism of Geometric Self-Attention (GSA) and Cross-Space Residual Attention (CSRA) are first embedded inthe SPVCNN dual-branch framework. The topological preservation coding of building geometric features is realized by dynamicvoxel granularity adjustment and GSA module. Finally, we introduce a Boundary Enhancement Module (BEM) to effectivelyresolve separation challenges in highly overlapping structures and mitigate boundary ambiguity issues. The experiment uses177 square kilometers of airborne LiDAR data in Washington, D.C., United States. The results show that: Compared to thebaseline SPVCNN (Acc = 0.8212, IoU = 0.866), the proposed GSA-CSRA framework achieves significant improvements,with accuracy increasing to 0.9416 (+12.04%) and IoU to 0.9656 (+9.96%), substantially outperforming attention variantssuch as Squeeze-and-Excitation (SE) and Convolutional Block Attention Module (CBAM). Furthermore, the proposed methodachieves a remarkable accuracy improvement exceeding 50% compared to mainstream point cloud networks, as evidenced byits superior performance against Cylinder3D (Acc = 0.4189) and MinkResNet (Acc = 0.5328). This significant advancementclearly demonstrates the breakthrough advantages of combining geometric perception with adaptive attention mechanisms forbuilding extraction from point clouds.

Version published to 10.21203/rs.3.rs-7497997/v1 on Research Square
Oct 27, 2025

Enhancing Point Cloud Completion with Fine-Grained Geometric Perception

This article has 7 authors:
1. Limin Zhang
2. Lu Shi
3. Linna Zhang
4. Yi Jin
5. Yidong Li
6. Yigang Cen
7. Jian Zhang
This article has no evaluationsLatest version Jan 23, 2026
MinkUNeXt-SI: Improving point cloud-based place recognition including spherical coordinates and LiDAR intensity

This article has 5 authors:
1. Judith Vilella-Cantos
2. Juan José Cabrera
3. Luis Payá
4. Mónica Ballesta
5. David Valiente
This article has no evaluationsLatest version Dec 17, 2025
CSM-DETR: Construction Site Monitoring via Mamba-Enhanced Detection Transformer for UAV Aerial Imagery

This article has 1 author:
1. Long Zhang
This article has no evaluationsLatest version Jan 19, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Enhancing Point Cloud Completion with Fine-Grained Geometric Perception

MinkUNeXt-SI: Improving point cloud-based place recognition including spherical coordinates and LiDAR intensity

CSM-DETR: Construction Site Monitoring via Mamba-Enhanced Detection Transformer for UAV Aerial Imagery