Multiscale Region-Based Convolutional Neural Networks for 3D Object Detection with LiDAR Sensors

Read the full article See related articles

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

LiDAR-based 3D object detection is essential for autonomous driving vehicles under poor lighting conditions. With LiDAR data, point cloud technologies have become increasingly important, as LiDAR sensors are largely cost down. However, the sparsity of point cloud poses a challenge for 3D object detection, requiring advancements in sparse convolutional networks. Given that the multiscale feature fusion mechanism can improve object detection performance using rich information across scale features, we added a refinement fusion network with cross-attention modules to existing 3D voxel-based object detection networks. We also employed a realistic strategy to refine existing point cloud data augmentation techniques to enable the trained detection networks to achieve substantially improved results. The experimental results demonstrate the effectiveness of our proposed detection system across three categories on the KITTI dataset. These enhancements address the limitations of current approaches and highlight the superior performance of the proposed system.

Article activity feed