Multiscale Region-Based Convolutional Neural Networks for 3D Object Detection with LiDAR Sensors
Abstract
LiDAR-based 3D object detection is essential for autonomous vehicles operating under poor lighting conditions. Point cloud techniques have become increasingly important as the cost of LiDAR sensors has fallen substantially. However, the sparsity of point clouds poses a challenge for 3D object detection and calls for advances in sparse convolutional networks. Because multiscale feature fusion can improve detection performance by exploiting the rich information in features at different scales, we add a refinement fusion network with cross-attention modules to existing voxel-based 3D object detection networks. We also refine existing point cloud data augmentation techniques with a more realistic strategy, which enables the trained detection networks to achieve substantially improved results. Experimental results demonstrate the effectiveness of the proposed detection system across three categories on the KITTI dataset. These enhancements address limitations of current approaches and highlight the superior performance of the proposed system.