Loading [MathJax]/extensions/MathMenu.js
Structure Aware Single-Stage 3D Object Detection From Point Cloud | IEEE Conference Publication | IEEE Xplore

Structure Aware Single-Stage 3D Object Detection From Point Cloud


Abstract:

3D object detection from point cloud data plays an essential role in autonomous driving. Current single-stage detectors are efficient by progressively downscaling the 3D ...Show More

Abstract:

3D object detection from point cloud data plays an essential role in autonomous driving. Current single-stage detectors are efficient by progressively downscaling the 3D point clouds in a fully convolutional manner. However, the downscaled features inevitably lose spatial information and cannot make full use of the structure information of 3D point cloud, degrading their localization precision. In this work, we propose to improve the localization precision of single-stage detectors by explicitly leveraging the structure information of 3D point cloud. Specifically, we design an auxiliary network which converts the convolutional features in the backbone network back to point-level representations. The auxiliary network is jointly optimized, by two point-level supervisions, to guide the convolutional features in the backbone network to be aware of the object structure. The auxiliary network can be detached after training and therefore introduces no extra computation in the inference stage. Besides, considering that single-stage detectors suffer from the discordance between the predicted bounding boxes and corresponding classification confidences, we develop an efficient part-sensitive warping operation to align the confidences to the predicted bounding boxes. Our proposed detector ranks at the top of KITTI 3D/BEV detection leaderboards and runs at 25 FPS for inference.
Date of Conference: 13-19 June 2020
Date Added to IEEE Xplore: 05 August 2020
ISBN Information:

ISSN Information:

Conference Location: Seattle, WA, USA

1. Introduction

3D object detection from point cloud data is a key component in Autonomous Vehicle (AV) system. Unlike the ordinary 2D object detection which only estimates 2D bounding box from an image plane, AV requires to estimate a more informative 3D bounding box from the real world to fulfill the high-level tasks like path planning and collision avoidance. This motivates the recently emerged 3D object detection approaches which apply the convolutional neural network (CNN) to process more representative point cloud data from a high-end LiDAR sensor.

Predicted bounding boxes from sparse 3D point cloud by (a) the representative single-stage detector SECOND [25] and (b) our single-stage method guided by auxiliary tasks and point-level supervisions. The object points, ground-truth box, center points predicted by the auxiliary network and the final detection results are shown in green, white, yellow and red colors, respectively.

Contact IEEE to Subscribe

References

References is not available for this document.