Loading web-font TeX/Math/Italic
Real-Time 3-D Semantic Scene Parsing With LiDAR Sensors | IEEE Journals & Magazine | IEEE Xplore

Real-Time 3-D Semantic Scene Parsing With LiDAR Sensors


Abstract:

This article proposes a novel deep-learning framework, called RSSP, for real-time 3-D scene understanding with LiDAR sensors. To this end, we introduce new sparse strided...Show More

Abstract:

This article proposes a novel deep-learning framework, called RSSP, for real-time 3-D scene understanding with LiDAR sensors. To this end, we introduce new sparse strided operations based on the sparse tensor representation of point clouds. Compared with conventional convolution operations, the time and space complexity of our sparse strided operations are proportional to the number of occupied voxels {N} rather than the input spatial size {r} ^{3} (often N \ll r 3 for LiDAR data). This enables our method to process point clouds at high resolutions (e.g., 20483) with a high speed (130 ms for classifying a single frame from Velodyne HDL-64). The main structure includes a CNN model built upon our sparse strided operations and a conditional random field (CRF) model to impose spatial consistency on the final predictions. A highly parallel implementation of our system is presented for both CPU-GPU and CPU-only environments. The efficiency and effectiveness of our approach are demonstrated on two public datasets (Semantic3D.net and KITTI). The experimental results and benchmark tests show that our system can be effectively applied for online 3-D data analyses with comparable or better accuracy than the state-of-the-art methods.
Published in: IEEE Transactions on Cybernetics ( Volume: 52, Issue: 3, March 2022)
Page(s): 1351 - 1363
Date of Publication: 20 April 2020

ISSN Information:

PubMed ID: 32310814

Funding Agency:


I. Introduction

These days, LiDAR sensors have become the standard equipment on mobile robotic platforms, making point clouds of millions or even hundreds of millions of points a common place. The ability to handle such large-scale point clouds in real time is critical to many robotic tasks, such as localization [1], object detection [2], and navigation [3]. This becomes even more challenging for robots with very limited computational resources.

Contact IEEE to Subscribe

References

References is not available for this document.